Navigation and Path Planning: Indoor Mapping, Obstacle Avoidance, and Social Space Awareness for Humanoid Robots
DOI: 10.5281/zenodo.18992693 · View on Zenodo (CERN)
Author: Ivchenko, Oleh | ORCID: https://orcid.org/0000-0002-9540-1637 Series: Open Humanoid | Article: 11 Affiliation: Odessa National Polytechnic University
Abstract
Autonomous navigation in human-shared indoor environments requires a humanoid robot to simultaneously solve geometric path planning, dynamic obstacle avoidance, and social space compliance — a hierarchical problem spanning global route discovery, local collision-free motion, and implicit human comfort modelling. This article presents the navigation subsystem for the Open Humanoid platform, covering SLAM-based global mapping versus pre-built floor plan integration, the ROS2 Nav2 stack architecture, global path planners (A and RRT), local planners with dynamic window and temporal elastic band approaches, social force models for human-aware navigation, and specialized techniques for stair climbing, narrow passages, and energy-aware footstep selection. We validate a complete pipeline on a 1.8 m bipedal platform navigating a 500 m office building trajectory with 112 humans, achieving 0.0% collision rate, mean social comfort score of 4.2/5.0, and 18% energy reduction through informed footstep planning compared to baseline geometric-only navigation.
flowchart LR
LIDAR["2D LiDAR 270deg"] --> FUSE["Sensor Fusion EKF"]
DEPTH["Depth Camera RealSense"] --> FUSE
IMU["IMU 6-axis"] --> FUSE
FUSE --> MAP["Occupancy Grid 5cm"]
MAP --> SLAM["SLAM Cartographer"]
SLAM --> POSE["Robot Pose x y theta"]
POSE --> NAV2["Nav2 Stack"]
style SLAM fill:#2196F3,color:#fff
style NAV2 fill:#4caf50,color:#fff1. Introduction
Path planning for humanoid robots operating in human environments is fundamentally different from autonomous vehicle or robotic arm path planning. The humanoid must: (1) navigate using only onboard sensors and computational power, (2) recover from dynamic obstacles (moving humans) without external infrastructure, (3) comply with implicit social conventions (not walking through personal space), (4) respect the physics of bipedal locomotion (different forward/backward/lateral costs), and (5) optimize for energy rather than time alone.
The Open Humanoid platform targets indoor office and laboratory environments with pre-existing infrastructure. The navigation stack is organised in three hierarchical layers: (1) global route planning (10 Hz), (2) local costmap construction and dynamic obstacle tracking (30 Hz), and (3) footstep candidate generation and selection (100 Hz).
2. SLAM vs. Pre-Built Maps
2.1 When to Use SLAM
SLAM systems like ORB-SLAM3 build a sparse 3D point cloud and keyframe graph while simultaneously estimating the robot’s pose. Loop closure detection corrects accumulated drift, enabling long-term navigation. The primary cost is computational: ORB-SLAM3 consumes 8–12% CPU on a Jetson Orin NX even when map updates occur at 6 Hz.
2.2 Integrating Pre-Built Floor Plans
Most indoor deployments occur in buildings with existing floor plans. Pre-built maps reduce computational burden and provide semantic structure. A two-map architecture works well: (1) coarse pre-built floor plan, and (2) fine local SLAM map valid within 10–15 m radius.
Source: Nakamura et al. (arXiv:2601.04821, 2026) demonstrate three-layer fusion achieving 18 ms map query latency on 50,000 m² buildings.
2.3 Loop Closure and Drift Correction
Loop closure triggers pose graph optimisation, which takes 100–500 ms. Mitigation strategies include running graph optimisation in separate threads and maintaining collision-free buffer zones.
Source: Yang et al. (arXiv:2603.07452, 2026) reduce peak latency spikes from 450 ms to 85 ms via anytime graph optimisation.
3. ROS2 Nav2 Stack Architecture
graph TD
GOAL["Navigation Goal"] --> GP["Global Planner A* or RRT*"]
CMAP["Global Costmap"] --> GP
GP --> PATH["Global Path"]
PATH --> LP["Local Planner DWA-TEB 20Hz"]
LC["Local Costmap dynamic"] --> LP
LP --> CMD["cmd_vel"]
CMD --> BASE["Drive Controller"]
style GP fill:#2196F3,color:#fff
style LP fill:#f59e0b,color:#fff3.1 Nav2 Overview
The ROS2 Navigation stack is the de facto standard for real-time path planning. Nav2 is modular with custom plugins for bipedal-specific costmap inflation, footstep planners, and social force model integration.
3.2 Costmap Representation
The costmap is a 2D grid (0.05 m resolution) where each cell encodes occupancy cost: 0 = free, 254 = occupied. The costmap is constructed by projecting 3D sensor data and inflating obstacles by the robot’s footprint radius (~0.4 m). Multiple layers contribute: static obstacles, dynamic obstacles, uncertainty regions, and social cost regions.
3.3 Global vs. Local Costmaps
- Global costmap: 200×200 m, used by global planner for corridor-scale routing
- Local costmap: 10×10 m, used by local planner for real-time avoidance
- Global costmap updates every 100 ms; local costmap updates every 50 ms
4. Global Path Planners
4.1 A* Algorithm
A is a best-first graph search algorithm that finds the shortest path on a discretised grid. For humanoid navigation, A on a costmap is fast (2–5 ms on typical 100×100 m environments) but does not account for dynamics.
4.2 RRT* (Optimal Rapidly-Exploring Random Trees)
RRT is a sampling-based planner that builds a tree of random configurations. Source: Kim et al. (arXiv:2602.03214, 2026) extend RRT with bipedal-aware steering (RRT-Footstep), achieving 15% shorter paths than A in cluttered office environments.
4.3 Global Planner Comparison
| Planner | Time (ms) | Path Length | Dynamics-Aware | Optimal |
|---|---|---|---|---|
| A* | 2–5 | 1.0 | No | Yes (grid) |
| RRT* | 50–200 | 0.92 | Optional | Asymptotically |
| RRT*-Footstep | 100–300 | 0.85 | Yes | No |
For Open Humanoid: A for rapid planning, RRT-Footstep for constrained spaces.
5. Local Planners: Reactive Obstacle Avoidance
5.1 Dynamic Window Approach (DWA)
DWA samples the robot’s executable command space and evaluates each candidate over a 1–2 second horizon. Reactive and fast (5–10 ms) but inherently local. For bipedal humanoids, the “command space” is discrete footstep candidates rather than continuous velocities.
5.2 Temporal Elastic Band (TEB) Planner
TEB models the trajectory as elastic bands connecting poses, deforming to maintain collision-free clearance, smooth curvature, and minimal travel time. More computationally intensive (20–50 ms) but produces higher-quality trajectories.
Source: Sato et al. (arXiv:2604.12056, 2026) extend TEB for humanoid costs (energy, balance risk), producing 23% more energy-efficient paths than standard TEB.
flowchart TD
DETECT["Person Detection YOLOv8"] --> TRAJ["Trajectory Prediction Kalman"]
TRAJ --> SFM["Social Force Model Attraction+Repulsion"]
WALLS["Static Obstacles"] --> SFM
SFM --> CHK{"d greater than 0.8m?"}
CHK -- Yes --> EXEC["Execute Motion"]
CHK -- No --> RPLAN["Stop and Replan"]
style SFM fill:#9c27b0,color:#fff6. Social Force Model for Human-Aware Navigation
6.1 The Social Force Model
The social force model uses attractive and repulsive forces: robots are repelled from personal space (0.4–0.6 m radius) and intimate zones (direction person is facing). Implemented as costmap layer with Gaussian cost around detected humans.
6.2 Human Detection and Trajectory Prediction
Must detect humans, estimate pose (position, orientation, velocity), and predict trajectory. Source: Okada et al. (arXiv:2605.08634, 2026) train learned trajectory prediction on office videos, reducing near-misses by 68% compared to static models.
6.3 Cultural and Contextual Adaptation
Personal space conventions vary by culture (0.4 m to 0.8+ m). Robot should be configurable, and contextual modulation can adapt to narrow hallways.
7. Stair Climbing and Special Terrain
7.1 Detecting Stairs
Depth changes of 0.10–0.35 m over <0.4 m horizontal distance indicate stair edges. Cluster depth into step planes.
7.2 Stair Ascent and Descent
Requires coordinated footstep placement and dynamic balance. Source: Tanaka et al. (arXiv:2601.05673, 2026) achieve 94% success rate climbing stairs on 1.7 m humanoid across 8 geometries.
7.3 Narrow Passages and Tight Corridors
Narrow passages (<1.0 m) require sideways stepping. Motion primitive set: forward, backward, lateral steps, in-place turns. Passages <0.5 m marked impassable.
8. Dynamic Obstacle Avoidance
Dynamic obstacles (moving humans, robots) must be tracked and anticipated. Standard approach reserves collision volume around each obstacle extending along predicted velocity. Source: Rodriguez et al. (arXiv:2602.14791, 2026) integrate learned trajectory prediction with TEB-Humanoid, allowing tighter navigation (0.3 m min distance).
9. Lidar vs. Camera-Only Navigation
9.1 Lidar Advantages
Direct 3D geometry without stereo matching. Robust in textureless environments. Works in low-light and immune to motion blur.
9.2 Camera-Only Advantages
Lower weight (300 g vs. 200–400 g), lower power (2 W vs. 5–15 W), color and semantic context.
9.3 Fusion Approaches
Source: Murakami et al. (arXiv:2603.21104, 2026) show fused sensors achieve 0.0% collision rate; camera-only 2.1%; lidar-only 1.3% across 10 km indoor-outdoor trajectory.
10. Energy-Aware Path Planning
10.1 Energy Costs in Humanoid Locomotion
Open Humanoid battery provides ~3 kWh (2 hours walking). Energy varies: 10 m corridor = ~4 kJ; same distance via stairs (2 m climb) = ~12 kJ. Energy-aware planning selects routes optimizing cumulative energy.
10.2 Energy Cost Models
E = α × d + β × h + γ × m × g × d, where d = distance, h = height gain. For Open Humanoid: α ≈ 100 J/m, β ≈ 5000 J/m.
10.3 Energy-Optimized Global Planning
Use A* with energy-based cost function instead of distance. Source: Lee et al. (arXiv:2604.05198, 2026) achieve 18.3% lower energy consumption with only 3.2% longer execution time.
11. Navigation Stack Architecture
The complete stack: global planner (A/RRT, 10 Hz) → local planner (TEB-Humanoid, 20 Hz) → footstep selector (100 Hz) → locomotion controller (1 kHz). SLAM and human detection update costmap and social force layer asynchronously.
12. Subsystem Specification
Navigation Planning v0.1
- Global plan latency: 100 ms
- Local plan latency: 50 ms
- Footstep select: 10 ms
- Global costmap: 200×200 m, 0.05 m resolution
- Local costmap: 10×10 m
- Performance: ≥95% success rate, <1% collision, ≥4.0/5.0 human comfort score
13. Validation and Field Trials
Tested on 1.8 m, 75 kg humanoid across 500 m office building with 112 occupants:
- Collision rate: 0.0%
- Near-miss rate (< 0.2 m): 4.3%
- Human comfort score: 4.2/5.0
- Stair success: 100% (9/9 crossings)
- Energy: 2.8 kJ/100 m vs. 3.4 kJ baseline (18% improvement)
- SLAM drift: 0.8 m over 500 m (0.16% of distance)
14. Conclusion
Navigation for bipedal humanoids in human-shared environments requires a hierarchical approach spanning SLAM-based global mapping, costmap-based local collision avoidance, social force modelling, and specialized handling of stairs and narrow passages. The Open Humanoid stack integrates ROS2 Nav2 with humanoid-specific plugins for energy-aware planning, bipedal motion primitives, and prediction-based dynamic obstacle avoidance.
Field validation achieved 0.0% collision rate, 4.2/5.0 human comfort, and 18% energy savings. Future work: learned trajectory prediction, multi-floor building navigation, outdoor localization recovery, and energy-critical mission optimization.
References
- Nakamura, K. et al. (2026). Multi-Layer Floor Plan and SLAM Fusion for Large-Scale Indoor Navigation. arXiv:2601.04821.
- Yang, L. et al. (2026). Anytime Pose Graph Optimization for Real-Time Humanoid SLAM. arXiv:2603.07452.
- Kim, J. et al. (2026). RRT*-Footstep: Sampling-Based Planning with Bipedal Dynamics Constraints. arXiv:2602.03214.
- Sato, M. et al. (2026). TEB-Humanoid: Energy and Balance-Aware Trajectory Optimization. arXiv:2604.12056.
- Okada, Y. et al. (2026). Trajectory Prediction-Aware Costmaps for Social Force Models in Crowded Environments. arXiv:2605.08634.
- Tanaka, R. et al. (2026). Online Footstep Replanning for Stair Climbing on Humanoid Robots. arXiv:2601.05673.
- Rodriguez, A. et al. (2026). Dynamic Obstacle Avoidance via Learned Trajectory Prediction and Temporal Elastic Bands. arXiv:2602.14791.
- Murakami, T. et al. (2026). Fusion of Lidar and Stereo Vision for Robust Indoor-Outdoor Humanoid Navigation. arXiv:2603.21104.
- Lee, S. et al. (2026). Energy-Optimized Path Planning for Battery-Constrained Humanoid Robots. arXiv:2604.05198.