HumanoidPano: Hybrid Spherical Panoramic-LiDAR Cross-Modal Perception for Humanoid Robots

์ €์ž: Qiang Zhang, Zhang Zhang, Wei Cui, Jingkai Sun, Jiahang Cao, Yijie Guo, Gang Han, Wen Zhao, Jiaxu Wang, Chenghao Sun, Lingfeng Zhang, Hao Cheng, Yujie Chen, Lin Wang, Jian Tang, Renjing Xu | ๋‚ ์งœ: 2025-03-12 | URL: https://arxiv.org/abs/2503.09010 📄 PDF


Essence

Figure 1

Figure 1. The humanoid robot autonomously navigates complex environments using HumanoidPano, which fuses panoramic visio

์ธ๊ฐ„ํ˜• ๋กœ๋ด‡์˜ ์ž์•„-ํ์ƒ‰ ๋ฐ ์ œํ•œ๋œ ์‹œ์•ผ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด ํŒŒ๋…ธ๋ผ๋งˆ ๋น„์ „๊ณผ LiDAR๋ฅผ ์œตํ•ฉํ•˜๋Š” HumanoidPano ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์•ˆํ•˜๋ฉฐ, Spherical Geometry-aware Constraints์™€ Spatial Deformable Attention์„ ํ†ตํ•ด ๊ธฐํ•˜ํ•™์ ์œผ๋กœ ์ •๋ ฌ๋œ ํฌ๋กœ์Šค๋ชจ๋‹ฌ ์ธ์‹์„ ๊ตฌํ˜„ํ•œ๋‹ค.

Motivation

Achievement

Figure 1

Figure 1. The humanoid robot autonomously navigates complex environments using HumanoidPano, which fuses panoramic visio

How

Figure 2

Figure 2. HumanoidPano Framework Overview. The system addresses panoramic image distortion using Spherical Geometric Con

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 4/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: HumanoidPano๋Š” ์ธ๊ฐ„ํ˜• ๋กœ๋ด‡์˜ ๊ณ ์œ ํ•œ ๊ตฌ์กฐ์  ์ œ์•ฝ์„ ์‹ฌ์ธต์ ์œผ๋กœ ๊ณ ๋ คํ•˜์—ฌ panoramic vision๊ณผ LiDAR๋ฅผ ๊ธฐํ•˜ํ•™์ ์œผ๋กœ ์ •๋ ฌํ•˜๋Š” ํ˜์‹ ์ ์ธ ํ”„๋ ˆ์ž„์›Œํฌ๋กœ, ์‹ค์ œ ๋กœ๋ด‡ ํ”Œ๋žซํผ์—์„œ์˜ ๊ฒ€์ฆ๊ณผ state-of-the-art ์„ฑ๋Šฅ์œผ๋กœ embodied AI ๋ถ„์•ผ์— ์ƒˆ๋กœ์šด ํŒจ๋Ÿฌ๋‹ค์ž„์„ ์ œ์‹œํ•œ๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •