Learning Humanoid Navigation from Human Data

์ €์ž: Weizhuo Wang, Yanjie Ze, C. Karen Liu, Monroe Kennedy | ๋‚ ์งœ: 2026-04-01 | URL: https://arxiv.org/abs/2604.00416 📄 PDF


Essence

Figure 2

Fig. 2. Overview of the proposed method: A rolling buffer of 32 segmented

๋ณธ ๋…ผ๋ฌธ์€ ์ธ๊ฐ„์˜ ๋ณดํ–‰ ๋ฐ์ดํ„ฐ 5์‹œ๊ฐ„๋งŒ์„ ํ™œ์šฉํ•˜์—ฌ ํœด๋จธ๋…ธ์ด๋“œ ๋กœ๋ด‡์ด ๋ฏธ์ง€์˜ ํ™˜๊ฒฝ์—์„œ ์ž์œจ์ ์œผ๋กœ ๋‚ด๋น„๊ฒŒ์ด์…˜ํ•  ์ˆ˜ ์žˆ๋Š” EgoNav ์‹œ์Šคํ…œ์„ ์ œ์•ˆํ•œ๋‹ค. ๋กœ๋ด‡ ๋ฐ์ดํ„ฐ ์—†์ด ์ˆœ์ˆ˜ ์ธ๊ฐ„ ๋ฐ์ดํ„ฐ๋งŒ์œผ๋กœ ํ•™์Šตํ•œ ๋ชจ๋ธ์„ Unitree G1 ํœด๋จธ๋…ธ์ด๋“œ์— ์ œ๋กœ์ƒท ๋ฐฐํฌํ•˜์—ฌ ์‹ค์ œ ํ™˜๊ฒฝ์—์„œ์˜ ํšจ๊ณผ๋ฅผ ์ž…์ฆํ•œ๋‹ค.

Motivation

Achievement

Figure 1

Fig. 1.

์˜คํ”„๋ผ์ธ ํ‰๊ฐ€: baseline ๋Œ€๋น„ ์ถฉ๋Œ ํšŒํ”ผ ๋ฐ ๋‹ค์ค‘ ๋ชจ๋‹ฌ ์ปค๋ฒ„๋ฆฌ์ง€ ์šฐ์ˆ˜์„ฑ ์ž…์ฆ ์‹ค์ œ ๋ฐฐํฌ: Unitree G1 ํœด๋จธ๋…ธ์ด๋“œ์—์„œ ๋ฌธ์—ด๋ฆผ ๋Œ€๊ธฐ, ๊ตฐ์ค‘ ํšŒํ”ผ, ์œ ๋ฆฌ๋ฒฝ ํšŒํ”ผ ๋“ฑ์˜ ํ–‰๋™์ด ์ž๋™ ํ•™์Šต๋˜์–ด ๋ฏธ์ง€์˜ ์‹ค๋‚ดยท์™ธ๋ถ€ ํ™˜๊ฒฝ์—์„œ ์ œ๋กœ์ƒท ๋ฐฐํฌ ์„ฑ๊ณต ๋ชจ๋ธ ๊ณต๊ฐœ: ํ•™์Šต๋œ ๋ชจ๋ธ๊ณผ ๋ฐ์ดํ„ฐ์…‹ ๊ณต๊ฐœ ์˜ˆ์ •

How

Figure 2

Fig. 2. Overview of the proposed method: A rolling buffer of 32 segmented

Originality

Limitation & Further Study

๊ธฐ์ˆ ์  ํ•œ๊ณ„: 5์‹œ๊ฐ„์˜ ์ œํ•œ๋œ ์ธ๊ฐ„ ๋ฐ์ดํ„ฐ๋กœ ํ•™์Šต๋˜์–ด ๊ทน๋„๋กœ ๋ณต์žกํ•œ ํ™˜๊ฒฝ์˜ ์ผ๋ฐ˜ํ™” ์„ฑ๋Šฅ ๋ฏธ๊ฒ€์ฆ, hybrid ์ƒ˜ํ”Œ๋ง์˜ 10 ์Šคํ…์ด ์—ฌ์ „ํžˆ ์‹ค์‹œ๊ฐ„ ์š”๊ตฌ์‚ฌํ•ญ์„ ์™„์ „ํžˆ ์ถฉ์กฑํ•˜์ง€ ๋ชปํ•  ๊ฐ€๋Šฅ์„ฑ, visual memory์˜ ๋ฉ”๋ชจ๋ฆฌ ํšจ์œจ์„ฑ ๋ฏธ๋ถ„์„ ํ›„์† ์—ฐ๊ตฌ: ๋” ๋Œ€๊ทœ๋ชจ ์ธ๊ฐ„ ๋ฐ์ดํ„ฐ ํ™•๋ณด ์‹œ ์„ฑ๋Šฅ ํ–ฅ์ƒ ๊ฒ€์ฆ, ๋‹ค์–‘ํ•œ ๋กœ๋ด‡ ํ”Œ๋žซํผ์œผ๋กœ์˜ ์ „์ด ๊ฐ€๋Šฅ์„ฑ ํ™•์ธ, ๊ทนํ•œ ๋‚ ์”จ๋‚˜ ๋งค์šฐ ํ˜ผ์žกํ•œ ํ™˜๊ฒฝ์—์„œ์˜ ๊ฒฌ๊ณ ์„ฑ ํ‰๊ฐ€

Evaluation

Novelty: 4/5 Technical Soundness: 4/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: EgoNav๋Š” ์ธ๊ฐ„ ๋ณดํ–‰ ๋ฐ์ดํ„ฐ๋งŒ์œผ๋กœ ํœด๋จธ๋…ธ์ด๋“œ ๋กœ๋ด‡ ๋‚ด๋น„๊ฒŒ์ด์…˜์„ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•˜๋Š” ํ˜์‹ ์  ์ ‘๊ทผ์„ ์ œ์‹œํ•˜๋ฉฐ, diffusion model ๊ธฐ๋ฐ˜ ๋‹ค์ค‘ ๋ชจ๋‹ฌ ๊ถค์  ์ƒ์„ฑ๊ณผ ์‹ค์‹œ๊ฐ„ ์ถ”๋ก ์˜ ๊ฒฐํ•ฉ, ์‹ค์ œ ๋ฏธ์ง€ ํ™˜๊ฒฝ์—์„œ์˜ ์ œ๋กœ์ƒท ๋ฐฐํฌ ์„ฑ๊ณต์€ ๋กœ๋ด‡ ๋‚ด๋น„๊ฒŒ์ด์…˜ ๋ถ„์•ผ์— ์ƒ๋‹นํ•œ ๊ธฐ์—ฌ๋ฅผ ํ•œ๋‹ค. ๋‹ค๋งŒ ํ•™์Šต ๋ฐ์ดํ„ฐ ๊ทœ๋ชจ์™€ ๊ทนํ•œ ํ™˜๊ฒฝ ๊ฒฌ๊ณ ์„ฑ์˜ ๊ฒ€์ฆ์ด ์ถ”๊ฐ€๋˜๋ฉด ๋”์šฑ ๊ฐ•๋ ฅํ•œ ๋…ผ๋ฌธ์ด ๋  ์ˆ˜ ์žˆ๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •