Learning Perceptive Humanoid Locomotion over Challenging Terrain

์ €์ž: Wandong Sun, Baoshi Cao, Long Chen, Yongbo Su, Yang Liu, Zongwu Xie, Hong Liu | ๋‚ ์งœ: 2025-03-02 | URL: https://arxiv.org/abs/2503.00692 📄 PDF


Essence

Figure 2

Fig. 2: Training of Humanoid Perception Controller consists of two stages: (1) Oracle Policy Training generates referenc

์ธ๊ฐ„ํ˜• ๋กœ๋ด‡์ด ์†Œ์Œ์ด ์žˆ๋Š” ์„ผ์„œ ๋ฐ์ดํ„ฐ๋กœ๋ถ€ํ„ฐ ์ง€ํ˜•์„ ์ธ์‹ํ•˜๊ณ  ๊ฑฐ์นœ ์ง€ํ˜•์„ ์•ˆ์ •์ ์œผ๋กœ ๋ณดํ–‰ํ•  ์ˆ˜ ์žˆ๋„๋ก, teacher-student distillation๊ณผ variational information bottleneck์„ ๊ฒฐํ•ฉํ•œ ์„ธ๊ณ„ ๋ชจ๋ธ ๊ธฐ๋ฐ˜ ๋ฐฉ๋ฒ•์„ ์ œ์•ˆํ•œ๋‹ค.

Motivation

Achievement

Figure 1

Fig. 1: Deployment to outdoor environments. We deployed the model in outdoor challenging terrains. Our controller can

How

Figure 2

Fig. 2: Training of Humanoid Perception Controller consists of two stages: (1) Oracle Policy Training generates referenc

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ๋ณธ ๋…ผ๋ฌธ์€ teacher-student distillation๊ณผ world model ๊ธฐ๋ฐ˜ ์„ผ์„œ ๋””๋…ธ์ด์ง•์„ ํšจ๊ณผ์ ์œผ๋กœ ๊ฒฐํ•ฉํ•˜์—ฌ ์ธ๊ฐ„ํ˜• ๋กœ๋ด‡์˜ ์‹ค์ œ ํ™˜๊ฒฝ ๋ณดํ–‰ ์„ฑ๋Šฅ์„ ํฌ๊ฒŒ ํ–ฅ์ƒ์‹œ์ผฐ๋‹ค. 2 km์˜ ๋‹ค์–‘ํ•œ ์ง€ํ˜• ํšก๋‹จ ์„ฑ๊ณผ์™€ ์ฒด๊ณ„์ ์ธ ๋ฐฉ๋ฒ•๋ก ์€ ๋†’์€ ๊ธฐ์ˆ ์  ๊ฐ€์น˜๋ฅผ ๊ฐ€์ง€๋ฉฐ, ์‹ค์ œ ๋กœ๋ด‡ ๋ฐฐํฌ๋ฅผ ์œ„ํ•œ ์ค‘์š”ํ•œ ์ง„์ „์„ ๋ณด์—ฌ์ค€๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •