LangWBC: Language-directed Humanoid Whole-Body Control via End-to-end Learning

์ €์ž: Yiyang Shao, Xiaoyu Huang, Bike Zhang, Qiayuan Liao, Yuman Gao, Yufeng Chi, Zhongyu Li, Sophia Shao, Koushil Sreenath | ๋‚ ์งœ: 2025-04-30 | URL: https://arxiv.org/abs/2504.21738 📄 PDF


Essence

Figure 2

Fig. 2.

์ž์—ฐ์–ธ์–ด ๋ช…๋ น์„ humanoid robot์˜ ์ „์‹  ์ œ์–ด ๋™์ž‘์œผ๋กœ ์ง์ ‘ ๋ณ€ํ™˜ํ•˜๋Š” end-to-end ํ•™์Šต ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์‹œํ•œ๋‹ค. Reinforcement learning์œผ๋กœ ํ•™์Šตํ•œ teacher policy์™€ CVAE ๊ธฐ๋ฐ˜ student policy๋ฅผ ๊ฒฐํ•ฉํ•˜์—ฌ ์–ธ์–ด-ํ–‰๋™์˜ ํ†ตํ•ฉ latent space๋ฅผ ๊ตฌ์„ฑํ•œ๋‹ค.

Motivation

Achievement

Figure 1

Fig. 1:

How

Figure 2

Fig. 2.

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ๋ณธ ๋…ผ๋ฌธ์€ humanoid ์ „์‹  ์ œ์–ด์˜ ์˜ค๋žœ ๋‚œ์ œ์ธ ์–ธ์–ด-ํ–‰๋™ ๊ฐญ์„ end-to-end learning์œผ๋กœ ์ง์ ‘ ํ•ด๊ฒฐํ•˜๋ฉฐ, CVAE ๊ธฐ๋ฐ˜์˜ unified latent space ๊ตฌ์„ฑ์œผ๋กœ ๋™์ž‘ ๋‹ค์–‘์„ฑ๊ณผ ๋ถ€๋“œ๋Ÿฌ์šด ์ „ํ™˜์„ ๋™์‹œ์— ๋‹ฌ์„ฑํ•œ ์ ์ด ์šฐ์ˆ˜ํ•˜๋‹ค. ์‹ค์ œ ๋กœ๋ด‡ ๊ฒ€์ฆ๊ณผ ๊ฐ•๊ฑด์„ฑ ์ž…์ฆ์„ ํ†ตํ•ด ํ˜„์‹ค ์ ์šฉ ๊ฐ€๋Šฅ์„ฑ์„ ๋ณด์˜€์œผ๋‚˜, ๋ฐ์ดํ„ฐ์…‹ ์˜์กด์„ฑ๊ณผ ๋‹ค์–‘ํ•œ ํ”Œ๋žซํผ ์ผ๋ฐ˜ํ™”์— ๋Œ€ํ•œ ์ถ”๊ฐ€ ๊ฒ€์ฆ์ด ํ•„์š”ํ•˜๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •