RobotDancing: Residual-Action Reinforcement Learning Enables Robust Long-Horizon Humanoid Motion Tracking

์ €์ž: Zhenguo Sun, Yibo Peng, Yuan Meng, Xukun Li, Bo-Sheng Huang, Zhenshan Bing, Xinlong Wang, Alois Knoll | ๋‚ ์งœ: 2025-09-25 | URL: https://arxiv.org/abs/2509.20717 📄 PDF


Essence

Figure 1

Fig. 1.

RobotDancing์€ ์ž”์ฐจ ๋™์ž‘(residual action) ๊ฐ•ํ™”ํ•™์Šต์„ ํ†ตํ•ด ์ธ๊ฐ„ํ˜• ๋กœ๋ด‡์ด ์žฅ๊ธฐ๊ฐ„ ๊ณ ์—ญ๋™ ์ถค ๋™์ž‘์„ ์ถ”์ ํ•  ์ˆ˜ ์žˆ๋„๋ก ํ•˜๋Š” ํ”„๋ ˆ์ž„์›Œํฌ๋กœ, ๋ชจ๋ธ-์‹ค์ œ ๊ฐ„์˜ ๋™์—ญํ•™ ๋ถˆ์ผ์น˜๋ฅผ ๋ช…์‹œ์ ์œผ๋กœ ๋ณด์ •ํ•œ๋‹ค.

Motivation

Achievement

Figure 1

Fig. 1.

How

Figure 1

Fig. 1.

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: RobotDancing์€ ์ž”์ฐจ ๋™์ž‘ ํ•™์Šต๊ณผ ์ด์› ์ƒ˜ํ”Œ๋ง ์ „๋žต์„ ํ†ตํ•ด ์ธ๊ฐ„ํ˜• ๋กœ๋ด‡์˜ ์žฅ๊ธฐ ๊ณ ์—ญ๋™ ๋ชจ์…˜ ์ถ”์  ๋ฌธ์ œ๋ฅผ ์šฐ์•„ํ•˜๊ฒŒ ํ•ด๊ฒฐํ•˜๋ฉฐ, ์‹ค์ œ ๋กœ๋ด‡์œผ๋กœ์˜ ์˜์  ์ „๋‹ฌ ์„ฑ๊ณต์€ ์‹ค๋ฌด์  ๊ฐ€์น˜๊ฐ€ ๋†’๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •