BeyondMimic: From Motion Tracking to Versatile Humanoid Control via Guided Diffusion

์ €์ž: Qiayuan Liao, Takara E. Truong, Xiaoyu Huang, Yuman Gao, Guy Tevet, Koushil Sreenath, C. Karen Liu | ๋‚ ์งœ: 2025-08-11 | URL: https://arxiv.org/abs/2508.08241 📄 PDF


Essence

Figure 1

Figure 1: Overview of the proposed versatile humanoid control framework. (A) Scalable

BeyondMimic์€ ์ธ๊ฐ„ ๋ชจ์…˜ ๋ฐ์ดํ„ฐ๋กœ๋ถ€ํ„ฐ ํ•™์Šตํ•œ compact motion-tracking ๊ณต์‹๊ณผ classifier guidance๋ฅผ ํ™œ์šฉํ•œ diffusion model์„ ๊ฒฐํ•ฉํ•˜์—ฌ, ํœด๋จธ๋…ธ์ด๋“œ ๋กœ๋ด‡์ด ํ•™์Šต ์ค‘ ๋ณด์ง€ ๋ชปํ•œ ๋‹ค์–‘ํ•œ ์ž‘์—…์„ zero-shot์œผ๋กœ ์ˆ˜ํ–‰ํ•  ์ˆ˜ ์žˆ๋Š” ํ†ตํ•ฉ ์ œ์–ด ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์‹œํ•œ๋‹ค.

Motivation

Achievement

Figure 2

Figure 2: Diverse motion tracking policies deployed on a real humanoid robot. We demonstrate

How

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: BeyondMimic์€ motion tracking RL์˜ ๋ฏผ์ฒฉ์„ฑ๊ณผ diffusion ๋ชจ๋ธ์˜ ์œ ์—ฐ์„ฑ์„ ํšจ๊ณผ์ ์œผ๋กœ ๊ฒฐํ•ฉํ•˜์—ฌ, ํœด๋จธ๋…ธ์ด๋“œ ๋กœ๋ด‡ ์ œ์–ด์˜ ์žฅ๊ธฐ์  ๊ณผ์ œ์ธ ์ž์—ฐ์Šค๋Ÿฌ์›€, ๋ฏผ์ฒฉ์„ฑ, versatility๋ฅผ ๋™์‹œ์— ๋‹ฌ์„ฑํ•˜๋Š” ๊ฐ•๋ ฅํ•œ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์‹œํ•œ๋‹ค. ์‹ค์ œ ๋กœ๋ด‡ ๋ฐฐํฌ์™€ zero-shot task ์ผ๋ฐ˜ํ™” ์‹œ์—ฐ์€ ๋กœ๋ณดํ‹ฑ์Šค ์ปค๋ฎค๋‹ˆํ‹ฐ์— ์ƒ๋‹นํ•œ ๊ธฐ์—ฌ๋ฅผ ํ•œ๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •