ResMimic: From General Motion Tracking to Humanoid Whole-body Loco-Manipulation via Residual Learning

์ €์ž: Siheng Zhao, Yanjie Ze, Yue Wang, C. Karen Liu, Pieter Abbeel, Guanya Shi, Rocky Duan | ๋‚ ์งœ: 2025-10-08 | DOI: 10.48550/arXiv.2510.05070 📄 PDF


Essence

Figure 3

Fig. 3: Overview of ResMimic : (1) A general motion tracking policy is trained on large-scale human motion data to serve

ResMimic๋Š” ์ผ๋ฐ˜ ๋ชจ์…˜ ์ถ”์ (GMT) ์ •์ฑ…์„ ๊ธฐ๋ฐ˜์œผ๋กœ ํšจ์œจ์ ์ธ ์ž”์ฐจ ์ •์ฑ…(residual policy)์„ ํ•™์Šตํ•˜์—ฌ ์ธ๊ฐ„ํ˜• ๋กœ๋ด‡์˜ ์ •๋ฐ€ํ•œ ์ „์‹  ์ด๋™-์กฐ์ž‘ ๋Šฅ๋ ฅ์„ ์‹คํ˜„ํ•˜๋Š” ์ด๋‹จ๊ณ„ ์ž”์ฐจํ•™์Šต ํ”„๋ ˆ์ž„์›Œํฌ์ด๋‹ค.

Motivation

Achievement

Figure 1

Fig. 1: We deploy ResMimic on a Unitree G1 humanoid to demonstrate diverse whole-body loco-manipulation capabilities.

How

Figure 3

Fig. 3: Overview of ResMimic : (1) A general motion tracking policy is trained on large-scale human motion data to serve

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ResMimic๋Š” ๋Œ€๊ทœ๋ชจ ์‚ฌ์ „ํ›ˆ๋ จ GMT ์ •์ฑ…๊ณผ ํšจ์œจ์  ์ž”์ฐจ ์ •์ฑ…์˜ ๊ฒฐํ•ฉ์œผ๋กœ ์ธ๊ฐ„ํ˜• ๋กœ๋ด‡์˜ ์ •๋ฐ€ํ•œ ์ „์‹  ์ด๋™-์กฐ์ž‘์„ ์‹คํ˜„ํ•œ ํ˜์‹ ์  ํ”„๋ ˆ์ž„์›Œํฌ์ด๋ฉฐ, ๋งž์ถคํ˜• ๋ณด์ƒ ์„ค๊ณ„์™€ ๊ด‘๋ฒ”์œ„ํ•œ ์‹ค์ฆ์œผ๋กœ ์ธ๊ฐ„ํ˜• ๋กœ๋ด‡ ์ œ์–ด ๋ถ„์•ผ์— ์ค‘์š”ํ•œ ๊ธฐ์—ฌ๋ฅผ ํ•œ๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •