Heracles: Bridging Precise Tracking and Generative Synthesis for General Humanoid Control

์ €์ž: Zelin Tao, Zeran Su, Peiran Liu, Jingkai Sun, Wenqiang Que, Jiahao Ma, Jialin Yu, Jiahang Cao, Pihai Sun, Hao Liang, Gang Han, Wen Zhao, Zhiyuan Xu, Jian Tang, Qiang Zhang, Yijie Guo | ๋‚ ์งœ: 2026-03-31 | DOI: 10.48550/arXiv.2603.27756 📄 PDF


Essence

Figure 1

Figure 1: Heracles synthesizes diverse, anthropomorphic recovery motions via state-conditioned diffusion. In

Heracles๋Š” state-conditioned diffusion ๋ฏธ๋“ค์›จ์–ด๋ฅผ ํ†ตํ•ด ์ •๋ฐ€ํ•œ ๋ชจ์…˜ ์ถ”์ ๊ณผ ์ƒ์„ฑ์  ์ ์‘์„ ํ†ตํ•ฉํ•˜์—ฌ ํœด๋จธ๋…ธ์ด๋“œ ๋กœ๋ด‡์ด ๊ทน๋‹จ์ ์ธ ์™ธ๋ถ€ ๊ต๋ž€ ์ƒํ™ฉ์—์„œ๋„ ์ž์—ฐ์Šค๋Ÿฌ์šด ๋ณต๊ตฌ ๋™์ž‘์„ ์ˆ˜ํ–‰ํ•˜๋„๋ก ํ•œ๋‹ค.

Motivation

Achievement

Figure 1

Figure 1: Heracles synthesizes diverse, anthropomorphic recovery motions via state-conditioned diffusion. In

How

Figure 2

Figure 2: Overview of the Heracles framework. (a) A flow matching model ^๐ท๐œƒlearns to synthesize feasible keyframe

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: Heracles๋Š” state-conditioned diffusion์„ ํ™œ์šฉํ•œ ํ˜์‹ ์ ์ธ ์ œ์–ด ๋ฏธ๋“ค์›จ์–ด๋ฅผ ์ œ์‹œํ•˜์—ฌ ํœด๋จธ๋…ธ์ด๋“œ ๋กœ๋ด‡์˜ ์ •๋ฐ€ ์ถ”์ ๊ณผ ์ƒ์„ฑ์  ์ ์‘์„ฑ์˜ ์˜ค๋ž˜๋œ ๋”œ๋ ˆ๋งˆ๋ฅผ ์šฐ์•„ํ•˜๊ฒŒ ํ•ด๊ฒฐํ•˜๋ฉฐ, ๋ฌผ๋ฆฌ์  ๋กœ๋ด‡ ์‹คํ—˜์„ ํ†ตํ•œ ๊ฐ•๊ฑดํ•œ ์„ฑ๋Šฅ ๊ฒ€์ฆ์œผ๋กœ ์‹ค์งˆ์  ๊ฐ€์น˜๋ฅผ ์ž…์ฆํ•œ๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •