Humanoid Hanoi: Investigating Shared Whole-Body Control for Skill-Based Box Rearrangement

์ €์ž: Minku Kim, Kuan-Chia Chen, Aayam Shrestha, Li Fuxin, Stefan Lee, Alan Fern | ๋‚ ์งœ: 2026-02-23 | DOI: 10.48550/arXiv.2602.13850 📄 PDF


Essence

Figure 2

Fig. 2: Independently trained high-level skills generate task-level commands that are executed through a shared, task-ag

ํœด๋จธ๋…ธ์ด๋“œ ๋กœ๋ด‡์˜ ์žฅ๊ธฐ ๋ฐ•์Šค ์žฌ๋ฐฐ์—ด ์ž‘์—…์„ ์œ„ํ•ด ๊ณต์œ ๋œ task-agnostic WBC๋ฅผ ํ†ตํ•ด ์žฌ์‚ฌ์šฉ ๊ฐ€๋Šฅํ•œ ์Šคํ‚ฌ๋“ค์„ ์กฐํ•ฉํ•˜๋Š” skill-based framework๋ฅผ ์ œ์•ˆํ•˜๊ณ , ๋ถ„ํฌ ์ด๋™์œผ๋กœ ์ธํ•œ ๊ฐ•๊ฑด์„ฑ ์ €ํ•˜๋ฅผ ๋ฐ์ดํ„ฐ ์ง‘๊ณ„๋ฅผ ํ†ตํ•ด ํ•ด๊ฒฐํ•œ๋‹ค.

Motivation

Achievement

Figure 4

Fig. 4: Cumulative task success rates for the Humanoid Hanoi benchmark. Each move consists of four skills: GoTo, Pickup,

How

Figure 2

Fig. 2: Independently trained high-level skills generate task-level commands that are executed through a shared, task-ag

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ๋ณธ ๋…ผ๋ฌธ์€ ๊ณต์œ  WBC๋ฅผ ํ†ตํ•œ ๋ชจ๋“ˆ์‹ ์Šคํ‚ฌ ์กฐํ•ฉ ์•„ํ‚คํ…์ฒ˜์˜ systematic exploration๊ณผ ๋ฐ์ดํ„ฐ ์ง‘๊ณ„ ๊ธฐ๋ฐ˜ robustness ๊ฐœ์„ ์ด๋ผ๋Š” ์‹ค์šฉ์  ๊ธฐ์—ฌ๋ฅผ ์ œ์‹œํ•˜๋ฉฐ, Humanoid Hanoi ๋ฒค์น˜๋งˆํฌ๋ฅผ ํ†ตํ•ด long-horizon ์žฅ๊ธฐ ์ž์œจ ์‹คํ–‰์˜ ๊ฐ€๋Šฅ์„ฑ์„ ์ž…์ฆํ•œ๋‹ค. ๋‹ค๋งŒ high-level planning, ๊ณ„์‚ฐ scalability, sim-to-real gap์— ๋Œ€ํ•œ ์‹ฌํ™” ๋ถ„์„์€ ๋ถ€์กฑํ•˜๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •