Learning Whole-Body Human-Humanoid Interaction from Human-Human Demonstrations

Authors: Wei-Jin Huang, Yue-Yi Zhang, Yi-Lin Wei, Zhi-Wei Xia, Juantao Tan, Yuan-Ming Li, Zhilin Zhao, Wei-Shi Zheng | Date: 2026-01-14 | DOI: 10.48550/arXiv.2601.09518


Essence

Figure 2. PAIR preserves physical consistency where naive meth-

The paper proposes PAIR, which converts human-human interaction (HHI) data into human-humanoid interaction (HHoI) data while preserving physical consistency, and D-STAR, a policy that gains interactive understanding by decoupling temporal intent from spatial selection.

Motivation

Achievement

Figure 1. From HHI to HHoI with simulation and real-robot results. Left: PAIR (Physics-Aware Interaction Retargeting) co

How

Figure 3. PAIR preserves contact semantics and physical consis-

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 | Technical Soundness: 3/5 | Significance: 4/5 | Clarity: 4/5 | Overall: 4/5

Overall assessment: This paper systematically addresses the problem of converting HHI data to HHoI data from the standpoint of physical consistency, and presents an innovative approach that substantially improves the responsiveness of the interaction policy through spatio-temporal decoupling. It demonstrates practicality through simulation and real-robot validation, but it still needs to be extended to more diverse interaction scenarios and platforms.
