Humanoid Policy ~ Human Policy

์ €์ž: Ri-Zhao Qiu, Shiqi Yang, Xuxin Cheng, Chaitanya Chawla, Jialong Li, Tairan He, Ge Yan, David J. Yoon, Ryan Hoque, Lars Paulsen, Ge Yang, Jian Zhang, Sha Yi, Guanya Shi, Xiaolong Wang | ๋‚ ์งœ: 2025-03-17 | URL: https://arxiv.org/abs/2503.13441 📄 PDF


Essence

Figure 3

Figure 3: Overview of HAT. Human Action Transformer (HAT) learns a robot policy by modeling

์ด ๋…ผ๋ฌธ์€ humanoid ๋กœ๋ด‡์˜ ์กฐ์ž‘ ์ •์ฑ… ํ•™์Šต์— ๋Œ€๊ทœ๋ชจ egocentric human demonstration์„ ํ™œ์šฉํ•˜๋Š” cross-embodiment ํ•™์Šต ๋ฐฉ๋ฒ•์„ ์ œ์•ˆํ•œ๋‹ค. PH2D ๋ฐ์ดํ„ฐ์…‹๊ณผ Human Action Transformer (HAT)๋ฅผ ํ†ตํ•ด human๊ณผ robot ๊ฐ„์˜ embodiment gap์„ ์™„ํ™”ํ•˜๊ณ  ๋ฐ์ดํ„ฐ ์ˆ˜์ง‘ ํšจ์œจ์„ ํฌ๊ฒŒ ๊ฐœ์„ ํ•œ๋‹ค.

Motivation

Achievement

Figure 1

Figure 1: This paper advocates high-quality human data as a data source for cross-embodiment

How

Figure 3

Figure 3: Overview of HAT. Human Action Transformer (HAT) learns a robot policy by modeling

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 4/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ์ด ๋…ผ๋ฌธ์€ humanoid robot manipulation ํ•™์Šต์„ ์œ„ํ•ด ๋Œ€๊ทœ๋ชจ human data๋ฅผ ํšจ์œจ์ ์œผ๋กœ ํ™œ์šฉํ•˜๋Š” ์‹ค์šฉ์ ์ด๊ณ  ์ฐฝ์˜์ ์ธ ๋ฐฉ์•ˆ์„ ์ œ์‹œํ•œ๋‹ค. PH2D ๋ฐ์ดํ„ฐ์…‹์˜ ๊ทœ๋ชจ์™€ ํ’ˆ์งˆ, HAT์˜ unified design, ๊ทธ๋ฆฌ๊ณ  ์‹ค๋กœ๋ด‡ ๊ฒ€์ฆ์ด ๊ธฐ์—ฌ๋„ ์žˆ์œผ๋‚˜, ํ‰๊ฐ€ ๋ฒ”์œ„ ํ™•์žฅ๊ณผ ๋‹ค์–‘ํ•œ ํ”Œ๋žซํผ์œผ๋กœ์˜ ์ผ๋ฐ˜ํ™” ๊ฒ€์ฆ์ด ํ•„์š”ํ•˜๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •