Humanoid Policy ~ Human Policy

์ €์ž: Ri-Zhao Qiu, Shiqi Yang, Xuxin Cheng, Chaitanya Chawla, Jialong Li, Tairan He, Ge Yan, David J. Yoon, Ryan Hoque, Lars Paulsen, Ge Yang, Jian Zhang, Sha Yi, Guanya Shi, Xiaolong Wang | ๋‚ ์งœ: 2025-03-17 | URL: https://arxiv.org/abs/2503.13441 📄 PDF


Essence

Figure 1

Figure 1: This paper advocates high-quality human data as a data source for cross-embodiment

ํœด๋จธ๋…ธ์ด๋“œ ๋กœ๋ด‡ ์กฐ์ž‘ ์ •์ฑ… ํ•™์Šต์„ ์œ„ํ•ด ๋Œ€๊ทœ๋ชจ ์ž์•„์ค‘์‹ฌ ์ธ๊ฐ„ ๋ฐ๋ชจ๋ฅผ cross-embodiment ํ•™์Šต ๋ฐ์ดํ„ฐ๋กœ ํ™œ์šฉํ•˜๊ณ , Human Action Transformer (HAT)๋ฅผ ํ†ตํ•ด ์ธ๊ฐ„๊ณผ ๋กœ๋ด‡์„ ํ†ตํ•ฉ๋œ ์ƒํƒœ-ํ–‰๋™ ๊ณต๊ฐ„์—์„œ ๋‹ค์–‘ํ•œ embodiment์œผ๋กœ ๋ชจ๋ธ๋งํ•œ๋‹ค.

Motivation

Achievement

Figure 1

Figure 1: This paper advocates high-quality human data as a data source for cross-embodiment

How

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ๋กœ๋ด‡ ์กฐ์ž‘ ํ•™์Šต์—์„œ ๋Œ€๊ทœ๋ชจ ์ธ๊ฐ„ ๋ฐ์ดํ„ฐ ํ™œ์šฉ์˜ ์‹ค์งˆ์  ๊ฐ€์น˜๋ฅผ ์ž…์ฆํ•œ ์˜๋ฏธ ์žˆ๋Š” ์—ฐ๊ตฌ๋กœ, ํ†ตํ•ฉ๋œ state-action space์™€ ์ฒด๊ณ„์ ์ธ co-training ์ „๋žต์„ ํ†ตํ•ด embodiment ๊ฐ„๊ทน์„ ํšจ๊ณผ์ ์œผ๋กœ ํ•ด์†Œํ–ˆ์œผ๋ฉฐ, PH2D ๋ฐ์ดํ„ฐ์…‹๊ณผ HAT ๋ชจ๋ธ์˜ ๊ณต๊ฐœ๋ฅผ ํ†ตํ•ด cross-embodiment ํ•™์Šต ์ปค๋ฎค๋‹ˆํ‹ฐ์— ์ค‘์š”ํ•œ ๊ธฐ์—ฌ๋ฅผ ํ•  ๊ฒƒ์œผ๋กœ ๊ธฐ๋Œ€๋œ๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •