Being-H0.5: Scaling Human-Centric Robot Learning for Cross-Embodiment Generalization

Authors: Hao Luo, Ye Wang, Wanpeng Zhang, Sipeng Zheng, Ziheng Xi, Chaoyi Xu, Haiweng Xu, Haoqi Yuan, Chi Zhang, Yiqing Wang, Yicheng Feng, Zongqing Lu | Date: 2026-01-19 | URL: https://arxiv.org/abs/2601.12993


Essence

Figure 1: Being-H0.5 at a Glance. We scale human-centric robot learning with Being-H0.5 toward

Being-H0.5 is a foundation Vision-Language-Action model that uses a human-centric learning paradigm and a unified action space to generalize across diverse robot platforms. Trained with UniHand-2.0, which comprises more than 35,000 hours of multimodal data, it achieves strong cross-embodiment performance on 30 robot platforms.

Motivation

Achievement


How

Figure 2: Overview of UniHand 2.0. UniHand 2.0 is our large-scale pre-training recipe for human-centric

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 | Technical Soundness: 3/5 | Significance: 4/5 | Clarity: 4/5 | Overall: 4/5

์ดํ‰: Being-H0.5๋Š” ์ธ๊ฐ„ ์ค‘์‹ฌ ํ•™์Šต ํŒจ๋Ÿฌ๋‹ค์ž„๊ณผ ๋Œ€๊ทœ๋ชจ ํ†ตํ•ฉ ๋ฐ์ดํ„ฐ์…‹์„ ํ™œ์šฉํ•˜์—ฌ cross-embodiment ๋กœ๋ด‡ ์ผ๋ฐ˜ํ™”์˜ ์ค‘์š”ํ•œ ์ง„์ „์„ ์ด๋ฃฌ ์˜๋ฏธ ์žˆ๋Š” ์—ฐ๊ตฌ์ด๋ฉฐ, Mixture-of-Flow, Manifold-Preserving Gating ๋“ฑ์˜ ๊ธฐ์ˆ  ํ˜์‹ ๊ณผ ์‹ค์„ธ๊ณ„ ๋ฐฐํฌ ์„ฑ๊ณต์ด ๋กœ๋ด‡๊ณตํ•™์˜ ํ™•์žฅ์„ฑ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๋Š” ๋ฐ ๊ธฐ์—ฌํ•œ๋‹ค.
