Dex1B: Learning with 1B Demonstrations for Dexterous Manipulation

์ €์ž: Jianglong Ye, Keyi Wang, Chengjing Yuan, Ruihan Yang, Yiquan Li, Jiyue Zhu, Yuzhe Qin, Xueyan Zou, Xiaolong Wang | ๋‚ ์งœ: 2025-06-20 | URL: https://arxiv.org/abs/2506.17198 📄 PDF


Essence

Figure 1

Fig. 1: The Dex1B benchmark consists of 1B generated high-quality demonstrations for grasping (top) and articulation (mi

์ƒ์„ฑ ๋ชจ๋ธ๊ณผ ์ตœ์ ํ™” ๋ฐฉ๋ฒ•์„ ๊ฒฐํ•ฉํ•˜์—ฌ 10์–ต ๊ฐœ์˜ ๊ณ ํ’ˆ์งˆ ์†๊ฐ€๋ฝ ์กฐ์ž‘ ์‹œ์—ฐ์„ ์ƒ์„ฑํ•œ Dex1B ๋ฐ์ดํ„ฐ์…‹๊ณผ ์ด๋ฅผ ํ™œ์šฉํ•˜๋Š” DexSimple ๋ฐฉ๋ฒ•์„ ์ œ์‹œํ•˜์—ฌ ์†๊ฐ€๋ฝ ์กฐ์ž‘ ์ž‘์—…์˜ ์„ฑ๋Šฅ์„ 22% ํ–ฅ์ƒ์‹œ์ผฐ๋‹ค.

Motivation

Achievement

Figure 1

Fig. 1: The Dex1B benchmark consists of 1B generated high-quality demonstrations for grasping (top) and articulation (mi

How

Figure 2

Fig. 2: Dex1B demonstration collection. The engine takes object assets and hand pose initialization as input, using a co

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ๋ณธ ๋…ผ๋ฌธ์€ ์ƒ์„ฑ ๋ชจ๋ธ๊ณผ ์ตœ์ ํ™”๋ฅผ ๊ฒฐํ•ฉํ•˜์—ฌ 10์–ต ๊ฐœ์˜ ๋Œ€๊ทœ๋ชจ ์†๊ฐ€๋ฝ ์กฐ์ž‘ ์‹œ์—ฐ ๋ฐ์ดํ„ฐ์…‹์„ ์ฒด๊ณ„์ ์œผ๋กœ ๊ตฌ์„ฑํ•˜๊ณ , ์ด๋ฅผ ํ™œ์šฉํ•œ ๊ฐ„๋‹จํ•˜๋ฉด์„œ๋„ ํšจ๊ณผ์ ํ•œ ํ•™์Šต ๋ฐฉ๋ฒ•์œผ๋กœ ์ตœ๊ณ  ์„ฑ๋Šฅ์„ ๋‹ฌ์„ฑํ•œ ์ค‘์š”ํ•œ ๊ธฐ์—ฌ์ด๋‹ค. ๋ฐ์ดํ„ฐ์…‹์˜ ๊ทœ๋ชจ, ๋‹ค์–‘์„ฑ, ํ’ˆ์งˆ ์ธก๋ฉด์—์„œ ํ˜์‹ ์ ์ด๋ฉฐ ์‹ค์ œ ๋กœ๋ด‡ ์‹คํ—˜์„ ํ†ตํ•œ ๊ฒ€์ฆ๋„ ์ถฉ๋ถ„ํ•˜๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •