EgoDex: Learning Dexterous Manipulation from Large-Scale Egocentric Video

์ €์ž: Ryan Hoque, Peide Huang, David J. Yoon, Mouli Sivapurapu, Jian Zhang | ๋‚ ์งœ: 2025-05-16 | URL: https://arxiv.org/abs/2505.11709 📄 PDF


Essence

Figure 1

Figure 1: EgoDex is a large-scale egocentric dataset that focuses on human dexterous manipulation.

Apple Vision Pro๋ฅผ ํ™œ์šฉํ•˜์—ฌ 829์‹œ๊ฐ„์˜ 3D ์† ์ถ”์  ์ฃผ์„์ด ํฌํ•จ๋œ ๋Œ€๊ทœ๋ชจ ์ž์•„์ค‘์‹ฌ ๋น„๋””์˜ค ๋ฐ์ดํ„ฐ์…‹ EgoDex๋ฅผ ์ˆ˜์ง‘ํ•˜๊ณ , ์ด๋ฅผ ํ†ตํ•ด ๊ธฐ์ˆ ์  ์กฐ์ž‘ ๋ชจ๋ฐฉ ํ•™์Šต์„ ์œ„ํ•œ ๋ฒค์น˜๋งˆํฌ๋ฅผ ์ œ์‹œํ•œ๋‹ค.

Motivation

Achievement

Figure 1

Figure 1: EgoDex is a large-scale egocentric dataset that focuses on human dexterous manipulation.

How

Figure 3

Figure 3: Left: Joints captured by EgoDex. Right: Examples of dexterous manipulation behaviors.

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: EgoDex๋Š” ๊ธฐ์ˆ ์  ์กฐ์ž‘ ํ•™์Šต์„ ์œ„ํ•œ ํš๊ธฐ์ ์ธ ๋Œ€๊ทœ๋ชจ ๋ฐ์ดํ„ฐ์…‹์„ ์ œ๊ณตํ•˜๋ฉฐ, ์›จ์–ด๋Ÿฌ๋ธ” ๊ธฐ์ˆ ์˜ ์‹ค์ œ ํ™œ์šฉ์„ ํ†ตํ•ด ๋กœ๋ด‡ ์กฐ์ž‘ ๋ถ„์•ผ์˜ '์ธํ„ฐ๋„ท ๊ทœ๋ชจ ๋ฐ์ดํ„ฐ' ์‹œ๋Œ€๋ฅผ ๊ฐœ์ฒ™ํ•œ๋‹ค. ๋ฐ์ดํ„ฐ์…‹์˜ ๊ทœ๋ชจ์™€ ์ •๋ฐ€๋„๋Š” ํƒ์›”ํ•˜๋‚˜, ์‹ค์ œ ๋กœ๋ด‡ ์ •์ฑ… ์ „์ด์˜ ์‹คํšจ์„ฑ ๊ฒ€์ฆ์ด ํ›„์† ๊ณผ์ œ๋กœ ๋‚จ์•„์žˆ๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •