DemoDiffusion: One-Shot Human Imitation using pre-trained Diffusion Policy

์ €์ž: Sungjae Park, Homanga Bharadhwaj, Shubham Tulsiani | ๋‚ ์งœ: 2025-06-25 | URL: https://arxiv.org/abs/2506.20668 📄 PDF


Essence

Figure 2

Fig. 2: Retargeted human hand trajectory to closed-loop robot action sequence, for the task T : โ€œshut down the

DemoDiffusion์€ ๋‹จ์ผ ์ธ๊ฐ„ ์‹œ์—ฐ์œผ๋กœ๋ถ€ํ„ฐ ๋กœ๋ด‡์ด ์กฐ์ž‘ ์ž‘์—…์„ ์ˆ˜ํ–‰ํ•  ์ˆ˜ ์žˆ๋„๋ก ํ•˜๋Š” ๋ฐฉ๋ฒ•์œผ๋กœ, kinematic retargeting์œผ๋กœ ์–ป์€ ๊ถค์ ์„ pre-trained diffusion policy๋ฅผ ์ด์šฉํ•ด ๊ฐœ์„ ํ•œ๋‹ค.

Motivation

Achievement

Figure 4

Fig. 4: Real-World Manipulation Tasks. Human demonstrations for the 8 evaluation tasks, shown as two frames per task.

How

Figure 2

Fig. 2: Retargeted human hand trajectory to closed-loop robot action sequence, for the task T : โ€œshut down the

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: DemoDiffusion์€ pre-trained diffusion policy๋ฅผ kinematic retargeting์˜ ๊ฐœ์„ ์— ํ™œ์šฉํ•˜๋Š” ์šฐ์•„ํ•œ ์ ‘๊ทผ๋ฒ•์œผ๋กœ, ์‹ค์ œ ํ™˜๊ฒฝ์—์„œ ์ธ๊ฐ„ ์‹œ์—ฐ๋งŒ์œผ๋กœ ๋กœ๋ด‡ ์กฐ์ž‘์„ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•œ๋‹ค. ์‹ค์„ธ๊ณ„ ์„ฑ๋Šฅ(83.8%)๊ณผ ๊ธฐ์กด ๋ฐฉ๋ฒ• ๋Œ€๋น„ ์šฐ์›”์„ฑ์„ ์ž…์ฆํ–ˆ์œผ๋ฉฐ, ์‹ค์šฉ์  ๋ฐฐํฌ ๊ด€์ ์—์„œ ๋†’์€ ๊ฐ€์น˜๋ฅผ ๊ฐ€์ง„๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •