Data Scaling Laws in Imitation Learning for Robotic Manipulation

์ €์ž: Yingdong Hu, Fanqi Lin, Pingyue Sheng, Chuan Wen, Jiacheng You, Yang Gao | ๋‚ ์งœ: 2024-10-24 | URL: https://arxiv.org/abs/2410.18647 📄 PDF


Essence

Figure 5

Figure 5: Power-law relationship. Dashed lines represent power-law fits, with the equations pro-

๋กœ๋ด‡ ์กฐ์ž‘ ํ•™์Šต์—์„œ ๋ฐ์ดํ„ฐ ์Šค์ผ€์ผ๋ง ๋ฒ•์น™์„ ์‹ค์ฆ์ ์œผ๋กœ ๊ทœ๋ช…ํ•˜๊ณ , ํ™˜๊ฒฝ๊ณผ ๊ฐ์ฒด ๋‹ค์–‘์„ฑ์ด ์ ˆ๋Œ€์  ๋ฐ์ดํ„ฐ ์–‘๋ณด๋‹ค ์ค‘์š”ํ•จ์„ ๋ณด์—ฌ์ฃผ์—ˆ์œผ๋ฉฐ ์ด๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ํšจ์œจ์ ์ธ ๋ฐ์ดํ„ฐ ์ˆ˜์ง‘ ์ „๋žต์„ ์ œ์‹œํ•œ๋‹ค.

Motivation

Achievement

Figure 3

Fig. 3 presents the results, revealing several notable patterns: (1) Increasing the number of training

How

Figure 2

Fig 2 presents the results, with shaded regions representing 95% confidence intervals. There are sev-

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ๋กœ๋ด‡ ์กฐ์ž‘ ๋ถ„์•ผ์—์„œ ์ฒ˜์Œ์œผ๋กœ ์ฒด๊ณ„์ ์ธ ๋ฐ์ดํ„ฐ ์Šค์ผ€์ผ๋ง ๋ฒ•์น™์„ 40,000๊ฐœ ์ด์ƒ์˜ ์‹ค์ œ ์‹œ์—ฐ๊ณผ ์—„๊ฒฉํ•œ ํ‰๊ฐ€ ํ”„๋กœํ† ์ฝœ์„ ํ†ตํ•ด ๊ทœ๋ช…ํ•œ ์ค‘์š”ํ•œ ์‹ค์ฆ ์—ฐ๊ตฌ๋กœ, ํ™˜๊ฒฝ-๊ฐ์ฒด ๋‹ค์–‘์„ฑ์˜ ์šฐ์›”์„ฑ์ด๋ผ๋Š” ์‹ค์šฉ์  ์ธ์‚ฌ์ดํŠธ๋Š” ๋กœ๋ด‡ ๋ฐ์ดํ„ฐ ์ˆ˜์ง‘ ์ „๋žต์˜ ํ˜์‹ ์„ ๊ฐ€์ ธ์˜ฌ ์ˆ˜ ์žˆ๋Š” ๊ณ ์ž„ํŒฉํŠธ ๋…ผ๋ฌธ์ด๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •