Scaling Up and Distilling Down: Language-Guided Robot Skill Acquisition

์ €์ž: Huy Ha, Pete Florence, Shuran Song | ๋‚ ์งœ: 2023-07-26 | URL: https://arxiv.org/abs/2307.14535 📄 PDF


Essence

Figure 1

Figure 1: Language-guided Skill Acquisition enables scalable robot learning. In the data generation stage, a LLM takes

LLM ๊ธฐ๋ฐ˜ ๊ณ ์ˆ˜์ค€ ๊ณ„ํš๊ณผ sampling-based robot planner๋ฅผ ํ™œ์šฉํ•˜์—ฌ ์–ธ์–ด-๋ ˆ์ด๋ธ” ๋กœ๋ด‡ ๋ฐ์ดํ„ฐ ์ƒ์„ฑ์„ ํ™•์žฅํ•˜๊ณ , ์ด๋ฅผ diffusion policy๋ฅผ ํ†ตํ•ด ๋‹ค์ค‘ ์ž‘์—… ์–ธ์–ด-์กฐ๊ฑด visuo-motor ์ •์ฑ…์œผ๋กœ ์ฆ๋ฅ˜ํ•˜๋Š” ๋กœ๋ด‡ ์Šคํ‚ฌ ํš๋“ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์‹œํ•œ๋‹ค.

Motivation

Achievement

Figure 2

Figure 2: Benchmark. We validate our approach on a new multi-task benchmark addressing challenging long-horizon

How

Figure 3

Figure 3: Language-Driven Robot Data Generation takes as input the task description and simulation state, and outputs

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ๋ณธ ๋…ผ๋ฌธ์€ LLM ๊ธฐ๋ฐ˜ ๊ณ„ํš๊ณผ sampling-based planning์„ ๊ฒฐํ•ฉํ•œ ์ž๋™ ๋กœ๋ด‡ ๋ฐ์ดํ„ฐ ์ƒ์„ฑ๊ณผ multi-task diffusion policy ํ•™์Šต์˜ novelํ•œ ํ†ตํ•ฉ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์‹œํ•˜๋ฉฐ, 33.2% ์„ฑ๋Šฅ ํ–ฅ์ƒ๊ณผ ํ•จ๊ป˜ ๋กœ๋ด‡ ์Šคํ‚ฌ ์Šต๋“์˜ ํ™•์žฅ ๊ฐ€๋Šฅ์„ฑ์„ ์ž…์ฆํ•œ๋‹ค. ๋‹ค์ค‘ ์ž‘์—… ๋ฒค์น˜๋งˆํฌ์™€ ํ•จ๊ป˜ ๋กœ๋ด‡ ํ•™์Šต ๋ถ„์•ผ์— ์˜๋ฏธ ์žˆ๋Š” ๊ธฐ์—ฌ๋ฅผ ํ•˜๊ณ  ์žˆ๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •