Bootstrap Your Own Skills: Learning to Solve New Tasks with Large Language Model Guidance

์ €์ž: Jesse Zhang, Jiahui Zhang, Karl Pertsch, Ziyi Liu, Xiang Ren, Minsuk Chang, Shao-Hua Sun, Joseph J. Lim | ๋‚ ์งœ: 2023-10-16 | URL: https://arxiv.org/abs/2310.10021 📄 PDF


Essence

Figure 1

Figure 1: BOSS learns to execute a large set of useful, long-horizon skills with minimal supervision

BOSS๋Š” ๊ธฐ๋ณธ primitive ์Šคํ‚ฌ ์„ธํŠธ๋กœ๋ถ€ํ„ฐ LLM์˜ ์ง€๋„๋ฅผ ๋ฐ›์•„ ์Šคํ‚ฌ ์ฒด์ด๋‹์„ ํ†ตํ•ด ๋ณต์žกํ•œ ์žฅ๊ธฐ ์ž‘์—…์„ ์ˆ˜ํ–‰ํ•  ์ˆ˜ ์žˆ๋Š” ์Šคํ‚ฌ ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ๋ฅผ ์ž๋™์œผ๋กœ ๊ตฌ์ถ•ํ•˜๋Š” ๋ฐฉ๋ฒ•๋ก ์ด๋‹ค. ์ตœ์†Œํ•œ์˜ ๊ฐ๋…์œผ๋กœ ํ™˜๊ฒฝ๊ณผ์˜ ์ƒํ˜ธ์ž‘์šฉ์„ ํ†ตํ•ด ์˜๋ฏธ ์žˆ๋Š” ์Šคํ‚ฌ ์กฐํ•ฉ์„ ํ•™์Šตํ•œ๋‹ค.

Motivation

Achievement

Figure 1

Figure 1: BOSS learns to execute a large set of useful, long-horizon skills with minimal supervision

How

Figure 1

Figure 1: BOSS learns to execute a large set of useful, long-horizon skills with minimal supervision

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: BOSS๋Š” LLM์˜ ์ƒ์‹ ์ง€์‹๊ณผ ๊ฐ•ํ™”ํ•™์Šต์˜ ํ™˜๊ฒฝ ์ƒํ˜ธ์ž‘์šฉ์„ ์ฐฝ์˜์ ์œผ๋กœ ๊ฒฐํ•ฉํ•˜์—ฌ ์ตœ์†Œ ๊ฐ๋…์œผ๋กœ ์žฅ๊ธฐ ๋ณต์žก ์ž‘์—…์„ ํ•™์Šตํ•˜๋Š” ๋ฌธ์ œ์˜ ์‹ค์šฉ์ ์ด๊ณ  ํ™•์žฅ ๊ฐ€๋Šฅํ•œ ํ•ด๊ฒฐ์ฑ…์„ ์ œ์‹œํ•œ๋‹ค. ์‹คํ—˜ ๊ฒ€์ฆ๊ณผ ์‹ค์ œ ๋กœ๋ด‡ ์‹œ์—ฐ์„ ํ†ตํ•ด ๋†’์€ ์‹ ๋ขฐ์„ฑ์„ ํ™•๋ณดํ–ˆ์œผ๋ฉฐ, ๋กœ๋ด‡ ํ•™์Šต ๋ถ„์•ผ์˜ ์ค‘์š”ํ•œ ๊ธฐ์—ฌ์ด๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •