Code as Policies: Language Model Programs for Embodied Control

์ €์ž: Jacky Liang, Wenlong Huang, Fei Xia, Peng Xu, Karol Hausman, Brian Ichter, Pete Florence, Andy Zeng | ๋‚ ์งœ: 2022-09-16 | URL: https://arxiv.org/abs/2209.07753 📄 PDF


Essence

Figure 1

Fig. 1: Given examples (via few-shot prompting), robots can use code-writing

Large Language Model(LLM)์„ ํ™œ์šฉํ•˜์—ฌ ์ž์—ฐ์–ด ๋ช…๋ น์„ ๋กœ๋ด‡ ์ •์ฑ… ์ฝ”๋“œ๋กœ ์ง์ ‘ ๋ณ€ํ™˜ํ•˜๋Š” "Code as Policies" ๋ฐฉ์‹์„ ์ œ์•ˆํ•˜๋ฉฐ, few-shot prompting๊ณผ hierarchical code-gen์„ ํ†ตํ•ด ๋ณต์žกํ•œ ๋กœ๋ด‡ ํ–‰๋™์„ ์‹ค์‹œ๊ฐ„์œผ๋กœ ์ƒ์„ฑํ•œ๋‹ค.

Motivation

Achievement

Figure 2

Fig. 2: Code as Policies can follow natural language instructions across diverse domains and robots: table-top manipulat

How

Figure 1

Fig. 1: Given examples (via few-shot prompting), robots can use code-writing

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ์ด ๋…ผ๋ฌธ์€ LLM์„ ๋กœ๋ด‡ ์ •์ฑ… ์ƒ์„ฑ์— ์ง์ ‘ ์ ์šฉํ•˜๋Š” ์ฐฝ์˜์ ์ธ ๋ฐฉ์‹์„ ์ œ์‹œํ•˜๋ฉฐ, hierarchical code-gen์„ ํ†ตํ•œ ์„ฑ๋Šฅ ๊ฐœ์„ ๊ณผ ๋‹ค์–‘ํ•œ ์‹ค์ œ ๋กœ๋ด‡ ํ”Œ๋žซํผ์—์„œ์˜ ๊ฒ€์ฆ์œผ๋กœ ๊ฐ•ํ•œ ์ž„ํŒฉํŠธ๋ฅผ ๊ฐ€์ง„๋‹ค. ๋‹ค๋งŒ ์ƒ์„ฑ ์ฝ”๋“œ์˜ ์•ˆ์ •์„ฑ ๊ฒ€์ฆ๊ณผ ์‹ค์‹œ๊ฐ„ ์„ฑ๋Šฅ ํ‰๊ฐ€๊ฐ€ ๋ณด์™„๋˜๋ฉด ๋”์šฑ ์™„์„ฑ๋„ ๋†’์€ ์—ฐ๊ตฌ๊ฐ€ ๋  ๊ฒƒ์ด๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •