LLM-State: Open World State Representation for Long-horizon Task Planning with Large Language Model

์ €์ž: Siwei Chen, Anxing Xiao, David Hsu | ๋‚ ์งœ: 2023-11-29 | URL: https://arxiv.org/abs/2311.17406 📄 PDF


Essence

Figure 1

Fig. 1: LLM-State Example. The proposed state representation is a mixture

๊ฐœ๋ฐฉํ˜• ํ™˜๊ฒฝ์—์„œ LLM์˜ ์žฅ๊ธฐ ์ž‘์—… ๊ณ„ํš์„ ์œ„ํ•ด ๊ฐ์ฒด ์†์„ฑ์„ ๋™์ ์œผ๋กœ ์ถ”์ ํ•˜๊ณ  ์—…๋ฐ์ดํŠธํ•˜๋Š” ํ•˜์ด๋ธŒ๋ฆฌ๋“œ ์ƒํƒœ ํ‘œํ˜„ LLM-State๋ฅผ ์ œ์•ˆํ•œ๋‹ค. ์ด๋Š” ๊ตฌ์กฐํ™”๋œ ๊ฐ์ฒด ์ค‘์‹ฌ ํ‘œํ˜„๊ณผ ๋น„๊ตฌ์กฐํ™”๋œ ํ–‰๋™ ์ด๋ ฅ ์š”์•ฝ์„ ๊ฒฐํ•ฉํ•˜์—ฌ ์žฅ๊ธฐ๊ฐ„ ์ƒํƒœ ์ถ”์  ๋ฐ ์‹คํŒจ ๋ณต๊ตฌ๋ฅผ ๊ฐœ์„ ํ•œ๋‹ค.

Motivation

Achievement

Figure 1

Fig. 1: LLM-State Example. The proposed state representation is a mixture

How

Figure 2

Fig. 2: Overview of the system framework. The task planner consists of three components: LLM as Encoder, LLM as State Es

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ์ด ๋…ผ๋ฌธ์€ ๊ฐœ๋ฐฉํ˜• ํ™˜๊ฒฝ์˜ ์žฅ๊ธฐ ์ž‘์—… ๊ณ„ํš์„ ์œ„ํ•ด LLM์˜ ์ถ”๋ก  ๋Šฅ๋ ฅ์„ ์ƒํƒœ ํ‘œํ˜„ ๊ตฌ์„ฑ์— ์ง์ ‘ ํ™œ์šฉํ•˜๋Š” ์ฐฝ์˜์  ์ ‘๊ทผ์„ ์ œ์‹œํ•˜๋ฉฐ, ๊ตฌ์กฐ-๋น„๊ตฌ์กฐ ํ•˜์ด๋ธŒ๋ฆฌ๋“œ ์„ค๊ณ„๋ฅผ ํ†ตํ•ด ๋ช…์‹œ์„ฑ๊ณผ ์œ ์—ฐ์„ฑ์˜ ๊ท ํ˜•์„ ๋‹ฌ์„ฑํ•œ๋‹ค. ๋‹ค๋งŒ ์‹ค์ œ ํ™˜๊ฒฝ ์ ์šฉ, ๊ณ„์‚ฐ ํšจ์œจ์„ฑ, ์ •๋Ÿ‰์  ๊ฒ€์ฆ์—์„œ ๊ฐœ์„ ์ด ํ•„์š”ํ•˜๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •