SayPlan: Grounding Large Language Models using 3D Scene Graphs for Scalable Robot Task Planning

์ €์ž: Krishan Rana, Jesse Haviland, Sourav Garg, Jad Abou-Chakra, Ian Reid, Niko Suenderhauf | ๋‚ ์งœ: 2023-07-12 | URL: https://arxiv.org/abs/2307.06135 📄 PDF


Essence

Figure 1

Figure 1: SayPlan Overview (top). SayPlan operates across two stages to ensure scalability: (left)

SayPlan์€ 3D Scene Graph (3DSG) ํ‘œํ˜„์„ ํ™œ์šฉํ•˜์—ฌ LLM ๊ธฐ๋ฐ˜ ๋Œ€๊ทœ๋ชจ ๋กœ๋ด‡ ํƒœ์Šคํฌ ๊ณ„ํš์„ ํ™•์žฅ ๊ฐ€๋Šฅํ•˜๊ฒŒ ๋งŒ๋“œ๋Š” ์ ‘๊ทผ๋ฒ•์ด๋‹ค. ์˜๋ฏธ๋ก ์  ๊ฒ€์ƒ‰, ๊ณ ์ „์  ๊ฒฝ๋กœ ๊ณ„ํš ํ†ตํ•ฉ, ๋ฐ˜๋ณต ์žฌ๊ณ„ํš ํŒŒ์ดํ”„๋ผ์ธ์„ ํ†ตํ•ด ๋ฉ€ํ‹ฐ๋ฃธ, ๋ฉ€ํ‹ฐํ”Œ๋กœ์–ด ํ™˜๊ฒฝ์—์„œ ์‹คํ–‰ ๊ฐ€๋Šฅํ•œ ๊ณ„ํš์„ ์ƒ์„ฑํ•œ๋‹ค.

Motivation

Achievement

Figure 1

Figure 1: SayPlan Overview (top). SayPlan operates across two stages to ensure scalability: (left)

How

Figure 1

Figure 1: SayPlan Overview (top). SayPlan operates across two stages to ensure scalability: (left)

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: SayPlan์€ 3DSG์˜ ๊ณ„์ธต์  ๊ตฌ์กฐ๋ฅผ ์˜๋ฆฌํ•˜๊ฒŒ ํ™œ์šฉํ•˜์—ฌ ๋ฉ€ํ‹ฐ๋ฃธ, ๋ฉ€ํ‹ฐํ”Œ๋กœ์–ด ๋Œ€๊ทœ๋ชจ ํ™˜๊ฒฝ์—์„œ LLM ๊ธฐ๋ฐ˜ ๋กœ๋ด‡ ๊ณ„ํš์˜ ํ™•์žฅ์„ฑ ๋ฌธ์ œ๋ฅผ ์‹ค์งˆ์ ์œผ๋กœ ํ•ด๊ฒฐํ•œ ๊ฐ•๋ ฅํ•œ ์—ฐ๊ตฌ์ด๋‹ค. ์˜๋ฏธ๋ก ์  ๊ฒ€์ƒ‰, ๊ฒฝ๋กœ ๊ณ„ํš ํ†ตํ•ฉ, ๋ฐ˜๋ณต ์žฌ๊ณ„ํš ์กฐํ•ฉ์œผ๋กœ ์‹คํ–‰ ๊ฐ€๋Šฅํ•˜๊ณ  ์‹ ๋ขฐ์„ฑ ์žˆ๋Š” ๊ณ„ํš์„ ๋ณด์žฅํ•˜์—ฌ ์‹ค์ œ ๋กœ๋ณดํ‹ฑ์Šค ์‘์šฉ ๊ฐ€๋Šฅ์„ฑ์„ ์ž…์ฆํ•œ๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •