Advances in Embodied Navigation Using Large Language Models: A Survey

์ €์ž: Jinzhou Lin, Han Gao, Xuxiang Feng, Rongtao Xu, Changwei Wang, Man Zhang, Li Guo, Shibiao Xu | ๋‚ ์งœ: 2023-11-01 | URL: https://arxiv.org/abs/2311.00530 📄 PDF


Essence

Figure 1

Fig. 1: This presentation exhibit a temporal map depicting the works of embodied navigation from 2022 to 2024, and we

์ด ๋…ผ๋ฌธ์€ Large Language Models (LLMs)๊ณผ embodied intelligence์˜ ์œตํ•ฉ์— ์ดˆ์ ์„ ๋งž์ถฐ LLM ๊ธฐ๋ฐ˜ navigation ๋ชจ๋ธ๋“ค์˜ ์ตœ์‹  ๋™ํ–ฅ์„ ์ข…ํ•ฉ์ ์œผ๋กœ ์กฐ์‚ฌํ•˜๊ณ , ๊ธฐ์กด ๋ชจ๋ธ๊ณผ ๋ฐ์ดํ„ฐ์…‹์˜ ์žฅ๋‹จ์ ์„ ๋ถ„์„ํ•œ ์„œ๋ฒ ์ด์ด๋‹ค.

Motivation

Achievement

Figure 2

Fig. 2: The first type utilizes LLMs to analyze incoming visual or textual data to extract goal-relevant information, up

How

Figure 2

Fig. 2: The first type utilizes LLMs to analyze incoming visual or textual data to extract goal-relevant information, up

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ์ด ๋…ผ๋ฌธ์€ ๋น ๋ฅด๊ฒŒ ์„ฑ์žฅํ•˜๋Š” LLM ๊ธฐ๋ฐ˜ embodied navigation ๋ถ„์•ผ์— ๋Œ€ํ•œ ์ฒซ ๋ฒˆ์งธ ์ฒด๊ณ„์  ์„œ๋ฒ ์ด๋กœ์„œ, ํ˜„์žฌ๊นŒ์ง€์˜ ์—ฐ๊ตฌ ์„ฑ๊ณผ๋ฅผ ๋ช…ํ™•ํžˆ ๋ถ„๋ฅ˜ํ•˜๊ณ  ๋ฏธ๋ž˜ ๋ฐฉํ–ฅ์„ ์ œ์‹œํ•˜๋Š” ์ค‘์š”ํ•œ ๊ธฐ์—ฌ๋ฅผ ํ•œ๋‹ค. ๋‹ค๋งŒ, ๊ธฐ์ˆ ์  ๊นŠ์ด์™€ ์‹ค์ œ ๊ตฌํ˜„์ƒ์˜ ๋„์ „๊ณผ์ œ์— ๋Œ€ํ•œ ๋”์šฑ ๊ตฌ์ฒด์ ์ธ ๋ถ„์„์ด ๋ณด๊ฐ•๋œ๋‹ค๋ฉด ์‹ค๋ฌด์ž๋“ค์—๊ฒŒ ๋”์šฑ ์œ ์šฉํ•œ ์ž๋ฃŒ๊ฐ€ ๋  ๊ฒƒ์ด๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •