Aligning Cyber Space with Physical World: A Comprehensive Survey on Embodied AI

Authors: Yang Liu, Weixing Chen, Yongjie Bai, Xiaodan Liang, Guanbin Li | Date: 2024.07 | DOI: N/A


Essence

Figure 1

Fig. 1. The framework of the embodied agent based on MLMs and WMs,

This paper is a comprehensive survey of Embodied AI. With the goal of aligning cyber space with the physical world, it covers recent advances in Multi-modal Large Models (MLMs) and World Models (WMs). It analyzes the latest methods and datasets across four main research targets: embodied perception, embodied interaction, embodied agents, and sim-to-real adaptation.

Motivation

Achievement

How

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 | Technical Soundness: 4/5 | Significance: 5/5 | Clarity: 4/5 | Overall: 4/5

Overall assessment: As the first comprehensive survey of Embodied AI in the era of MLMs, this paper systematically organizes work on embodied robots, simulators, perception, interaction, agents, and sim-to-real adaptation, and it proposes the ARIO dataset, making a substantial contribution to the research community. However, because the field is advancing rapidly, the survey will need continual updates, and validating generalization performance in real robot environments remains future work.
