Toward General-Purpose Robots via Foundation Models: A Survey and Meta-Analysis

์ €์ž: Yafei Hu, Quanting Xie, Vidhi Jain, Jonathan Francis, Jay Patrikar, Nikhil Keetha, Seungchan Kim, Yaqi Xie, Tianyi Zhang, Hao-Shu Fang, Shibo Zhao, Shayegan Omidshafiei, Dong-Ki Kim, Ali-akbar Agha-mohammadi, Katia Sycara, Matthew Johnson-Roberson, Dhruv Batra, Xiaolong Wang, Sebastian Scherer, Chen Wang, Zsolt Kira, Fei Xia, Yonatan Bisk | ๋‚ ์งœ: 2023-12-14 | URL: https://arxiv.org/abs/2312.08782 📄 PDF


Essence

Figure 1

Figure 1: In this paper, we present a survey toward building general-purpose robots via foundation models. We mainly cat

์ด ๋…ผ๋ฌธ์€ NLP์™€ CV ๋ถ„์•ผ์˜ foundation models๋ฅผ ๋กœ๋ด‡ ๊ณตํ•™์— ์ ์šฉํ•˜์—ฌ ๋ฒ”์šฉ ๋กœ๋ด‡ ์‹œ์Šคํ…œ ๊ฐœ๋ฐœ์„ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•˜๋Š” ๋ฐฉ๋ฒ•์„ ํƒ๊ตฌํ•˜๋Š” ์ข…ํ•ฉ ์„ค๋ฌธ์กฐ์‚ฌ์ด๋ฉฐ, ๊ธฐ์กด vision/language foundation models์˜ ํ™œ์šฉ๊ณผ robotics-specific foundation models์˜ ์„ค๊ณ„๋ฅผ ๋‹ค๋ฃฌ๋‹ค.

Motivation

Achievement

Figure 1

Figure 1: In this paper, we present a survey toward building general-purpose robots via foundation models. We mainly cat

How

Figure 4

Figure 4: Comprehensive visualizations of the Open-X Embodiment and Droid Dataset encompassing robot morphologies, envir

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ์ด ๋…ผ๋ฌธ์€ ๋กœ๋ด‡ ๊ณตํ•™์— foundation models๋ฅผ ์ ์šฉํ•˜๋Š” ํ˜„ํ™ฉ์„ ์ตœ์ดˆ๋กœ ํฌ๊ด„์ ์œผ๋กœ ์ •๋ฆฌํ•œ ์ค‘์š”ํ•œ ์„ค๋ฌธ์กฐ์‚ฌ๋กœ, ์ฒด๊ณ„์ ์ธ ํƒ์†Œ๋…ธ๋ฏธ์™€ ๋ช…ํ™•ํ•œ ๋„์ „ ๊ณผ์ œ ๋ถ„์„์„ ์ œ๊ณตํ•˜๋ฉฐ, ํ–ฅํ›„ ๋ฒ”์šฉ ๋กœ๋ด‡ ๊ฐœ๋ฐœ์„ ์œ„ํ•œ ์—ฐ๊ตฌ ๋กœ๋“œ๋งต์„ ์ œ์‹œํ•œ๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •