Octo: An Open-Source Generalist Robot Policy

์ €์ž: Octo Model Team, Dibya Ghosh, Homer Walke, Karl Pertsch, Kevin Black, Oier Mees, Sudeep Dasari, Joey Hejna, Tobias Kreiman, Charles Xu, Jianlan Luo, You Liang Tan, Lawrence Yunliang Chen, Pannag Sanketi, Quan Vuong, Ted Xiao, Dorsa Sadigh, Chelsea Finn, Sergey Levine | ๋‚ ์งœ: 2024-05-20 | URL: https://arxiv.org/abs/2405.12213 📄 PDF


Essence

Figure 1

Fig. 1: We introduce Octo, an open-source, generalist policy for robotic manipulation. Octo is a transformer-based polic

Open X-Embodiment ๋ฐ์ดํ„ฐ์…‹์˜ 800k ๊ถค์ ์œผ๋กœ ์‚ฌ์ „ํ•™์Šต๋œ transformer ๊ธฐ๋ฐ˜์˜ generalist robot policy์ธ Octo๋ฅผ ์ œ์•ˆํ•˜๋ฉฐ, ์–ธ์–ด ๋ช…๋ น์ด๋‚˜ ๋ชฉํ‘œ ์ด๋ฏธ์ง€๋กœ ์ง€์‹œ ๊ฐ€๋Šฅํ•˜๊ณ  ์ƒˆ๋กœ์šด ์„ผ์„œ์™€ ์•ก์…˜ ๊ณต๊ฐ„์œผ๋กœ ํšจ์œจ์ ์œผ๋กœ ๋ฏธ์„ธ์กฐ์ • ๊ฐ€๋Šฅํ•˜๋‹ค.

Motivation

Achievement

Figure 1

Fig. 1: We introduce Octo, an open-source, generalist policy for robotic manipulation. Octo is a transformer-based polic

How

Figure 2

Fig. 2: Model architecture. Left: Octo tokenizes task descriptions (green) and input observations (blue) using a pretrai

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: Octo๋Š” ๋Œ€๊ทœ๋ชจ ๋‹ค์–‘ํ•œ ๋ฐ์ดํ„ฐ์™€ ์œ ์—ฐํ•œ ์•„ํ‚คํ…์ฒ˜๋กœ generalist robot policy์˜ ์‹ค์งˆ์  ๋ฐœ์ „์„ ์ด๋ฃจ์—ˆ์œผ๋ฉฐ, ์™„์ „ ๊ณต๊ฐœ๋ฅผ ํ†ตํ•ด ๋กœ๋ด‡ ์ปค๋ฎค๋‹ˆํ‹ฐ์— ์ฆ‰์‹œ์  ๊ธฐ์—ฌ๋ฅผ ์ œ๊ณตํ•œ๋‹ค. ๋ฏธ์„ธ์กฐ์ • ํšจ์œจ์„ฑ๊ณผ ๋‹ค์ค‘ ํ”Œ๋žซํผ ํ˜ธํ™˜์„ฑ์€ ์‹ค์ œ ์‘์šฉ์„ฑ์„ ํฌ๊ฒŒ ๋†’์ธ๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •