RoboCat: A Self-Improving Generalist Agent for Robotic Manipulation

Authors: Konstantinos Bousmalis, Giulia Vezzani, Dushyant Rao, Coline Devin, Alex X. Lee, Maria Bauza, Todor Davchev, Yuxiang Zhou, Agrim Gupta, Akhil Raju, Antoine Laurens, Claudio Fantacci, Valentin Dalibard, Martina Zambelli, Murilo Martins, Rugile Pevceviciute, Michiel Blokzijl, Misha Denil, Nathan Batchelor, Thomas Lampe, Emilio Parisotto, Konrad Żołna, Scott Reed, Sergio Gómez Colmenarejo, Jon Scholz, Abbas Abdolmaleki, Oliver Groth, Jean-Baptiste Regli, Oleg Sushkov, Tom Rothörl, José Enrique Chen, Yusuf Aytar, Dave Barker, Joy Ortiz, Martin Riedmiller, Jost Tobias Springenberg, Raia Hadsell, Francesco Nori, Nicolas Heess | Date: 2023-06-20 | URL: https://arxiv.org/abs/2306.11706


Essence

Figure 1: The self-improvement process. RoboCat is a multi-task, multi-embodiment visual goal-conditioned

RoboCat is a self-improving robotic manipulation agent built on a visual goal-conditioned decision transformer. It draws on experience from different robots and tasks to handle multiple embodiments and multiple tasks. It adapts to new tasks and robots with only 100-1000 examples, and it can improve iteratively using self-generated data.
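The adapt-then-self-improve cycle described above can be sketched in a few lines. This is only a toy illustration under stated assumptions, not the authors' implementation: the `Agent` class, the functions `train`, `fine_tune`, `self_generate`, and `self_improvement_step`, the task names, and the ordering of the steps (fine-tune on demonstrations, let the fine-tuned agent generate more data, retrain on everything) are all hypothetical stand-ins. No model is trained here; the code only tracks how the dataset and the agent version evolve.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Agent:
    """Hypothetical stand-in for a generalist policy: records only its
    version number and the tasks present in its training data."""
    version: int = 0
    trained_tasks: frozenset = frozenset()


def train(dataset: dict, version: int) -> Agent:
    # Assumption: "training" is reduced to remembering which tasks were seen.
    return Agent(version=version, trained_tasks=frozenset(dataset))


def fine_tune(agent: Agent, task: str, demos: list) -> Agent:
    # The review states that adaptation uses 100-1000 examples.
    if not 100 <= len(demos) <= 1000:
        raise ValueError("expected between 100 and 1000 demonstrations")
    return Agent(version=agent.version,
                 trained_tasks=agent.trained_tasks | {task})


def self_generate(specialist: Agent, task: str, n_episodes: int) -> list:
    # Placeholder episodes; a real system would roll out the policy here.
    return [f"{task}/self-generated/{i}" for i in range(n_episodes)]


def self_improvement_step(agent, dataset, task, demos, n_episodes):
    specialist = fine_tune(agent, task, demos)               # adapt to new task
    generated = self_generate(specialist, task, n_episodes)  # collect own data
    new_dataset = {**dataset, task: list(demos) + generated}
    return train(new_dataset, agent.version + 1), new_dataset  # next version


# Usage with made-up task names.
dataset = {"stacking": ["stacking/demo/0"]}
agent = train(dataset, version=0)
demos = [f"insertion/demo/{i}" for i in range(100)]
agent, dataset = self_improvement_step(agent, dataset, "insertion", demos,
                                       n_episodes=50)
```

Each call to `self_improvement_step` yields a new agent version whose training set includes both the original demonstrations and the self-generated episodes for the new task, which is the sense in which the loop is iterative.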

Motivation

Achievement

Figure 5: RoboCat compared to VFM baselines on training tasks. RoboCat performs better on the vast

How

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 | Technical Soundness: 3/5 | Significance: 4/5 | Clarity: 4/5 | Overall: 4/5

์ดํ‰: RoboCat๋Š” foundation model ํŒจ๋Ÿฌ๋‹ค์ž„์„ ๋กœ๋ด‡ ์กฐ์ž‘์— ์„ฑ๊ณต์ ์œผ๋กœ ์ ์šฉํ•˜์—ฌ ์ด์งˆ์  embodiment ์ฒ˜๋ฆฌ, ํšจ์œจ์  ์ ์‘, ์ž๊ฐ€ ๊ฐœ์„ ์„ ๋™์‹œ์— ๋‹ฌ์„ฑํ•œ ํš๊ธฐ์  ์—ฐ๊ตฌ์ด๋‹ค. ๊ด‘๋ฒ”์œ„ํ•œ ์‹คํ—˜ ๊ฒ€์ฆ๊ณผ ๋ช…ํ™•ํ•œ presentation์ด ๊ฐ•์ ์ด๋‚˜, ๋ณต์žก๋„ ์ฆ๊ฐ€์™€ ์žฅ๊ธฐ scaling์— ๋Œ€ํ•œ ๋ถ„์„์ด ํ–ฅํ›„ ๊ณผ์ œ์ด๋‹ค.
