A Generalist Agent

Authors: Scott Reed, Konrad Zolna, Emilio Parisotto, Sergio Gomez Colmenarejo, Alexander Novikov, Gabriel Barth-Maron, Mai Gimenez, Yury Sulsky, Jackie Kay, Jost Tobias Springenberg, Tom Eccles, Jake Bruce, Ali Razavi, Ashley Edwards, Nicolas Heess, Yutian Chen, Raia Hadsell, Oriol Vinyals, Mahyar Bordbar, Nando de Freitas | Date: 2022-05-12 | URL: https://arxiv.org/abs/2205.06175


Essence

Figure 1

Figure 1: A generalist agent. Gato can sense and act with different embodiments across a wide range of

Gato is a general-purpose policy agent built on a single neural network. It generalizes the large-language-model approach beyond text to handle multiple modalities and embodiments. One model with one set of weights can perform 604 distinct tasks, including Atari games, image captioning, dialogue, and robot control.

Motivation

Achievement

How

Figure 2

Figure 2: Training phase of Gato. Data from different tasks and modalities is serialized into a flat sequence of
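To make the serialization idea in Figure 2 concrete, here is a minimal Python sketch of how text tokens and continuous observations or actions might be flattened into one integer sequence. The function names (`mu_law`, `tokenize_continuous`, `serialize_timestep`) are hypothetical. The constants (a 32000-entry text vocabulary, 1024 bins, mu-law parameters 100 and 256) are assumptions loosely modeled on the tokenization scheme described in the Gato paper. This is an illustration, not the authors' implementation.

```python
import math

# Assumed constants, loosely modeled on the paper's description.
TEXT_VOCAB_SIZE = 32000   # token ids below this are reserved for text
NUM_BINS = 1024           # number of discrete bins for continuous values


def mu_law(x, mu=100.0, m=256.0):
    """Compress a real value with mu-law companding (sign-preserving)."""
    magnitude = math.log(abs(x) * mu + 1.0) / math.log(m * mu + 1.0)
    return math.copysign(magnitude, x)


def tokenize_continuous(values):
    """Map continuous values to integer tokens placed after the text range."""
    tokens = []
    for v in values:
        squashed = max(-1.0, min(1.0, mu_law(v)))           # clip to [-1, 1]
        bin_index = int((squashed + 1.0) / 2.0 * NUM_BINS)  # uniform binning
        bin_index = min(NUM_BINS - 1, bin_index)            # keep top edge in range
        tokens.append(TEXT_VOCAB_SIZE + bin_index)
    return tokens


def serialize_timestep(text_tokens, observation, action):
    """Concatenate one timestep's modalities into a single flat token list."""
    return (
        list(text_tokens)
        + tokenize_continuous(observation)
        + tokenize_continuous(action)
    )


if __name__ == "__main__":
    # Hypothetical timestep: two text tokens, a 2-D observation, a 1-D action.
    print(serialize_timestep([5, 7], [0.0, 0.25], [-0.5]))
```

Because every modality ends up in one shared integer vocabulary, a single sequence model can in principle be trained on all of them with one next-token objective, which is the design choice the figure illustrates.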

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

Overall assessment: Gato is a landmark study that extends the scaling principles of large language models to multimodal control problems and empirically demonstrates that a single generalist agent is feasible. The technical design is relatively simple, but integrating 604 tasks into one model and succeeding at real-world robot control give the work high practical value and long-term influence.
