Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware

์ €์ž: Tony Z. Zhao, Vikash Kumar, Sergey Levine, Chelsea Finn | ๋‚ ์งœ: 2023-04-23 | URL: https://arxiv.org/abs/2304.13705 📄 PDF


Essence

Figure 1

Fig. 1: ALOHA

์ €๋น„์šฉ ํ•˜๋“œ์›จ์–ด๋กœ ์„ธ๋ฐ€ํ•œ ์–‘ํŒ” ์กฐ์ž‘ ์ž‘์—…์„ ํ•™์Šตํ•˜๊ธฐ ์œ„ํ•ด ํ…”๋ ˆ์˜คํผ๋ ˆ์ด์…˜ ์‹œ์Šคํ…œ๊ณผ Action Chunking with Transformers (ACT) ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ๊ฒฐํ•ฉํ•œ ์‹œ์Šคํ…œ์„ ์ œ์‹œํ•œ๋‹ค.

Motivation

Achievement

Figure 1

Fig. 1: ALOHA

How

Figure 4

Fig. 4: Architecture of Action Chunking with Transformers (ACT). We train ACT as a Conditional VAE (CVAE), which has an

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ์ด ๋…ผ๋ฌธ์€ ์ €๋น„์šฉ ํ•˜๋“œ์›จ์–ด์™€ ํ˜์‹ ์ ์ธ imitation learning ์•Œ๊ณ ๋ฆฌ์ฆ˜์˜ ๊ฒฐํ•ฉ์œผ๋กœ ๋กœ๋ณดํ‹ฑ ์กฐ์ž‘์˜ ๋ฏผ์ฃผํ™”์— ๊ธฐ์—ฌํ•˜๋Š” ์ค‘์š”ํ•œ ์ž‘์—…์ด๋ฉฐ, Action Chunking with Transformers๋Š” ์˜ค๋ฅ˜ ์ถ•์  ๋ฌธ์ œ๋ฅผ ์šฐ์•„ํ•˜๊ฒŒ ํ•ด๊ฒฐํ•˜๋Š” ๋…์ฐฝ์  ๋ฐฉ๋ฒ•๋ก ์„ ์ œ์‹œํ•œ๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •