Scaling Cross-Embodied Learning: One Policy for Manipulation, Navigation, Locomotion and Aviation

์ €์ž: Ria Doshi, Homer Walke, Oier Mees, Sudeep Dasari, Sergey Levine | ๋‚ ์งœ: 2024-08-21 | URL: https://arxiv.org/abs/2408.11812 📄 PDF


Essence

Figure 1

Figure 1: We introduce CrossFormer, a transformer-based policy trained on 900K trajectories of diverse,

CrossFormer๋Š” 20๊ฐœ์˜ ์„œ๋กœ ๋‹ค๋ฅธ ๋กœ๋ด‡ embodiment์—์„œ 900K ๊ถค์ ์œผ๋กœ ํ•™์Šต๋œ ๋‹จ์ผ transformer ๊ธฐ๋ฐ˜ ์ •์ฑ…์œผ๋กœ, ๊ด€์ฐฐ ๋ฐ ํ–‰๋™ ๊ณต๊ฐ„์˜ ์ˆ˜๋™ ์ •๋ ฌ ์—†์ด ์กฐ์ž‘, ๋„ค๋น„๊ฒŒ์ด์…˜, ๋ณดํ–‰, ํ•ญ๊ณต ๋กœ๋ด‡์„ ๋ชจ๋‘ ์ œ์–ดํ•  ์ˆ˜ ์žˆ๋‹ค.

Motivation

Achievement

Figure 5

Figure 5: Real Evaluation. We compare CrossFormer to the same architecture trained on just the

How

Figure 2

Figure 2: Policy architecture. Our architecture enables cross-embodied policy learning through

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 4/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: CrossFormer๋Š” cross-embodied ๋กœ๋ด‡ ํ•™์Šต์—์„œ ํš๊ธฐ์ ์ธ ์ง„์ „์„ ์ด๋ฃจ์—ˆ์œผ๋ฉฐ, ์‹ค์šฉ์ ์ธ ๋ฌธ์ œ(์„ผ์„œ/์•ก์ถ”์—์ดํ„ฐ ์ด์งˆ์„ฑ)๋ฅผ ์šฐ์•„ํ•˜๊ฒŒ ํ•ด๊ฒฐํ•˜๊ณ  ๊ด‘๋ฒ”์œ„ํ•œ ์‹ค์ œ ์‹คํ—˜์œผ๋กœ ๊ฒ€์ฆ๋œ ๊ฐ•๋ ฅํ•œ ์ž‘์—…์ด๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •