Robot Utility Models: General Policies for Zero-Shot Deployment in New Environments

์ €์ž: Haritheja Etukuru, Norihito Naka, Zijin Hu, Seungjae Lee, Julian Mehu, Aaron Edsinger, Chris Paxton, Soumith Chintala, Lerrel Pinto, Nur Muhammad Mahi Shafiullah | ๋‚ ์งœ: 2024-09-09 | URL: https://arxiv.org/abs/2409.05865 📄 PDF


Essence

Figure 1

Figure 1: Robot Utility Models are trained on a diverse set of environments and objects, and then

Robot Utility Models (RUM)์€ ๋‹ค์–‘ํ•œ ํ™˜๊ฒฝ์—์„œ ์ˆ˜์ง‘ํ•œ ๋Œ€๊ทœ๋ชจ ๋ฐ์ดํ„ฐ๋กœ ํ•™์Šตํ•˜์—ฌ ์ƒˆ๋กœ์šด ํ™˜๊ฒฝ์—์„œ ํŒŒ์ธํŠœ๋‹ ์—†์ด ์ฆ‰์‹œ ๋ฐฐํฌ ๊ฐ€๋Šฅํ•œ ๋กœ๋ด‡ ์ •์ฑ… ํ”„๋ ˆ์ž„์›Œํฌ์ด๋‹ค. 90% ์„ฑ๊ณต๋ฅ ๋กœ ๋ฏธ์ง€์˜ ํ™˜๊ฒฝ๊ณผ ๊ฐ์ฒด์— ๋Œ€ํ•ด zero-shot ์ผ๋ฐ˜ํ™”๋ฅผ ๋‹ฌ์„ฑํ•œ๋‹ค.

Motivation

Achievement

Figure 1

Figure 1: Robot Utility Models are trained on a diverse set of environments and objects, and then

How

Figure 2

Figure 2: Stick-v2, our data collection tool (left: real photo, right: render), is built out of an iPhone

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ๋ณธ ๋…ผ๋ฌธ์€ ๋กœ๋ด‡ ์ •์ฑ…์˜ zero-shot ์ผ๋ฐ˜ํ™”๋ผ๋Š” ์ค‘์š”ํ•œ ๋ฌธ์ œ๋ฅผ ์ฒด๊ณ„์ ์ธ ์—”์ง€๋‹ˆ์–ด๋ง ์ ‘๊ทผ์œผ๋กœ ํ•ด๊ฒฐํ•˜๋ฉฐ, ์‹ค์šฉ์ ์ธ ๋ฐ์ดํ„ฐ ์ˆ˜์ง‘ ๋„๊ตฌ, ํšจ๊ณผ์ ์ธ ํ•™์Šต ๋ฐ ๋ฐฐํฌ ํŒŒ์ดํ”„๋ผ์ธ, ํ˜์‹ ์ ์ธ mLLM ๊ธฐ๋ฐ˜ ์‹คํŒจ ๋ณต๊ตฌ ๋ฉ”์ปค๋‹ˆ์ฆ˜์„ ์ œ์‹œํ•œ๋‹ค. 2,950ํšŒ์˜ ์‹ค์ œ ๋กœ๋ด‡ ๋กค์•„์›ƒ๊ณผ ์˜คํ”ˆ์†Œ์‹ฑ๋œ ๋ฆฌ์†Œ์Šค๋ฅผ ํ†ตํ•ด ๊ฐ•๋ ฅํ•œ ์‹ค์ฆ์  ๊ธฐ์—ฌ๋ฅผ ์ด๋ฃจ์—ˆ์œผ๋‚˜, ๋‹ค์–‘ํ•œ ์ž‘์—…/๋กœ๋ด‡ ํ”Œ๋žซํผ์œผ๋กœ์˜ ํ™•์žฅ์„ฑ๊ณผ ์ƒ์„ธํ•œ ์‹คํŒจ ๋ถ„์„์ด ํ–ฅํ›„ ๊ณผ์ œ๋กœ ๋‚จ์•„์žˆ๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •