Mobi-$\pi$: Mobilizing Your Robot Learning Policy

Authors: Jingyun Yang, Isabella Huang, Brandon Vu, Max Bajracharya, Rika Antonova, Jeannette Bohg | Date: 2025-05-29 | URL: https://arxiv.org/abs/2505.23692


Essence

Figure 1: Introducing policy mobilization. (a) Assume a visuomotor policy π trained from one or a set of limited camera

This paper defines the "policy mobilization" problem: making a visuomotor manipulation policy that was trained from limited camera viewpoints executable on a mobile robot platform. It proposes a method that uses 3D Gaussian Splatting and sampling-based optimization to find the optimal robot base pose.
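The search over base poses can be pictured with a minimal sketch. It assumes a cross-entropy-style sampler and a base pose parameterized as (x, y, yaw); neither detail is stated in this review, so both are illustrative choices. `toy_score` is a hypothetical placeholder for whatever rates a candidate pose (for example, a score computed on a view rendered from a 3D Gaussian Splatting scene); here it is only the negative squared distance to an arbitrary target pose.

```python
import math
import random
from typing import Callable, Sequence, Tuple

# Hypothetical parameterization of the robot base pose: (x, y, yaw).
Pose = Tuple[float, float, float]


def toy_score(pose: Pose) -> float:
    """Placeholder score, higher is better.

    ASSUMPTION: stands in for a real pose-suitability score. It is just the
    negative squared distance to an arbitrary target pose.
    """
    tx, ty, tyaw = 1.0, -0.5, 0.3
    x, y, yaw = pose
    # Wrap the yaw difference into [-pi, pi] before squaring.
    dyaw = math.atan2(math.sin(yaw - tyaw), math.cos(yaw - tyaw))
    return -((x - tx) ** 2 + (y - ty) ** 2 + dyaw ** 2)


def optimize_base_pose(
    score: Callable[[Pose], float],
    init_mean: Sequence[float],
    init_std: Sequence[float],
    n_samples: int = 64,
    n_elite: int = 8,
    n_iters: int = 20,
    seed: int = 0,
) -> Pose:
    """Cross-entropy-style search: sample poses, keep the best, refit, repeat."""
    rng = random.Random(seed)
    mean = list(init_mean)
    std = list(init_std)
    best_pose: Pose = (mean[0], mean[1], mean[2])
    best_score = score(best_pose)

    for _ in range(n_iters):
        # Draw candidate poses from an axis-aligned Gaussian.
        samples = [
            (rng.gauss(mean[0], std[0]),
             rng.gauss(mean[1], std[1]),
             rng.gauss(mean[2], std[2]))
            for _ in range(n_samples)
        ]
        # Score each candidate once and sort from best to worst.
        scored = sorted(((score(p), p) for p in samples), reverse=True)
        elites = [p for _, p in scored[:n_elite]]

        if scored[0][0] > best_score:
            best_score, best_pose = scored[0]

        # Refit the sampling distribution to the elite set.
        # Yaw is treated as a plain real number for simplicity; samples that
        # straddle +/-pi would need a circular mean instead.
        for d in range(3):
            vals = [e[d] for e in elites]
            mean[d] = sum(vals) / n_elite
            var = sum((v - mean[d]) ** 2 for v in vals) / n_elite
            std[d] = max(math.sqrt(var), 1e-3)  # floor keeps sampling alive

    return best_pose


if __name__ == "__main__":
    pose = optimize_base_pose(toy_score, (0.0, 0.0, 0.0), (1.0, 1.0, 1.0))
    print("best pose:", pose, "score:", toy_score(pose))
```

The sampler only needs a black-box score, which is why a sampling-based optimizer fits a rendering-based objective that may not be differentiable with respect to the base pose.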

Motivation

Achievement

Figure 3: A suite of simulated tasks for benchmarking performance of policy mobilization methods. We pick five single-st

Mobi-π framework: metrics that quantify how difficult policy mobilization is, a RoboCasa-based suite of simulated tasks, and visualization tools for analysis.
Effectiveness of the method: it outperforms both non-policy-aware and policy-aware baselines, in simulation and in the real world.
Reuse of existing policies: manipulation policies trained only on stationary-robot data are deployed effectively on a mobile platform.

How

Figure 2: Overview of our proposed proof-of-concept method. The goal of our method is to find a proper robot pose p for

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 | Technical Soundness: 4/5 | Significance: 4/5 | Clarity: 4/5 | Overall: 4/5

Overall: A strong study that clearly defines policy mobilization and presents a practical solution based on 3D Gaussian Splatting. It elegantly addresses the problem of deploying existing stationary-robot policies on mobile robots, and the systematic evaluation made possible by the Mobi-π framework is especially valuable. Larger-scale real-world experiments and a more sophisticated method would further increase its impact.
