Sim-and-Real Co-Training: A Simple Recipe for Vision-Based Robotic Manipulation

์ €์ž: Abhiram Maddukuri, Zhenyu Jiang, Lawrence Yunliang Chen, Soroush Nasiriany, Yuqi Xie, Yu Fang, Wenqi Huang, Zu Wang, Zhenjia Xu, Nikita Chernyadev, Scott Reed, Ken Goldberg, Ajay Mandlekar, Linxi Fan, Yuke Zhu | ๋‚ ์งœ: 2025-03-31 | URL: https://arxiv.org/abs/2503.24361 📄 PDF


Essence

Figure 1

Fig. 1: Sim-and-Real Co-Training. We show how co-training

์‹œ๋ฎฌ๋ ˆ์ด์…˜ ๋ฐ์ดํ„ฐ์™€ ์‹ค์ œ ๋กœ๋ด‡ ๋ฐ์ดํ„ฐ๋ฅผ ํ˜ผํ•ฉํ•˜์—ฌ ํ•™์Šตํ•˜๋Š” sim-and-real co-training ์ „๋žต์„ ์ฒด๊ณ„์ ์œผ๋กœ ์—ฐ๊ตฌํ•˜๊ณ , ๋น„์ „ ๊ธฐ๋ฐ˜ ๋กœ๋ด‡ ์กฐ์ž‘ ์ž‘์—…์—์„œ ์‹ค์ œ ๋ฐ์ดํ„ฐ๋งŒ ์‚ฌ์šฉํ•˜๋Š” ๊ฒƒ ๋Œ€๋น„ ํ‰๊ท  38% ์„ฑ๋Šฅ ํ–ฅ์ƒ์„ ๋‹ฌ์„ฑํ–ˆ๋‹ค.

Motivation

Achievement

Figure 4

Fig. 4:

How

Figure 2

Fig. 2: Method Overview. Our workflow consists of three components: (1) We start with a real-world target task in mind a

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ๋ณธ ๋…ผ๋ฌธ์€ sim-and-real co-training์˜ ์‹ค์šฉ์„ฑ์„ ์ฒด๊ณ„์ ์œผ๋กœ ๊ฒ€์ฆํ•˜์—ฌ ์‹ค์ œ ๋กœ๋ด‡ ํ•™์Šต์˜ ๋ฐ์ดํ„ฐ ํšจ์œจ์„ฑ ๋ฌธ์ œ์— ์ง์ ‘์ ์ธ ํ•ด๊ฒฐ์ฑ…์„ ์ œ์‹œํ•˜๋ฉฐ, ๋ช…ํ™•ํ•œ ์‹คํ—˜ ์„ค๊ณ„์™€ ์‹ค๋ฌด์  ๊ฐ€์ด๋“œ๋ผ์ธ์œผ๋กœ ๋กœ๋ด‡ ์ปค๋ฎค๋‹ˆํ‹ฐ์— ๋†’์€ ๊ฐ€์น˜๋ฅผ ์ œ๊ณตํ•œ๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •