VIRAL: Visual Sim-to-Real at Scale for Humanoid Loco-Manipulation

์ €์ž: Tairan He, Zi Wang, Haoru Xue, Qingwei Ben, Zhengyi Luo, Wenli Xiao, Ye Yuan, Xingye Da, Fernando Castaรฑeda, Shankar Sastry, Changliu Liu, Guanya Shi, Linxi Fan, Yuke Zhu | ๋‚ ์งœ: 2025-11-27 | DOI: 10.48550/arXiv.2511.15200 📄 PDF


Essence

Figure 2

Figure 2. VIRAL teacher-student pipeline. Phase 1: In simulation, a privileged RL teacher policy ฯ€teacher receives full-

VIRAL์€ humanoid robot์˜ loco-manipulation์„ ์‹œ๋ฎฌ๋ ˆ์ด์…˜์—์„œ ํ•™์Šตํ•˜๊ณ  zero-shot์œผ๋กœ ์‹ค์ œ ๋กœ๋ด‡์— ๋ฐฐํฌํ•˜๋Š” visual sim-to-real ํ”„๋ ˆ์ž„์›Œํฌ์ด๋ฉฐ, teacher-student ๊ตฌ์กฐ์™€ ๋Œ€๊ทœ๋ชจ GPU ์ปดํ“จํŒ…์„ ํ™œ์šฉํ•˜์—ฌ RGB ๊ธฐ๋ฐ˜ ์ •์ฑ…์„ ํ†ตํ•ด 54๊ฐœ ์‚ฌ์ดํด์˜ ์—ฐ์†์ ์ธ ๊ฐ์ฒด ์ด๋™์„ ๋‹ฌ์„ฑํ–ˆ๋‹ค.

Motivation

Achievement

Figure 1

Figure 1. Center: Unitree G1 humanoid performing loco-manipulation, walking between tables to place and pick objects for

How

Figure 2

Figure 2. VIRAL teacher-student pipeline. Phase 1: In simulation, a privileged RL teacher policy ฯ€teacher receives full-

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 4/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ๋ณธ ๋…ผ๋ฌธ์€ humanoid loco-manipulation์— ๋Œ€ํ•œ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ๊ธฐ๋ฐ˜ ์ ‘๊ทผ์˜ ์‹คํ˜„ ๊ฐ€๋Šฅ์„ฑ์„ ๋Œ€๊ทœ๋ชจ GPU ์ปดํ“จํŒ…๊ณผ ์ฒด๊ณ„์ ์ธ ์„ค๊ณ„๋ฅผ ํ†ตํ•ด ์‹ค์ฆํ•œ ์ค‘์š”ํ•œ ์—ฐ๊ตฌ๋กœ, teacher-student ํ”„๋ ˆ์ž„์›Œํฌ์™€ visual domain randomization์˜ ์กฐํ•ฉ์ด zero-shot sim-to-real ์ „์ด๋ฅผ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•จ์„ ๋ณด์—ฌ์ค€๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •