Emergent Active Perception and Dexterity of Simulated Humanoids from Visual Reinforcement Learning

์ €์ž: Zhengyi Luo, Chen Tessler, Toru Lin, Ye Yuan, Tairan He, Wenli Xiao, Yunrong Guo, Gal Chechik, Kris Kitani, Linxi Fan, Yuke Zhu | ๋‚ ์งœ: 2025-05-18 | URL: https://arxiv.org/abs/2505.12278 📄 PDF


Essence

Figure 1

Figure 1. Perceptive Dexterous Control (PDC) enables a humanoid equipped with egocentric vision to search for, reach, gr

๋ณธ ๋…ผ๋ฌธ์€ egocentric vision๋งŒ์„ ์‚ฌ์šฉํ•˜์—ฌ simulated humanoid๊ฐ€ ๋ณต์žกํ•œ household tasks๋ฅผ ์ˆ˜ํ–‰ํ•˜๋„๋ก ํ•˜๋Š” Perceptive Dexterous Control (PDC) ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์•ˆํ•˜๋ฉฐ, visual perception์„ task specification์˜ ์ธํ„ฐํŽ˜์ด์Šค๋กœ ํ™œ์šฉํ•˜์—ฌ active search ๋“ฑ์˜ emergent behaviors๋ฅผ ์œ ๋„ํ•œ๋‹ค.

Motivation

Achievement

Figure 1

Figure 1. Perceptive Dexterous Control (PDC) enables a humanoid equipped with egocentric vision to search for, reach, gr

How

Figure 2

Figure 2. Kitchens: Our agent is trained in parallel on a large set of (randomly) procedurally generated kitchens. Each

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ๋ณธ ๋…ผ๋ฌธ์€ egocentric vision์„ ์œ ์ผํ•œ ์ •๋ณด์›์œผ๋กœ ํ•˜๋Š” humanoid whole-body dexterous control์˜ ์‹คํ˜„์ด๋ผ๋Š” ๋„์ „์  ๋ฌธ์ œ๋ฅผ perception-as-interface ํŒจ๋Ÿฌ๋‹ค์ž„๊ณผ hierarchical RL์„ ํ†ตํ•ด ์ฐฝ์˜์ ์œผ๋กœ ํ•ด๊ฒฐํ•˜๋ฉฐ, emergent active search behaviors์˜ ๋ช…์‹œ์  ์ž…์ฆ์„ ํ†ตํ•ด vision-driven control์˜ ์ด์ ์„ ์ƒˆ๋กญ๊ฒŒ ์กฐ๋ช…ํ•œ๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •