World Models for Robotic Manipulation: A Survey

์ €์ž: Fangyuan Wang, Ziyuan Wang, Guorui Pei, Mengshi Zhang, Canxi Liang, Jun Hu, Zhongxuan Li, Jinsong Wu, Ning Han, Zeqing Zhang, Jiaming Qi, Hongmin Wu, Shiyao Zhang, Pai Zheng, Jia Pan, David Navarro-Alarcon, Sichao Liu, Peng Zhou | ๋‚ ์งœ: 2026 | DOI: 10.48550/ARXIV.2606.00113 📄 PDF


Essence

Figure 2

Fig. 2. Representation spectrum of world models. The five families are ordered by increasing structured inductive bias,

๋กœ๋ด‡ ์กฐ์ž‘์„ ์œ„ํ•œ world model์— ๋Œ€ํ•œ ํฌ๊ด„์  ์„œ๋ฒ ์ด๋‹ค. ์„ธ ๊ฐ€์ง€ ์งˆ๋ฌธ(์–ด๋–ค ๋ฏธ๋ž˜ ํ‘œํ˜„์„ ์˜ˆ์ธกํ•˜๋Š”๊ฐ€, ์˜ˆ์ธก์„ ํ–‰๋™์— ์–ด๋–ป๊ฒŒ ์—ฐ๊ฒฐํ•˜๋Š”๊ฐ€, ํ•™์Šต ํŒŒ์ดํ”„๋ผ์ธ์˜ ์–ด๋А ๋‹จ๊ณ„์—์„œ ์‚ฌ์šฉ๋˜๋Š”๊ฐ€)์„ ์ค‘์‹ฌ์œผ๋กœ action-conditioned predictive system์œผ๋กœ์„œ์˜ world model์„ ์ •์˜ํ•˜๊ณ , ๋‹ค์„ฏ ๊ฐ€์ง€ ํ‘œํ˜„ ๊ณ„์—ด๊ณผ ๊ธฐ๋Šฅ์  ๋ถ„๋ฅ˜๋ฅผ ์ œ์‹œํ•œ๋‹ค.

Motivation

Achievement

Figure 4

Fig. 4. Five functional roles of infrastructure world models for robotic manipulation: synthetic experience generation,

How

Figure 5

Fig. 5. World models across the robot-learning lifecycle. During pretraining, predictive objectives learn reusable laten

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 4/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ์ด ์„œ๋ฒ ์ด๋Š” ๋กœ๋ด‡ ์กฐ์ž‘ ๋ถ„์•ผ์—์„œ fragmented๋œ world model ๋ฌธํ—Œ์„ ํ†ตํ•ฉํ•˜๋Š” ์ค‘์š”ํ•œ ๊ธฐ์—ฌ๋‹ค. ์„ธ ๊ฐ€์ง€ ์ง๊ต ์ถ•์˜ framework์™€ ๋ช…ํ™•ํ•œ operational definition์€ ํ–ฅํ›„ ์—ฐ๊ตฌ์˜ ์„ค๊ณ„ ์„ ํƒ์„ ๊ฐ€์ด๋“œํ•  ์ˆ˜ ์žˆ์œผ๋ฉฐ, 34๊ฐœ dataset ๊ฒ€ํ† ์™€ ์ข…ํ•ฉ ํ‰๊ฐ€ ํ”„๋กœํ† ์ฝœ์€ ์‹ค์งˆ์  ๊ฐ€์น˜๋ฅผ ์ œ๊ณตํ•œ๋‹ค. ๋‹ค๋งŒ closed-loop ํ‰๊ฐ€ ๋ถ€์กฑ๊ณผ contact modeling ๋“ฑ ์กฐ์ž‘ ๊ณ ์œ ์˜ ๋„์ „์ด ์—ฌ์ „ํžˆ ๋ฏธํ•ด๊ฒฐ๋˜์–ด ์žˆ๊ณ , ๊ฐœ๋…์  ๊ฒฝ๊ณ„์˜ ๋ชจํ˜ธ์„ฑ๋„ ์™„์ „ํžˆ ์ œ๊ฑฐ๋˜์ง€ ์•Š์•˜๋‹ค. ์ „์ฒด์ ์œผ๋กœ ์กฐ์ž‘ ์ค‘์‹ฌ์˜ predictive modeling์„ ์ดํ•ดํ•˜๋Š” ๋ฐ ํ•„์ˆ˜์ ์ธ ์ฐธ๊ณ ๋ฌธํ—Œ์ด์ง€๋งŒ, ๊ตฌ์ฒด์ ์ธ ๊ธฐ์ˆ  ํ˜์‹ ๋ณด๋‹ค๋Š” ์ข…ํ•ฉ ์ •๋ฆฌ์˜ ์„ฑ๊ฒฉ์ด ๊ฐ•ํ•˜๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •