An Real-Sim-Real (RSR) Loop Framework for Generalizable Robotic Policy Transfer with Differentiable Simulation

์ €์ž: Lu Shi, Yuxuan Xu, Shiyu Wang, Jinhao Huang, Wenhao Zhao, Yufei Jia, Zike Yan, Weibin Gu, Guyue Zhou | ๋‚ ์งœ: 2025-03-13 | URL: https://arxiv.org/abs/2503.10118 📄 PDF


Essence

Figure 1

Fig. 1.

๋ณธ ๋…ผ๋ฌธ์€ Real-Sim-Real (RSR) ๋ฃจํ”„ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์•ˆํ•˜์—ฌ differentiable simulation์„ ํ™œ์šฉํ•ด ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ํŒŒ๋ผ๋ฏธํ„ฐ๋ฅผ ๋ฐ˜๋ณต์ ์œผ๋กœ ๊ฐœ์„ ํ•˜๊ณ  ์‹ค์ œ ์„ธ๊ณ„ ์กฐ๊ฑด๊ณผ ์ •๋ ฌ์‹œํ‚ด์œผ๋กœ์จ sim-to-real ๊ฐญ์„ ํ•ด์†Œํ•œ๋‹ค. ์ •๋ณด ์ด๋ก  ๊ธฐ๋ฐ˜์˜ ๋น„์šฉ ํ•จ์ˆ˜๋ฅผ ํ†ตํ•ด ๋‹ค์–‘ํ•˜๊ณ  ๋Œ€ํ‘œ์ ์ธ ์‹ค์„ธ๊ณ„ ๋ฐ์ดํ„ฐ ์ˆ˜์ง‘์„ ์œ ๋„ํ•˜์—ฌ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ์ •์ œ์˜ ํšจ์œจ์„ฑ์„ ๊ทน๋Œ€ํ™”ํ•œ๋‹ค.

Motivation

Achievement

How

Figure 1

Fig. 1.

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ๋ณธ ๋…ผ๋ฌธ์€ information theory ๊ธฐ๋ฐ˜์˜ informative cost function์„ ํ†ตํ•ด sim-to-real ์ „์ด ๋ฌธ์ œ๋ฅผ ์ฒด๊ณ„์ ์œผ๋กœ ํ•ด๊ฒฐํ•˜๋Š” ์ƒˆ๋กœ์šด RSR ๋ฃจํ”„ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์‹œํ•˜๋ฉฐ, differentiable simulation๊ณผ ๊ธฐ์กด RL ์•Œ๊ณ ๋ฆฌ์ฆ˜์˜ ํ†ตํ•ฉ์œผ๋กœ ์‹ค๋ฌด ์ ์šฉ ๊ฐ€๋Šฅ์„ฑ์ด ๋†’๋‹ค. ๋‹ค๋งŒ ์‹ค์„ธ๊ณ„ ์‹คํ—˜์˜ ๋ฒ”์œ„ ํ™•๋Œ€์™€ ๊ณ„์‚ฐ ๋น„์šฉ ๋ถ„์„์ด ์ถ”ํ›„ ๊ณผ์ œ์ด๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •