Sampling-Based System Identification with Active Exploration for Legged Robot Sim2Real Learning

์ €์ž: Nikhil Sobanbabu, Guanqi He, Tairan He, Yuxiang Yang, Guanya Shi | ๋‚ ์งœ: 2025-05-20 | URL: https://arxiv.org/abs/2505.14266 📄 PDF


Essence

Figure 2

Figure 2: Overview of SPI-Active. Data Collection: Collect real-world trajectories using RL policies or

SPI-Active๋Š” legged robot์˜ ๋ฌผ๋ฆฌ ํŒŒ๋ผ๋ฏธํ„ฐ๋ฅผ ์ƒ˜ํ”Œ๋ง ๊ธฐ๋ฐ˜์œผ๋กœ ์‹๋ณ„ํ•˜๊ณ  Fisher Information ์ตœ๋Œ€ํ™”๋ฅผ ํ†ตํ•œ active exploration์œผ๋กœ sim-to-real ๊ฐญ์„ ์ตœ์†Œํ™”ํ•˜๋Š” two-stage ํ”„๋ ˆ์ž„์›Œํฌ์ด๋‹ค.

Motivation

Achievement

Figure 1

Figure 1: SPI-Active enables high-fidelity Sim-to-Real transfer across diverse locomotion tasks. To highlight

How

Figure 2

Figure 2: Overview of SPI-Active. Data Collection: Collect real-world trajectories using RL policies or

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ์ด ๋…ผ๋ฌธ์€ legged robot์˜ sim-to-real ๊ฐญ ํ•ด๊ฒฐ์„ ์œ„ํ•œ ์›๋ฆฌ์ ์ด๊ณ  ์‹ค์šฉ์ ์ธ system identification ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์‹œํ•˜๋ฉฐ, Fisher Information ๊ธฐ๋ฐ˜ active exploration ์ „๋žต์˜ ์ฐฝ์˜์  ์ ์šฉ์œผ๋กœ ๊ณ ์ •๋ฐ€ locomotion ์ž‘์—…์—์„œ ํ˜„์ €ํ•œ ์„ฑ๋Šฅ ํ–ฅ์ƒ์„ ๋‹ฌ์„ฑํ–ˆ๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •