Unified Humanoid Fall-Safety Policy from a Few Demonstrations

์ €์ž: Zhengjie Xu, Ye Li, Kwan-yee Lin, Stella X. Yu | ๋‚ ์งœ: 2025-11-10 | DOI: 10.48550/arXiv.2511.07407 📄 PDF


Essence

Figure 1

Fig. 1.

ํœด๋จธ๋…ธ์ด๋“œ ๋กœ๋ด‡์ด ๊ท ํ˜•์„ ์žƒ์—ˆ์„ ๋•Œ ์•ˆ์ „ํ•˜๊ฒŒ ๋„˜์–ด์ง€๊ณ  ๋น ๋ฅด๊ฒŒ ์ผ์–ด๋‚  ์ˆ˜ ์žˆ๋„๋ก, ์ŠคํŒŒ์Šคํ•œ ์ธ๊ฐ„ ์‹œ์—ฐ๊ณผ reinforcement learning, diffusion ๊ธฐ๋ฐ˜ ๋ฉ”๋ชจ๋ฆฌ๋ฅผ ๊ฒฐํ•ฉํ•˜์—ฌ ๋‚™์ƒ ์˜ˆ๋ฐฉยท์ถฉ๊ฒฉ ์™„ํ™”ยทํšŒ๋ณต์„ ํ†ตํ•ฉํ•˜๋Š” ๋‹จ์ผ ์ •์ฑ…์„ ํ•™์Šตํ•œ๋‹ค.

Motivation

Achievement

Figure 2

Fig. 2.

How

Figure 2

Fig. 2.

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ๋ณธ ๋…ผ๋ฌธ์€ ํœด๋จธ๋…ธ์ด๋“œ ๋‚™์ƒ ์™„ํ™”์™€ ํšŒ๋ณต์„ ๋ช…์‹œ์ ์œผ๋กœ ํ†ตํ•ฉํ•˜๋Š” ์ฒซ ์„ฑ๊ณต์ ์ธ ํ†ตํ•ฉ ์ •์ฑ…์„ ์ œ์‹œํ•˜๋ฉฐ, ์ŠคํŒŒ์Šค ์ธ๊ฐ„ ์‹œ์—ฐ๊ณผ RL, diffusion model์„ ์ฐฝ์˜์ ์œผ๋กœ ๊ฒฐํ•ฉํ•˜์—ฌ ์•ˆ์ „ํ•œ ๋‹ค์ค‘ ๋ชจ๋‹ฌ ํ–‰๋™์„ ํ•™์Šตํ•œ๋‹ค. Unitree G1์—์„œ์˜ ๊ฒฌ๊ณ ํ•œ sim-to-real ์ „์ด์™€ ์ผ๊ด€๋œ ์„ฑ๋Šฅ์€ ์‹ค์ œ ํ™˜๊ฒฝ์—์„œ์˜ ๋กœ๋ด‡ ์•ˆ์ „์„ฑ์„ ํฌ๊ฒŒ ํ–ฅ์ƒ์‹œํ‚ฌ ๊ฐ€๋Šฅ์„ฑ์„ ๋ณด์—ฌ์ค€๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •