Chasing Stability: Humanoid Running via Control Lyapunov Function Guided Reinforcement Learning

์ €์ž: Zachary Olkin, Kejun Li, William D. Compton, Aaron D. Ames | ๋‚ ์งœ: 2025-09-23 | URL: https://arxiv.org/abs/2509.19573 📄 PDF


Essence

Figure 1

Fig. 1: Overview of our approach. Trajectory optimization

๋ณธ ๋…ผ๋ฌธ์€ Control Lyapunov Function(CLF)์˜ ์•ˆ์ •์„ฑ ์กฐ๊ฑด์„ RL ๋ณด์ƒ์— ์ž„๋ฒ ๋”ฉํ•˜์—ฌ ํœด๋จธ๋…ธ์ด๋“œ ๋กœ๋ด‡์˜ ๋‹ฌ๋ฆฌ๊ธฐ๋ฅผ ์‹คํ˜„ํ•˜๋Š” CLF-RL ๋ฐฉ๋ฒ•์„ ์ œ์‹œํ•œ๋‹ค. ์ด๋Š” ํœด๋จธ๋…ธ์ด๋“œ๊ฐ€ ๋น„ํ–‰ ๋ฐ ๋‹จ์ผ ์ง€์ง€ ์ƒ(flight and single support phases)๋ฅผ ํฌํ•จํ•œ ๋™์  ๋‹ฌ๋ฆฌ๊ธฐ๋ฅผ ์ˆ˜ํ–‰ํ•˜๋„๋ก ํ•œ๋‹ค.

Motivation

Achievement

How

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 4/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ๋ณธ ๋…ผ๋ฌธ์€ ๊ณ ์ „ ์ œ์–ด ์ด๋ก (CLF)๊ณผ ์ตœ์‹  RL์„ ๋งค์šฐ ํšจ๊ณผ์ ์œผ๋กœ ํ†ตํ•ฉํ•˜์—ฌ, ํœด๋จธ๋…ธ์ด๋“œ ๋กœ๋ด‡์˜ ๋™์  ๋‹ฌ๋ฆฌ๊ธฐ ์ œ์–ด๋ฅผ ์œ„ํ•œ ์›๋ฆฌ ๊ธฐ๋ฐ˜์˜ ์ฒด๊ณ„์  ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์‹œํ•œ๋‹ค. ์‹ค์ œ ํ•˜๋“œ์›จ์–ด์—์„œ์˜ ์•ˆ์ •์  ๋ฐฐํฌ์™€ ๊ฐ•๊ฑดํ•œ ์ถ”์  ์„ฑ๋Šฅ์€ ๋†’์€ ์‹ค์šฉ์  ๊ฐ€์น˜๋ฅผ ์ž…์ฆํ•œ๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •