Humanoid Parkour Learning

์ €์ž: Ziwen Zhuang, Shenzhe Yao, Hang Zhao | ๋‚ ์งœ: 2024-06-15 | URL: https://arxiv.org/abs/2406.10759 📄 PDF


Essence

Figure 1

Figure 1: We present a single vision-based end-to-end whole-body-control parkour policy for humanoid robots

๋ณธ ๋…ผ๋ฌธ์€ ์ธ๊ฐ„ํ˜• ๋กœ๋ด‡์ด motion prior ์—†์ด end-to-end vision-based ์ •์ฑ…์œผ๋กœ ๋‹ค์–‘ํ•œ parkour ๊ธฐ์ˆ ์„ ํ•™์Šตํ•  ์ˆ˜ ์žˆ๋Š” ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์‹œํ•œ๋‹ค. Fractal noise๋ฅผ ํ™œ์šฉํ•œ terrain randomization๊ณผ DAgger๋ฅผ ํ†ตํ•œ vision policy ์ฆ๋ฅ˜๋กœ sim-to-real transfer๋ฅผ ๋‹ฌ์„ฑํ•˜๋ฉฐ, ์‹ค์ œ ๋กœ๋ด‡์—์„œ 0.42m ์ ํ”„, 0.8m gap ํ†ต๊ณผ, 1.8m/s ์ฃผํ–‰ ๋“ฑ์„ ์„ฑ๊ณตํ•œ๋‹ค.

Motivation

Achievement

Figure 2

Figure 2: We design 10 different types of terrain with controllable difficulty. By training on all these

How

Figure 2

Figure 2: We design 10 different types of terrain with controllable difficulty. By training on all these

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 4/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ๋ณธ ๋…ผ๋ฌธ์€ ์ธ๊ฐ„ํ˜• ๋กœ๋ด‡์˜ parkour learning์—์„œ motion prior ์ œ๊ฑฐ์™€ fractal noise ๊ธฐ๋ฐ˜ ์ž๋™ foot-raising ์œ ๋„๋ผ๋Š” ์ค‘์š”ํ•œ ๊ธฐ์—ฌ๋ฅผ ์ œ์‹œํ•œ๋‹ค. 3๋‹จ๊ณ„ ํ›ˆ๋ จ ํŒŒ์ดํ”„๋ผ์ธ๊ณผ DAgger ์ฆ๋ฅ˜๋ฅผ ํ†ตํ•œ sim-to-real transfer๋Š” ๊ธฐ์ˆ ์ ์œผ๋กœ ๊ฒฌ๊ณ ํ•˜๋ฉฐ, ์‹ค์ œ ๋กœ๋ด‡์—์„œ์˜ ๋‹ค์–‘ํ•œ ์„ฑ๊ณต ์‚ฌ๋ก€๋Š” ์‹ค์šฉ์  ๊ฐ€์น˜๊ฐ€ ๋†’๋‹ค. ๋‹ค๋งŒ ์ง์„  track ์ œ์•ฝ, ์ •๋Ÿ‰์  ํ‰๊ฐ€ ๋ถ€์กฑ, ์ผ๋ฐ˜ํ™” ๊ฐ€๋Šฅ์„ฑ ๊ฒ€์ฆ ๋ฏธํก์ด ํ•œ๊ณ„์ด๋‚˜, ์ธ๊ฐ„ํ˜• ๋กœ๋ด‡์˜ agile locomotion ๋ถ„์•ผ์— ์ƒ๋‹นํ•œ ์ง„์ „์„ ์ด๋ฃจ์—ˆ๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •