Opt2Skill: Imitating Dynamically-feasible Whole-Body Trajectories for Versatile Humanoid Loco-Manipulation

์ €์ž: Fukang Liu, Zhaoyuan Gu, Yilin Cai, Ziyi Zhou, Hyunyoung Jung, Jaehwi Jang, Shijie Zhao, Sehoon Ha, Yue Chen, Danfei Xu, Ye Zhao | ๋‚ ์งœ: 2024-09-30 | URL: https://arxiv.org/abs/2409.20514 📄 PDF


Essence

Figure 1

Fig. 1. The proposed Opt2Skill framework enables a Digit humanoid robot to

Opt2Skill์€ Differential Dynamic Programming (DDP)๋กœ ์ƒ์„ฑํ•œ ๋™์—ญํ•™์ ์œผ๋กœ ์‹คํ˜„ ๊ฐ€๋Šฅํ•œ ๊ถค์ ์„ Reinforcement Learning (RL)์œผ๋กœ ๋ชจ๋ฐฉํ•˜๊ฒŒ ํ•จ์œผ๋กœ์จ ์ธ๊ฐ„ํ˜• ๋กœ๋ด‡์˜ ๋‹ค์–‘ํ•œ ๋กœ์ฝ”-์กฐ์ž‘ ์ž‘์—…์„ ํšจ๊ณผ์ ์œผ๋กœ ์ˆ˜ํ–‰ํ•˜๋Š” ํ†ตํ•ฉ ํŒŒ์ดํ”„๋ผ์ธ์ด๋‹ค.

Motivation

Achievement

Figure 1

Fig. 1. The proposed Opt2Skill framework enables a Digit humanoid robot to

How

Figure 2

Fig. 2. Overall structure of the Opt2Skill framework. (a) We first generate structured, dynamically feasible reference t

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: Opt2Skill์€ model-based trajectory optimization๊ณผ reinforcement learning์„ ํšจ๊ณผ์ ์œผ๋กœ ๊ฒฐํ•ฉํ•˜์—ฌ ์ธ๊ฐ„ํ˜• ๋กœ๋ด‡์˜ ๋™์—ญํ•™์ ์œผ๋กœ ์‹คํ˜„ ๊ฐ€๋Šฅํ•œ ๋‹ค์–‘ํ•œ ๋กœ์ฝ”-์กฐ์ž‘ ์ž‘์—…์„ ์ฒด๊ณ„์ ์œผ๋กœ ํ•ด๊ฒฐํ•˜๋ฉฐ, ์‹ค์ œ ํ•˜๋“œ์›จ์–ด ์ „์ด๊นŒ์ง€ ์„ฑ๊ณตํ•œ ์ค‘์š”ํ•œ ๊ธฐ์—ฌ๋กœ, ํ† ํฌ ์ •๋ณด ํ™œ์šฉ๊ณผ ๊ด‘๋ฒ”์œ„ํ•œ ์‹คํ—˜ ๊ฒ€์ฆ์„ ํ†ตํ•ด ๋†’์€ ๊ณผํ•™์  ๊ฐ€์น˜๋ฅผ ๊ฐ–์ถ˜๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •