SkillMimic: Learning Basketball Interaction Skills from Demonstrations

Authors: Yinhuai Wang, Qihan Zhao, Runyi Yu, Hok Wai Tsui, Ailing Zeng, Jing Lin, Zhengyi Luo, Jiwen Yu, Xiu Li, Qifeng Chen, Jian Zhang, Lei Zhang, Ping Tan | Date: 2024-08-12 | URL: https://arxiv.org/abs/2408.15270


Essence

Figure 2

Figure 2. Concept of SkillMimic. We define an interaction skill as

SkillMimic is a data-driven framework that learns and composes diverse basketball interaction skills with a single policy, using a unified HOI (human-object interaction) imitation reward instead of skill-specific reward design.
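To make the idea of one skill-agnostic imitation reward concrete, here is a minimal sketch. It assumes the reward is a product of exponential tracking terms over body state, object state, and binary contact flags. The function names, dictionary keys, and scale values are hypothetical illustrations and are not taken from the paper.

```python
import math

def exp_kernel(sim, ref, scale):
    """Map the squared error between two equal-length vectors to (0, 1]."""
    sq_err = sum((s - r) ** 2 for s, r in zip(sim, ref))
    return math.exp(-scale * sq_err)

def contact_term(sim_contacts, ref_contacts, scale):
    """Penalize each mismatched binary contact flag (hypothetical contact encoding)."""
    mismatches = sum(1 for s, r in zip(sim_contacts, ref_contacts) if s != r)
    return math.exp(-scale * mismatches)

def hoi_imitation_reward(sim, ref, scales):
    """Assumed form: a product of terms, so one large error pulls the reward toward 0."""
    r_body = exp_kernel(sim["body"], ref["body"], scales["body"])
    r_obj = exp_kernel(sim["object"], ref["object"], scales["object"])
    r_contact = contact_term(sim["contacts"], ref["contacts"], scales["contact"])
    return r_body * r_obj * r_contact

# The same function scores every skill; only the reference frame changes.
ref = {"body": [0.0, 1.0], "object": [0.5], "contacts": [1, 0]}
scales = {"body": 1.0, "object": 1.0, "contact": 1.0}

perfect = hoi_imitation_reward(ref, ref, scales)

# Same poses, but the simulated character lost contact with the ball.
dropped = {"body": [0.0, 1.0], "object": [0.5], "contacts": [0, 0]}
missed = hoi_imitation_reward(dropped, ref, scales)
```

In this sketch nothing is tuned per skill: a dribble clip and a shooting clip would both be scored by `hoi_imitation_reward`, which is the property the summary above attributes to the unified reward.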

Motivation

Achievement

Figure 1

Figure 1. We propose a novel approach that for the first time enables physically simulated humanoids to learn a variety

How

Figure 3

Fig. 3 (b) shows the training pipeline of SkillMimic. Given

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 | Technical Soundness: 4/5 | Significance: 4/5 | Clarity: 4/5 | Overall: 4/5

Overall assessment: By removing skill-specific rewards, SkillMimic substantially improves the practicality of learning interaction skills. The contact graph and the unified HOI reward design are technically sound, and together with the contributed basketball dataset they make the work a significant advance for the field.
