DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based Character Skills

์ €์ž: Xue Bin Peng, Pieter Abbeel, Sergey Levine, Michiel van de Panne | ๋‚ ์งœ: 2018-04-08 | URL: https://arxiv.org/abs/1804.02717 📄 PDF


Essence

Figure 1

Fig. 1. Highly dynamic skills learned by imitating reference motion capture clips using our method, executed by physical

Motion capture ๋ฐ์ดํ„ฐ๋ฅผ ํ™œ์šฉํ•œ example-guided reinforcement learning์œผ๋กœ ๋ฌผ๋ฆฌ ๊ธฐ๋ฐ˜ ์บ๋ฆญํ„ฐ ์• ๋‹ˆ๋ฉ”์ด์…˜์„ ํ•™์Šตํ•˜๋Š” ๋ฐฉ๋ฒ•์„ ์ œ์•ˆํ•˜๋ฉฐ, ๋ชจ์…˜ ๋ชจ๋ฐฉ๊ณผ task ๋ชฉํ‘œ๋ฅผ ๊ฒฐํ•ฉํ•˜์—ฌ ๊ฐ•๊ฑดํ•˜๊ณ  ๋‹ค์–‘ํ•œ ๊ธฐ์ˆ ์„ ์ˆ˜ํ–‰ํ•˜๋Š” ์ œ์–ด ์ •์ฑ…์„ ํ•™์Šตํ•œ๋‹ค.

Motivation

Achievement

Figure 1

Fig. 1. Highly dynamic skills learned by imitating reference motion capture clips using our method, executed by physical

How

Figure 2

Fig. 2. Schematic illustration of the visuomotor policy network. The

Originality

Limitation & Further Study

Evaluation

Novelty: 3/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ๋ณธ ๋…ผ๋ฌธ์€ ๊ฐœ๋ณ„ ๊ธฐ์ˆ ์˜ novel ํ•œ ์กฐํ•ฉ๋ณด๋‹ค๋Š” physics-based character animation์—์„œ์˜ ํšจ๊ณผ์  ์‹œ์Šคํ…œ ์„ค๊ณ„๋ฅผ ํ†ตํ•ด ์‹ค์งˆ์  ๊ฐ€์น˜๋ฅผ ์ œ์‹œํ•˜๋ฉฐ, ๊ด‘๋ฒ”์œ„ํ•œ ์‹ค์ฆ ๊ฒฐ๊ณผ๋กœ ๋ฐฉ๋ฒ•์˜ ์‹ค์šฉ์„ฑ๊ณผ ํ™•์žฅ์„ฑ์„ ๊ฐ•๋ ฅํžˆ ์ž…์ฆํ•œ ๋งค์šฐ ์˜ํ–ฅ๋ ฅ ์žˆ๋Š” ๊ธฐ์—ฌ์ด๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •