Learning agile and dynamic motor skills for legged robots

์ €์ž: Jemin Hwangbo, Joonho Lee, Alexey Dosovitskiy, Dario Bellicoso, Vassilios Tsounis, Vladlen Koltun, Marco Hutter | ๋‚ ์งœ: 2019-01-24 | URL: https://arxiv.org/abs/1901.08652 📄 PDF


Essence

Figure 5

Fig. 5. Training control policies in simulation. The policy net-

๋ณธ ๋…ผ๋ฌธ์€ ์‹œ๋ฎฌ๋ ˆ์ด์…˜์—์„œ reinforcement learning์œผ๋กœ ์‚ฌ์กฑ ๋กœ๋ด‡์˜ ์ œ์–ด ์ •์ฑ…์„ ํ•™์Šตํ•˜๊ณ  ํ˜„์‹ค์˜ ANYmal ๋กœ๋ด‡์— ์ „์ดํ•˜๋Š” ๋ฐฉ๋ฒ•์„ ์ œ์‹œํ•˜์—ฌ, ๊ณ ์† ์ฃผํ–‰๊ณผ ๋‚™ํ•˜ ๋ณต๊ตฌ ๋“ฑ์˜ ๋™์  ์šด๋™ ๊ธฐ์ˆ ์„ ๋‹ฌ์„ฑํ–ˆ๋‹ค.

Motivation

Achievement

How

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 4/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ๋ณธ ๋…ผ๋ฌธ์€ ์‚ฌ์กฑ ๋กœ๋ด‡์˜ ๋™์  ์ œ์–ด์— reinforcement learning๊ณผ domain randomization์„ ํšจ๊ณผ์ ์œผ๋กœ ๊ฒฐํ•ฉํ•˜์—ฌ ์‹œ๋ฎฌ๋ ˆ์ด์…˜-ํ˜„์‹ค ์ „์ด ๋ฌธ์ œ๋ฅผ ์ฒด๊ณ„์ ์œผ๋กœ ํ•ด๊ฒฐํ–ˆ์œผ๋ฉฐ, ์‹ค์ œ ๊ณ ๊ธ‰ ๋กœ๋ด‡ ํ”Œ๋žซํผ์—์„œ ์ด์ „์— ๋‹ฌ์„ฑํ•˜์ง€ ๋ชปํ•œ ์ˆ˜์ค€์˜ ์šด๋™ ๊ธฐ์ˆ ์„ ๊ตฌํ˜„ํ•จ์œผ๋กœ์จ ๋กœ๋ด‡ ์ œ์–ด ๋ถ„์•ผ์— ์ค‘์š”ํ•œ ๊ธฐ์—ฌ๋ฅผ ํ–ˆ๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •