Mastering Diverse Domains through World Models

Authors: Danijar Hafner, Jurgis Pasukonis, Jimmy Ba, Timothy Lillicrap | Date: 2023-01-10 | URL: https://arxiv.org/abs/2301.04104


Essence

Figure 1: Benchmark summary. a, Using fixed hyperparameters across all domains, Dreamer

DreamerV3 is a general-purpose RL algorithm that learns a world model and, with a single fixed set of hyperparameters, outperforms specialized algorithms across more than 150 diverse domains. Robustness techniques based on normalization, balancing, and transformation make learning stable across domains.
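To make the "transformation" idea concrete: the DreamerV3 paper uses a symlog function, sign(x) * ln(|x| + 1), to compress targets whose scale varies widely between domains, and its inverse symexp to recover them. The following is a minimal sketch in plain Python, not the authors' implementation; the function names and the sample reward values are illustrative assumptions.

```python
import math


def symlog(x: float) -> float:
    """Symmetric log compression: sign(x) * ln(|x| + 1)."""
    return math.copysign(math.log1p(abs(x)), x)


def symexp(x: float) -> float:
    """Inverse of symlog: sign(x) * (exp(|x|) - 1)."""
    return math.copysign(math.expm1(abs(x)), x)


# Hypothetical rewards spanning several orders of magnitude.
rewards = [-1000.0, -1.0, 0.0, 0.5, 1000.0]

# Compressed values stay in a narrow range regardless of the raw scale.
compressed = [symlog(r) for r in rewards]

# Applying symexp recovers the original values up to floating-point error.
recovered = [symexp(c) for c in compressed]
```

Because symlog is close to the identity near zero and logarithmic for large magnitudes, small rewards keep their resolution while large ones no longer dominate the loss, which is one way a single configuration can cope with very different reward scales.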

Motivation

Achievement

How

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 | Technical Soundness: 4/5 | Significance: 4/5 | Clarity: 4/5 | Overall: 4/5

Overall: DreamerV3 addresses the robustness problem of world-model-based RL and achieves the practical result of mastering multiple domains with a single configuration. In particular, collecting diamonds in Minecraft from scratch, without human data or curricula, solves a long-standing open challenge in the field for the first time, and is an important contribution that greatly widens the practical applicability of RL.
