PPF: Pre-training and Preservative Fine-tuning of Humanoid Locomotion via Model-Assumption-based Regularization

Authors: Hyunyoung Jung, Zhaoyuan Gu, Ye Zhao, Hae-Won Park, Sehoon Ha | Date: 2025-04-14 | URL: https://arxiv.org/abs/2504.09833


Essence

Fig. 1.

This work proposes PPF, a framework for learning humanoid locomotion policies. It combines imitation learning of a model-based controller (pre-training) with reinforcement learning, and applies MAR (Model-Assumption-based Regularization) so that the policy is regularized only in states where the model assumptions hold.
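One way to read the MAR idea is as a masked imitation penalty added to the reinforcement learning objective. The sketch below is an assumption, not the paper's formulation: the squared-error form, the boolean `assumption_holds` mask, the `weight` parameter, and the function name `mar_penalty` are all hypothetical and chosen only for illustration.

```python
from typing import Sequence


def mar_penalty(
    policy_actions: Sequence[Sequence[float]],
    expert_actions: Sequence[Sequence[float]],
    assumption_holds: Sequence[bool],
    weight: float = 1.0,
) -> float:
    """Hypothetical masked regularizer.

    Averages the squared distance between the policy's actions and the
    model-based controller's actions, counting only the states flagged
    as satisfying the model assumptions. States outside the mask add
    nothing, so the policy is free to deviate there.
    """
    total = 0.0
    count = 0
    for pi_a, ex_a, ok in zip(policy_actions, expert_actions, assumption_holds):
        if not ok:
            continue  # model assumption violated: no regularization here
        total += sum((p - e) ** 2 for p, e in zip(pi_a, ex_a))
        count += 1
    if count == 0:
        return 0.0  # no state in the batch satisfies the assumption
    return weight * total / count


# Two states: only the first satisfies the (assumed) model-assumption check.
penalty = mar_penalty(
    policy_actions=[[0.0, 0.0], [1.0, 1.0]],
    expert_actions=[[1.0, 0.0], [5.0, 5.0]],
    assumption_holds=[True, False],
)
print(penalty)
```

Under this reading, a term like this would be added to the RL loss during fine-tuning, so the pre-trained behavior is preserved where the model-based controller is trustworthy and left to RL alone elsewhere.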

Motivation

Achievement

Fig. 1.

How

Fig. 3.

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 | Technical Soundness: 3/5 | Significance: 4/5 | Clarity: 4/5 | Overall: 4/5

Overall assessment: This paper proposes MAR, a novel regularization technique that combines the strengths of model-based and learning-based control while mitigating catastrophic forgetting. It achieves fast walking at 1.5 m/s and robustness across varied terrain on a real humanoid robot, which gives it high practical value.
