GENMO: A GENeralist Model for Human MOtion

์ €์ž: Jiefeng Li, Jinkun Cao, Haotian Zhang, Davis Rempe, Jan Kautz, Umar Iqbal, Ye Yuan | ๋‚ ์งœ: 2025-05-02 | URL: https://arxiv.org/abs/2505.01425 📄 PDF


Essence

Figure 1

Figure 1. GENMO unifies human motion estimation and generation in a single framework and supports diverse conditioning s

๋ณธ ๋…ผ๋ฌธ์€ ์ธ๊ฐ„ ๋ชจ์…˜ ์ƒ์„ฑ๊ณผ ์ถ”์ •์„ ๋‹จ์ผ diffusion ๊ธฐ๋ฐ˜ ํ”„๋ ˆ์ž„์›Œํฌ์—์„œ ํ†ตํ•ฉํ•˜๋Š” GENMO๋ฅผ ์ œ์•ˆํ•œ๋‹ค. ๋ชจ์…˜ ์ถ”์ •์„ ์ œ์•ฝ์ด ์žˆ๋Š” ๋ชจ์…˜ ์ƒ์„ฑ์œผ๋กœ ์žฌ์ •์˜ํ•˜๊ณ , dual-mode ํ•™์Šต ํŒจ๋Ÿฌ๋‹ค์ž„์„ ํ†ตํ•ด ์ •ํ™•ํ•œ global motion estimation๊ณผ ๋‹ค์–‘ํ•œ ๋ชจ์…˜ ์ƒ์„ฑ์„ ๋™์‹œ์— ๋‹ฌ์„ฑํ•œ๋‹ค.

Motivation

Achievement

Figure 1

Figure 1. GENMO unifies human motion estimation and generation in a single framework and supports diverse conditioning s

How

Figure 1

Figure 1. GENMO unifies human motion estimation and generation in a single framework and supports diverse conditioning s

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 4/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ๋ณธ ๋…ผ๋ฌธ์€ ์ธ๊ฐ„ ๋ชจ์…˜ ์ƒ์„ฑ๊ณผ ์ถ”์ •์„ ํ†ตํ•ฉํ•˜๋Š” ์ƒˆ๋กœ์šด ๊ด€์ ๊ณผ ์‹ค์šฉ์ ์ธ ์†”๋ฃจ์…˜์„ ์ œ์‹œํ•˜๋Š” ๊ฐ•๋ ฅํ•œ ์—ฐ๊ตฌ์ด๋‹ค. Dual-mode training paradigm๊ณผ estimation-guided objective๋Š” ์ฐฝ์˜์ ์ด๋ฉฐ, ๋‹ค์–‘ํ•œ ์กฐ๊ฑด ์‹ ํ˜ธ์˜ ์œ ์—ฐํ•œ ์ฒ˜๋ฆฌ๋Š” ์‹ค์ œ ์• ํ”Œ๋ฆฌ์ผ€์ด์…˜์—์„œ ๋†’์€ ๊ฐ€์น˜๋ฅผ ๊ฐ€์ง„๋‹ค. ๋‹ค๋งŒ ์ƒ์„ธํ•œ ์ •๋Ÿ‰์  ํ‰๊ฐ€์™€ ๊ณ„์‚ฐ ํšจ์œจ์„ฑ ๋ถ„์„์˜ ๊ฐ•ํ™”๊ฐ€ ํ•„์š”ํ•˜๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •