GENMO: A GENeralist Model for Human MOtion

์ €์ž: Jiefeng Li, Jinkun Cao, Haotian Zhang, Davis Rempe, Jan Kautz, Umar Iqbal, Ye Yuan | ๋‚ ์งœ: 2025-05-02 | URL: https://arxiv.org/abs/2505.01425 📄 PDF


Essence

Figure 1

Figure 1. GENMO unifies human motion estimation and generation in a single framework and supports diverse conditioning s

GENMO๋Š” ์ธ๊ฐ„ ๋™์ž‘ ์ถ”์ •๊ณผ ์ƒ์„ฑ์„ ๋‹จ์ผ ํ”„๋ ˆ์ž„์›Œํฌ์—์„œ ํ†ตํ•ฉํ•˜๋Š” generalist ๋ชจ๋ธ๋กœ, ๋™์ž‘ ์ถ”์ •์„ ์ œ์•ฝ ์กฐ๊ฑด์ด ์žˆ๋Š” ๋™์ž‘ ์ƒ์„ฑ์œผ๋กœ ์žฌ๊ตฌ์„ฑํ•˜์—ฌ ์ •ํ™•ํ•œ ์ถ”์ •๊ณผ ๋‹ค์–‘ํ•œ ์ƒ์„ฑ์„ ๋™์‹œ์— ๋‹ฌ์„ฑํ•œ๋‹ค.

Motivation

Achievement

Figure 1

Figure 1. GENMO unifies human motion estimation and generation in a single framework and supports diverse conditioning s

How

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: GENMO๋Š” ๋™์ž‘ ์ถ”์ •๊ณผ ์ƒ์„ฑ์˜ ์˜ค๋žซ๋™์•ˆ์˜ ๋ถ„๋ฆฌ๋ฅผ ํ˜์‹ ์ ์œผ๋กœ ํ†ตํ•ฉํ•˜๋Š” ์ฒซ ๋ฒˆ์งธ generalist ๋ชจ๋ธ๋กœ, dual-mode ํ›ˆ๋ จ๊ณผ estimation-guided ๋ชฉํ‘œ๋ฅผ ํ†ตํ•ด ๋‘ ์ž‘์—… ๊ฐ„ ์ƒ์Šน ํšจ๊ณผ๋ฅผ ํšจ๊ณผ์ ์œผ๋กœ ๋‹ฌ์„ฑํ•˜๋ฉฐ, ๋‹ค์–‘ํ•œ benchmark์—์„œ state-of-the-art ์„ฑ๋Šฅ์„ ์ž…์ฆํ•œ๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •