PhysHMR: Learning Humanoid Control Policies from Vision for Physically Plausible Human Motion Reconstruction

์ €์ž: Qiao Feng, Yiming Huang, Yufu Wang, Jiatao Gu, Lingjie Liu | ๋‚ ์งœ: 2025-10-02 | URL: https://arxiv.org/abs/2510.02566 📄 PDF


Essence

Figure 1

Fig. 1. Given a monocular video (a), (b) kinematic-based methods (e.g., GVHMR [Shen et al. 2024]) often cannot produce p

PhysHMR์€ ๋ชจ๋…ธํ˜๋Ÿฌ ๋น„๋””์˜ค๋กœ๋ถ€ํ„ฐ ๋ฌผ๋ฆฌ์ ์œผ๋กœ ํƒ€๋‹นํ•œ ์ธ๊ฐ„ ๋™์ž‘ ์žฌ๊ตฌ์„ฑ์„ ์œ„ํ•ด ๋น„์ „-๊ธฐ๋ฐ˜ ํœด๋จธ๋…ธ์ด๋“œ ์ œ์–ด ์ •์ฑ…์„ ์ง์ ‘ ํ•™์Šตํ•˜๋Š” ํ†ตํ•ฉ ํ”„๋ ˆ์ž„์›Œํฌ์ด๋‹ค. ๊ธฐ์กด์˜ ๋‘ ๋‹จ๊ณ„ ๋ฐฉ์‹(์šด๋™ํ•™ ๊ธฐ๋ฐ˜ ์ถ”์ • + ๋ฌผ๋ฆฌ ํ›„์ฒ˜๋ฆฌ)๊ณผ ๋‹ฌ๋ฆฌ, ์‹œ๊ฐ ์ •๋ณด์™€ ๋ฌผ๋ฆฌ ์ œ์•ฝ์„ ๋‹จ์ผ ์ •์ฑ… ๋„คํŠธ์›Œํฌ์—์„œ ํ•จ๊ป˜ ์ถ”๋ก ํ•œ๋‹ค.

Motivation

Achievement

Figure 1

Fig. 1. Given a monocular video (a), (b) kinematic-based methods (e.g., GVHMR [Shen et al. 2024]) often cannot produce p

How

Figure 2

Fig. 2 provides an overview of our method. Given a monocular

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: PhysHMR์€ ์‹œ๊ฐ-๊ธฐ๋ฐ˜ ์ œ์–ด์™€ ๋ฌผ๋ฆฌ ์ถ”๋ก ์„ ํ†ตํ•ฉํ•˜๋Š” ์ฐฝ์˜์  ์ ‘๊ทผ์œผ๋กœ ๋ชจ๋…ธํ˜๋Ÿฌ ๋น„๋””์˜ค ๊ธฐ๋ฐ˜ ์ธ๊ฐ„ ๋™์ž‘ ์žฌ๊ตฌ์„ฑ์˜ ๊ทผ๋ณธ์  ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•œ๋‹ค. ์šฐ์ˆ˜ํ•œ ๋ฌผ๋ฆฌ์  ํƒ€๋‹น์„ฑ ๊ฐœ์„ ๊ณผ ์‹ค์งˆ์  ์‘์šฉ ๊ฐ€์น˜๋กœ ์ปดํ“จํ„ฐ ๋น„์ „๊ณผ ๊ทธ๋ž˜ํ”ฝ์Šค ๋ถ„์•ผ์— ์˜๋ฏธ ์žˆ๋Š” ๊ธฐ์—ฌ๋ฅผ ํ•œ๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •