TOP: Time Optimization Policy for Stable and Accurate Standing Manipulation with Humanoid Robots

์ €์ž: Zhenghan Chen, Haocheng Xu, Haodong Zhang, Liang Zhang, He Li, Dongqi Wang, Jiyu Yu, Yifei Yang, Zhongxiang Zhou, Rong Xiong | ๋‚ ์งœ: 2025-08-01 | URL: https://arxiv.org/abs/2508.00355 📄 PDF


Essence

Figure 2

Fig. 2: The overall architecture. (A) Training a latent code zt based on VAE structure to represent diverse upper-body m

์ด ๋…ผ๋ฌธ์€ ํœด๋จธ๋…ธ์ด๋“œ ๋กœ๋ด‡์˜ ์•ˆ์ •์ ์ธ ์„œ์„œํ•˜๊ธฐ ์กฐ์ž‘์„ ์œ„ํ•ด ์ƒ์ฒด ๋™์ž‘์˜ ์‹œ๊ฐ„ ๊ถค์ ์„ ์ตœ์ ํ™”ํ•˜๋Š” Time Optimization Policy (TOP)์„ ์ œ์•ˆํ•œ๋‹ค. ์ƒ์ฒด์˜ ๋น ๋ฅธ ์›€์ง์ž„์œผ๋กœ ์ธํ•œ ๋ชจ๋ฉ˜ํ…€์„ ์ค„์—ฌ ๊ท ํ˜•, ์ •ํ™•์„ฑ, ์‹œ๊ฐ„ ํšจ์œจ์„ฑ์„ ๋™์‹œ์— ๋‹ฌ์„ฑํ•œ๋‹ค.

Motivation

Achievement

Figure 1

Fig. 1: Illustration of different methods. A: Whole-body RL

How

Figure 2

Fig. 2: The overall architecture. (A) Training a latent code zt based on VAE structure to represent diverse upper-body m

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ์ด ๋…ผ๋ฌธ์€ ์ƒ์ฒด ๋™์ž‘ ์‹œ๊ฐ„ ์ตœ์ ํ™”๋ผ๋Š” ์ง๊ด€์ ์ด๋ฉด์„œ๋„ ํšจ๊ณผ์ ์ธ ์•„์ด๋””์–ด๋กœ ํœด๋จธ๋…ธ์ด๋“œ ์„œ์„œํ•˜๊ธฐ ์กฐ์ž‘์˜ ์•ˆ์ •์„ฑ-์ •ํ™•์„ฑ-ํšจ์œจ์„ฑ trade-off ๋ฌธ์ œ๋ฅผ ์ฐฝ์˜์ ์œผ๋กœ ํ•ด๊ฒฐํ•œ๋‹ค. ์ด๋ก ๊ณผ ์‹คํ—˜์ด ์ž˜ ๊ฒฐํ•ฉ๋˜์–ด ์žˆ์œผ๋ฉฐ, humanoid ๋กœ๋ด‡ ์ œ์–ด ๋ถ„์•ผ์— ์‹ค์งˆ์ ์ธ ๊ธฐ์—ฌ๋ฅผ ์ œ๊ณตํ•œ๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •