HumanoidGen: Data Generation for Bimanual Dexterous Manipulation via LLM Reasoning

Authors: Zhi Jing, Siyuan Yang, Jicong Ao, Ting Xiao, Yu-Gang Jiang, Chenjia Bai | Date: 2025-07-01 | URL: https://arxiv.org/abs/2507.00833


Essence

Figure 1: The overview of HumanoidGen. It includes spatial annotations, scene generation, constraint

HumanoidGen is a framework that uses LLM reasoning and atomic hand operations to automatically generate simulation data and demonstrations for bimanual dexterous manipulation by humanoid robots. MCTS-based reasoning enhancement improves its planning on long-horizon tasks and in cases where annotations are insufficient.
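To make the MCTS reference concrete, here is a minimal sketch of the textbook UCT selection rule that tree search commonly uses to choose which candidate plan step to explore next. This review does not state which selection rule HumanoidGen uses, so the rule itself, the `value`/`visits` fields, and the exploration constant `c` are all illustrative assumptions, not the authors' implementation.

```python
import math

def uct_score(total_value, visits, parent_visits, c=1.4):
    """Textbook UCT score: average value plus an exploration bonus.

    Assumption: this is the generic rule, not HumanoidGen's own code.
    """
    if visits == 0:
        return math.inf  # unvisited candidates are always tried first
    exploit = total_value / visits
    explore = c * math.sqrt(math.log(parent_visits) / visits)
    return exploit + explore

def select_child(children, parent_visits):
    """Pick the child (a hypothetical candidate plan step) with the highest UCT score."""
    return max(
        children,
        key=lambda ch: uct_score(ch["value"], ch["visits"], parent_visits),
    )

# Hypothetical candidates: names and statistics are made up for illustration.
candidates = [
    {"name": "step_a", "value": 3.0, "visits": 3},
    {"name": "step_b", "value": 1.0, "visits": 1},
    {"name": "step_c", "value": 0.0, "visits": 0},
]
chosen = select_child(candidates, parent_visits=4)
```

With these made-up statistics the unvisited candidate is chosen first. Among the visited ones, the less-explored candidate receives the larger exploration bonus, which is how the search avoids committing too early to one branch of a long plan.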

Motivation

Achievement

How

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 | Technical Soundness: 3/5 | Significance: 4/5 | Clarity: 4/5 | Overall: 4/5

Overall assessment: HumanoidGen presents a new approach to generating bimanual dexterous manipulation data for humanoid robots by combining LLM-based automation, atomic hand operation design, and MCTS-enhanced reasoning. Together with the HGen-Bench benchmark, it demonstrates that scaling the generated data improves performance, which gives it high practical value. However, the burden of manually authoring spatial annotations and the absence of sim-to-real validation limit its scalability.
