Trinity: A Modular Humanoid Robot AI System

์ €์ž: Jingkai Sun, Qiang Zhang, Gang Han, Wen Zhao, Zhe Yong, Yan He, Jiaxu Wang, Jiahang Cao, Yijie Guo, Renjing Xu | ๋‚ ์งœ: 2025-03-11 | URL: https://arxiv.org/abs/2503.08338 📄 PDF


Essence

Figure 1

Fig. 1: Overview of the Modular Humanoid Robot AI System. In this system, task instructions are processed by both a visi

Trinity๋Š” LLM, VLM, RL์„ ๋ชจ๋“ˆ์‹ ๊ณ„์ธต ๊ตฌ์กฐ๋กœ ํ†ตํ•ฉํ•˜์—ฌ humanoid robot์„ ์ œ์–ดํ•˜๋Š” ์ข…ํ•ฉ AI ์‹œ์Šคํ…œ์ด๋‹ค. ๊ฐ ๋ชจ๋“ˆ์ด ๋…๋ฆฝ์ ์œผ๋กœ ์ตœ์ ํ™”๋˜๋ฉด์„œ๋„ ํ˜‘๋ ฅํ•˜์—ฌ ๋ณต์žกํ•œ ํ™˜๊ฒฝ์—์„œ humanoid robot์˜ ํšจ์œจ์ ์ธ ์ œ์–ด๋ฅผ ์‹คํ˜„ํ•œ๋‹ค.

Motivation

Achievement

Figure 1

Fig. 1: Overview of the Modular Humanoid Robot AI System. In this system, task instructions are processed by both a visi

1) ์ตœ์ดˆ์˜ ํ†ตํ•ฉ ์‹œ์Šคํ…œ: LLM, VLM, RL์„ humanoid robot์— ์ฒ˜์Œ์œผ๋กœ ํ†ตํ•ฉํ•˜์—ฌ full-scale humanoid robot ์ƒ์—์„œ ์ข…ํ•ฉ ๊ฒ€์ฆ์„ ์™„๋ฃŒํ•˜๊ณ  ์‹ค๋ฌด์  ํƒ€๋‹น์„ฑ์„ ์ž…์ฆํ–ˆ๋‹ค. 2) ๋ชจ๋“ˆ์‹ ๊ณ„์ธต ๊ตฌ์กฐ: interchangeable models์„ ํ†ตํ•ด ๋ณต์žกํ•œ ๋ฌธ์ œ๋ฅผ ๋ถ„ํ•ดยทํ•ด๊ฒฐํ•˜๋ฉฐ system flexibility์™€ scalability๋ฅผ ํ–ฅ์ƒ์‹œ์ผฐ๋‹ค. 3) ์‹œ์Šคํ…œ ํ•ด์„์„ฑ ๋ฐ ์•ˆ์ „์„ฑ: ๋‹ค์ค‘ ๋ชจ๋“ˆ ๊ฐ„ ์ƒํ˜ธ์ž‘์šฉ์œผ๋กœ interpretability๋ฅผ ๋ณด์žฅํ•˜์—ฌ human-robot interaction์˜ ์•ˆ์ „์„ฑ์„ ํ™•๋ณดํ–ˆ๋‹ค. 4) Loco-manipulation ์„ฑ๋Šฅ: locomotion policy์™€ manipulation network์˜ ๋ถ„๋ฆฌ๋ฅผ ํ†ตํ•ด lower limbs์™€ center of mass๋ฅผ ํ˜‘์กฐ์ ์œผ๋กœ ์กฐ์ •ํ•˜๋Š” ์„ฑ๋Šฅ์„ ๋‹ฌ์„ฑํ–ˆ๋‹ค.

How

Figure 2

Fig. 2: Overview of locomotion policy training. The state transitions sampled from

Originality

Limitation & Further Study

ํ›„์† ์—ฐ๊ตฌ ๋ฐฉํ–ฅ:

Evaluation

Novelty: 4/5 Technical Soundness: 4/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: Trinity๋Š” RL, LLM, VLM์„ ๋ชจ๋“ˆ์‹ ๊ณ„์ธต ๊ตฌ์กฐ๋กœ ํ†ตํ•ฉํ•˜์—ฌ humanoid robot์˜ ๋ณต์žกํ•œ ์ œ์–ด ๋ฌธ์ œ๋ฅผ ์ฒด๊ณ„์ ์œผ๋กœ ํ•ด๊ฒฐํ•˜๋Š” ํ˜์‹ ์ ์ธ ์ ‘๊ทผ๋ฒ•์„ ์ œ์‹œํ•œ๋‹ค. Full-scale humanoid robot์— ๋Œ€ํ•œ ์ข…ํ•ฉ ๊ฒ€์ฆ๊ณผ loco-manipulation ์„ฑ๋Šฅ์ด ์ฃผ์š” ๊ฐ•์ ์ด๋‚˜, ๋” ๊ด‘๋ฒ”์œ„ํ•œ ์ž‘์—…์— ๋Œ€ํ•œ ํ‰๊ฐ€์™€ sim-to-real transfer ์„ฑ๋Šฅ์˜ ๋ช…ํ™•ํ•œ ๋ถ„์„์ด ํ•„์š”ํ•˜๋‹ค. ์ „๋ฐ˜์ ์œผ๋กœ humanoid robotics ๋ถ„์•ผ์˜ ์ค‘์š”ํ•œ ์ง„์ „์„ ๋Œ€ํ‘œํ•˜๋Š” ์–‘์งˆ์˜ ์‹œ์Šคํ…œ ๋…ผ๋ฌธ์ด๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •