RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints

์ €์ž: Yiran Qin, Li Kang, Xiufeng Song, Zhenfei Yin, Xiaohong Liu, Xihui Liu, Ruimao Zhang, Lei Bai | ๋‚ ์งœ: 2025-03-20 | URL: https://arxiv.org/abs/2503.16408 📄 PDF


Essence

Figure 1

Figure 1. When performing the task โ€œGrab the steak and use the camera to photograph it with 4 embodied agentsโ€, collabor

๋ณธ ๋…ผ๋ฌธ์€ ๋‹ค์ค‘ ๊ตฌ์ฒดํ™” ์—์ด์ „ํŠธ(embodied multi-agent) ์‹œ์Šคํ…œ์˜ ํ˜‘๋ ฅ์„ ์œ„ํ•ด ๋…ผ๋ฆฌ์ , ๊ณต๊ฐ„์ , ์‹œ๊ฐ„์  ์ œ์•ฝ์„ ์กฐํ•ฉํ•œ compositional constraints ๊ฐœ๋…์„ ์ œ์‹œํ•˜๊ณ , ์ด๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ์ž๋™ํ™”๋œ ๋ฐ์ดํ„ฐ ์ˆ˜์ง‘ ํ”„๋ ˆ์ž„์›Œํฌ RoboFactory๋ฅผ ๊ฐœ๋ฐœํ•˜์—ฌ ๋‹ค์ค‘ ์—์ด์ „ํŠธ ์กฐ์ž‘ ๋ฒค์น˜๋งˆํฌ๋ฅผ ์ œ๊ณตํ•œ๋‹ค.

Motivation

Achievement

Figure 2

Figure 2. Overview of RoboFactory. Given the global task description, prior information, and observations, RoboBrain gen

How

Figure 2

Figure 2. Overview of RoboFactory. Given the global task description, prior information, and observations, RoboBrain gen

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ๋ณธ ๋…ผ๋ฌธ์€ ๋‹ค์ค‘ ์—์ด์ „ํŠธ ๋กœ๋ณดํ‹ฑ ํ˜‘๋ ฅ์˜ ํ•ต์‹ฌ ๋„์ „ ๋ฌธ์ œ๋ฅผ compositional constraints๋กœ ์šฐ์•„ํ•˜๊ฒŒ ํ•ด๊ฒฐํ•˜๊ณ , ์ฒซ ๋ฒˆ์งธ ๋ฒค์น˜๋งˆํฌ๋ฅผ ์ œ๊ณตํ•จ์œผ๋กœ์จ ๋‹ค์ค‘ ์—์ด์ „ํŠธ embodied AI ์—ฐ๊ตฌ์˜ ์ค‘์š”ํ•œ ๊ธฐ์ดˆ๋ฅผ ๋งˆ๋ จํ–ˆ๋‹ค. ๋‹ค๋งŒ ์‹ค์ œ ๋กœ๋ด‡ ๊ฒ€์ฆ๊ณผ ํ™•์žฅ์„ฑ ๋ถ„์„์ด ์ถ”๊ฐ€๋˜๋ฉด ๋”์šฑ ๊ฐ•๋ ฅํ•œ ๊ธฐ์—ฌ๊ฐ€ ๋  ๊ฒƒ์ด๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •