OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

์ €์ž: Mengkang Hu, Yuhang Zhou, Wendong Fan, Yuzhou Nie, Bowei Xia, Tao Sun, Ziyu Ye, Zhaoxuan Jin, Yingru Li, Qiguang Chen, Zeyu Zhang, Yifeng Wang, Qianshuo Ye, Bernard Ghanem, Ping Luo, Guohao Li | ๋‚ ์งœ: 2025-06-11 | DOI: 10.48550/arXiv.2505.23885 📄 PDF


Essence

Figure 2

Figure 2: WORKFORCE์™€ OWL์˜ ๊ฐœ์š”. ๊ธฐ์กด ์ ‘๊ทผ๊ณผ ๋‹ฌ๋ฆฌ ์ƒˆ ๋„๋ฉ”์ธ ์ ์‘ ์‹œ ์ „์ฒด ์žฌํ•™์Šต ์—†์ด ๋ชจ๋“ˆ์‹ ํ™•์žฅ ๊ฐ€๋Šฅ

LLM ๊ธฐ๋ฐ˜ ๋‹ค์ค‘ ์—์ด์ „ํŠธ ์‹œ์Šคํ…œ์—์„œ ๋„๋ฉ”์ธ๋ณ„ ํŠนํ™”๋œ ์„ค๊ณ„๋กœ ์ธํ•œ ์ด์‹์„ฑ ๋ถ€์กฑ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด, ์ „๋žต ๊ณ„ํš(Planner)๊ณผ ๋„๋ฉ”์ธ ํŠนํ™” ์‹คํ–‰(Worker)์„ ๋ถ„๋ฆฌํ•œ ๋ชจ๋“ˆ์‹ WORKFORCE ํ”„๋ ˆ์ž„์›Œํฌ์™€ ์ด๋ฅผ ์ตœ์ ํ™”ํ•˜๋Š” OWL ํ•™์Šต ํŒจ๋Ÿฌ๋‹ค์ž„์„ ์ œ์•ˆํ•œ๋‹ค.

Motivation

Achievement

Figure 1

Figure 1: GAIA ๋ฒค์น˜๋งˆํฌ์—์„œ WORKFORCE์™€ OWL์˜ ์„ฑ๋Šฅ ๋น„๊ต. ์ƒ์šฉ ์‹œ์Šคํ…œ OpenAI Deep Research ๋Šฅ๊ฐ€

  1. ์ตœ์ฒจ๋‹จ ์„ฑ๋Šฅ: WORKFORCE๋Š” GAIA ๋ฒค์น˜๋งˆํฌ์—์„œ 69.70% ์ •ํ™•๋„ ๋‹ฌ์„ฑ, OpenAI Deep Research (55.15%) ๋Œ€๋น„ 2.34% ์ดˆ๊ณผ, ๊ธฐ์กด ์˜คํ”ˆ์†Œ์Šค SOTA (67.46%) ๋Šฅ๊ฐ€
  2. ํšจ์œจ์  ํ•™์Šต: OWL๋กœ ํ›ˆ๋ จ๋œ Qwen2.5-32B-Instruct ๋ชจ๋ธ์ด 52.73% (+16.37%) ๋‹ฌ์„ฑ, GPT-4o-mini (47.27%), Qwen2.5-72B-Instruct (49.09%) ์ดˆ๊ณผ. GAIA ๋ฐ์ดํ„ฐ ๋ฏธ์‚ฌ์šฉ ํ•™์Šต์œผ๋กœ๋„ ์ผ๋ฐ˜ํ™” ๋Šฅ๋ ฅ ์ž…์ฆ
  3. ๋ชจ๋“ˆ์‹ ํ™•์žฅ์„ฑ: ์ƒˆ ๋„๋ฉ”์ธ ์ ์‘ ์‹œ Worker nodes๋งŒ ์ถ”๊ฐ€/์ˆ˜์ •ํ•˜๋ฉด ๋˜๋ฏ€๋กœ ์žฌ์„ค๊ณ„ ๋ฐ ์žฌํ›ˆ๋ จ ์ตœ์†Œํ™”

How

Figure 3

Figure 3: WORKFORCE ํ”„๋ ˆ์ž„์›Œํฌ ๊ฐœ์š”. Planner, Coordinator, Worker Pool์˜ ๊ณ„์ธต์  ๊ตฌ์กฐ

์ถ”๋ก  ๋ฉ”์ปค๋‹ˆ์ฆ˜ (Inference):

ํ›ˆ๋ จ ๋ฉ”์ปค๋‹ˆ์ฆ˜ (OWL):

Originality

Limitation & Further Study

Evaluation

์ดํ‰: WORKFORCE์™€ OWL์€ ๋‹ค์ค‘ ์—์ด์ „ํŠธ ์‹œ์Šคํ…œ์˜ ๋„๋ฉ”์ธ ๊ฐ„ ์ด์‹์„ฑ ๋ฌธ์ œ์— ์šฐ์•„ํ•œ ๋ชจ๋“ˆ์‹ ํ•ด๊ฒฐ์ฑ…์„ ์ œ์‹œํ•˜๋ฉฐ, GAIA ๋ฒค์น˜๋งˆํฌ์—์„œ ์ƒ์šฉ ์‹œ์Šคํ…œ์„ ์ดˆ๊ณผํ•˜๋Š” ์„ฑ๋Šฅ์„ ๋‹ฌ์„ฑํ–ˆ๋‹ค๋Š” ์ ์—์„œ ์‹ค์งˆ์  ๊ธฐ์—ฌ๊ฐ€ ์žˆ๋‹ค. ๋‹ค๋งŒ Worker ์„ค๊ณ„ ์ผ๋ฐ˜ํ™”, ํ•™์Šต ๋ฉ”์ปค๋‹ˆ์ฆ˜ ์ƒ์„ธํ™”, ๋ณด๋‹ค ๋‹ค์–‘ํ•œ ๋„๋ฉ”์ธ ๊ฒ€์ฆ ๋“ฑ์ด ์ถ”๊ฐ€๋˜๋ฉด ์˜ํ–ฅ๋ ฅ์ด ๋” ๋†’์•„์งˆ ๊ฒƒ์œผ๋กœ ์˜ˆ์ƒ๋œ๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
OWL์˜ ๋‹ค์ค‘์—์ด์ „ํŠธ WORKFORCE ๊ตฌ์กฐ ์„ค๊ณ„๊ฐ€ AutoGen์˜ ๊ธฐ๋ณธ ์—์ด์ „ํŠธ ํ˜‘์—… ํ”„๋ ˆ์ž„์›Œํฌ์— ๋ฐ”ํƒ•ํ•ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
์—์ด์ „ํŠธ ๋ฉ”๋ชจ๋ฆฌ ๊ตฌ์กฐ์™€ ์›Œํฌํฌ์Šค ์—ญํ•  ๋ถ„๋ฆฌ๋ฅผ ๋‹ค๋ฃจ๋Š” Agentic Memory ๋…ผ๋ฌธ์ด ์„ค๊ณ„์  ๋ชจ๋“ˆํ™” ๋ฐ ๊ธฐ์–ต ์ฒด๊ณ„ ๊ฐœ๋ฐœ์˜ ํ† ๋Œ€๊ฐ€ ๋จ.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
596์€ Agentic AI์˜ ๋ฉ€ํ‹ฐ์—์ด์ „ํŠธ ํ˜‘๋ ฅ๊ณผ ์›Œํฌํ”Œ๋กœ ์ตœ์ ํ™” ์ด๋ก ์„ ์ œ์‹œํ•ด, 864์˜ ์™„์ „ ์ž๋™ VASP ๊ณ„์‚ฐ ์‹œ์Šคํ…œ์˜ ์„ค๊ณ„์— ์ฐธ๊ณ ๊ฐ€ ๋ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
Prompt ์ผ๊ด€์„ฑ ๋ฐ self-consistency ์œ ๋„ ๋ฐฉ๋ฒ•์ด ๋‹ค์ค‘์—์ด์ „ํŠธ ์‹œ์Šคํ…œ์˜ ๊ณ„ํš๊ณผ ๊ธฐ์–ต ์„ฑ๋Šฅ ๋ณด์™„์— ๋Œ€์•ˆ์ด ๋จ.
๋‹ค๋ฅธ ์ ‘๊ทผ
LLM ๊ธฐ๋ฐ˜ ์—์ด์ „ํŠธ ํ˜‘์—…/๋ถ„์—… ๋ฐ ์‚ฌํšŒ์  ์ƒํ˜ธ์ž‘์šฉ ๋ฉ”์ปค๋‹ˆ์ฆ˜์— ๋Œ€ํ•œ ๋น„๊ต์  ์ ‘๊ทผ์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
MechAgents ๋…ผ๋ฌธ์€ ํŠนํ™”๋œ ๊ณตํ•™ ๋ฉ”์ปค๋‹ˆ์ฆ˜ ์„ค๊ณ„ ํ˜‘์—…์‹œ์Šคํ…œ์œผ๋กœ, OWL์˜ ์ผ๋ฐ˜์  ๋‹ค์ค‘์—์ด์ „ํŠธ ์ž‘์—… ๋ณด์กฐ ๊ตฌ์กฐ์™€ ๋น„๊ตํ•ด ๋ณผ ์ˆ˜ ์žˆ๋Š” ์ฐจ๋ณ„์  ์‚ฌ๋ก€์ž…๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋ฉ€ํ‹ฐ์—์ด์ „ํŠธ LLM ์‹œ์Šคํ…œ์˜ ํ”Œ๋ž˜๋‹ ๋ฉ”์ปค๋‹ˆ์ฆ˜ ๋ฐ ํ˜‘์—… ๊ตฌ์กฐ๋ฅผ ๋‹ค์–‘ํ•œ ์‹œ๊ฐ์—์„œ ๋น„๊ตํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
WORKFORCE์™€ ์œ ์‚ฌํ•œ ๋ชจ๋“ˆํ˜• ๋ฉ€ํ‹ฐ์—์ด์ „ํŠธ ์ƒํ˜ธ์ž‘์šฉ ๋ฐ ๊ณ„ํš ์ˆ˜๋ฆฝ ๋ฐฉ๋ฒ•์˜ ์ตœ์‹  ๋™ํ–ฅ์„ ๋ณผ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
PlanGenLLMs๋Š” OWL์ด ์ œ์•ˆํ•œ ํ”Œ๋ž˜๋‹ ๋ถ„๋ฆฌ ๊ธฐ๋ฒ•๊ณผ ํ”Œ๋ž˜๋„ˆ ์›Œ์ปค ๊ตฌ์กฐ ๋“ฑ LLM ๊ธฐ๋ฐ˜ ์—์ด์ „ํŠธ ํ”Œ๋ž˜๋‹ ํ‰๊ฐ€๋ฅผ ํ™•๋Œ€ํ•ฉ๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
OWL ๋…ผ๋ฌธ์€ ๋‹ค์–‘ํ•œ ๋ถ„์•ผ์—์„œ ์ผ๋ฐ˜๋ชฉ์  ๋ฉ€ํ‹ฐ์—์ด์ „ํŠธ AI ํ•™์Šต ๋ฐ ์›Œํฌ๋กœ๋“œ ์Šค์ผ€์ค„๋ง์„ ์‹œ๋„ํ•˜์—ฌ Biomni ์‹œ์Šคํ…œ์˜ ์ ์šฉ๋ฒ”์œ„ ํ™•์žฅ์— ์‹œ์‚ฌ์ ์„ ์ค€๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
OWL ํ”„๋ ˆ์ž„์›Œํฌ๊ฐ€ ์‹ค์งˆ์  ๋ฉ€ํ‹ฐ์—์ด์ „ํŠธ ํ”Œ๋ž˜๋‹ ํ‰๊ฐ€์ฒด๊ณ„๋ฅผ ์–ด๋–ป๊ฒŒ ๊ตฌ์ฒดํ™”ํ•˜๋Š”์ง€ PlanGenLLMs์˜ ํ‰๊ฐ€ ํฌ์ธํŠธ์™€ ์—ฐ๊ณ„ ๊ฐ€๋Šฅํ•ฉ๋‹ˆ๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •