OmniScientist: Toward a Co-evolving Ecosystem of Human and AI Scientists

์ €์ž: Chenyang Shao, Dehao Huang, Yu Li, Keyu Zhao, Weiquan Lin, Yining Zhang, Qingbin Zeng, Zhiyu Chen, Tianxing Li, Yifei Huang, Taozhong Wu, Xinyang Liu, Ruotong Zhao, Mengsheng Zhao, Jiaoyang Li, Xuhua Zhang, Yue Wang, Yuanyi Zhen, Fengli Xu, Yong Li, Tie-Yan Liu | ๋‚ ์งœ: 2025-11-21 | URL: https://arxiv.org/abs/2511.16931 📄 PDF


Essence

๋ณธ ๋…ผ๋ฌธ์€ ๊ธฐ์กด AI Scientist ์‹œ์Šคํ…œ๋“ค์˜ ๊ณ ๋ฆฝ์ ์ด๊ณ  ๋…๋ฆฝ์ ์ธ ๋ฌธ์ œ ํ•ด๊ฒฐ ์ ‘๊ทผ๋ฐฉ์‹์˜ ํ•œ๊ณ„๋ฅผ ์ง€์ ํ•˜๊ณ , ์ธ๊ฐ„ ๊ณผํ•™ ์ธํ”„๋ผ์˜ ์‚ฌํšŒ์ ยทํ˜‘๋ ฅ์  ๋ฉ”์ปค๋‹ˆ์ฆ˜์„ ๋ช…์‹œ์ ์œผ๋กœ ์ธ์ฝ”๋”ฉํ•˜๋Š” OmniScientist ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์‹œํ•œ๋‹ค. ๋ฐ์ดํ„ฐ ๊ธฐ๋ฐ˜, ๋ฌธํ—Œ ๊ฒ€ํ† , ์—ฐ๊ตฌ ์•„์ด๋””์–ด ๋„์ถœ, ์‹คํ—˜ ์ž๋™ํ™”, ๊ณผํ•™ ์ €์ˆ , ๋…ผ๋ฌธ ๊ฒ€ํ† ์˜ ์ „ ์ฃผ๊ธฐ์— ๊ฑธ์ณ end-to-end ์ž๋™ํ™”๋ฅผ ๊ตฌํ˜„ํ•˜๋ฉด์„œ๋„, ๊ตฌ์กฐํ™”๋œ ์ง€์‹ ์‹œ์Šคํ…œ, ํ˜‘๋ ฅ ํ”„๋กœํ† ์ฝœ(OSP), ํ‰๊ฐ€ ํ”Œ๋žซํผ(ScienceArena)์„ ํ†ตํ•ด ์ธ๊ฐ„-AI ๊ณผํ•™์ž์˜ ๊ณต์ง„ํ™” ์ƒํƒœ๊ณ„๋ฅผ ์‹คํ˜„ํ•œ๋‹ค.

Motivation

Achievement

Figure 1

Figure 1: Overview of Our OmniScientist System

How

Figure 2

Figure 2: The Multi-Agent Refinement Pipeline (left) and the Refined Data Structure (right).

Originality

Limitation & Further Study

Evaluation

Novelty: 5/5 Technical Soundness: 4/5 Significance: 5/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: OmniScientist๋Š” ๊ธฐ์กด AI Scientist ์‹œ์Šคํ…œ์˜ ๊ณ ๋ฆฝ์„ฑ์„ ๊ทน๋ณตํ•˜๊ณ , ์ธ๊ฐ„ ๊ณผํ•™ ์ธํ”„๋ผ์˜ ์‚ฌํšŒ์ ยทํ˜‘๋ ฅ์  ๋ฉ”์ปค๋‹ˆ์ฆ˜์„ ๋ช…์‹œ์ ์œผ๋กœ ์ธ์ฝ”๋”ฉํ•จ์œผ๋กœ์จ AI ๊ณผํ•™์ž์˜ ์ง„ํ™”์— ๋Œ€ํ•œ ๊ทผ๋ณธ์ ์ธ ์‹œ๊ฐ ๋ณ€ํ™”๋ฅผ ์ œ์‹œํ•œ๋‹ค. End-to-end ์ž๋™ํ™”, ํ˜‘๋ ฅ ํ”„๋กœํ† ์ฝœ, ํ‰๊ฐ€ ์ƒํƒœ๊ณ„์˜ ํ†ตํ•ฉ์  ๊ตฌํ˜„์€ ๋†’์€ ์•ผ์‹ฌ์  ๋ชฉํ‘œ์ด๋ฉฐ, ์ด๋ฅผ ํ†ตํ•ด ์ธ๊ฐ„-AI ๊ณต์ง„ํ™” ์ƒํƒœ๊ณ„ ๊ตฌ์ถ•์˜ ๊ฐ€๋Šฅ์„ฑ์„ ๋ณด์—ฌ์ค€๋‹ค. ๋‹ค๋งŒ ์ œํ•œ๋œ ์‚ฌ๋ก€ ๊ฒ€์ฆ, LLM ์˜์กด์„ฑ ๋ฌธ์ œ, ์‹ค์ œ ์ธ๊ฐ„-AI ํ˜‘๋ ฅ์˜ large-scale validation ๋ถ€์กฑ ๋“ฑ์ด ๊ฐœ์„  ๊ณผ์ œ์ด๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
Towards AI for science๋Š” ์ธ๊ฐ„-๊ณผํ•™ ์ธํ”„๋ผ์™€ AI์˜ ํ†ตํ•ฉ์  ํ˜‘๋ ฅ ๊ตฌ์กฐ์— ๋Œ€ํ•œ ๊ฐœ๋…์  ๋…ผ์˜๋ฅผ ์ œ๊ณตํ•˜์—ฌ OmniScientist ๋กœ๋“œ๋งต์˜ ์ด๋ก ์  ๊ธฐ๋ฐ˜์ด ๋ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
Exploring collaboration mechanisms for llm agents ๋…ผ๋ฌธ์€ ์‚ฌํšŒ ์‹œ๋ฎฌ๋ ˆ์ด์…˜์—์„œ LLM ๊ธฐ๋ฐ˜ ์—์ด์ „ํŠธ ํ˜‘์—… ๊ตฌ์กฐ๋ฅผ ๋‹ค๋ฃจ์–ด OmniScientist์˜ ์ธ๊ฐ„-์—์ด์ „ํŠธ ์ƒํ˜ธ์ž‘์šฉ ์„ค๊ณ„์™€ ๋น„๊ต๊ฐ€ ๊ฐ€๋Šฅํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery ๋…ผ๋ฌธ์€ end-to-end ๊ณผํ•™ ์ž๋™ํ™”๋ผ๋Š” ์œ ์‚ฌ ๋ชฉํ‘œ๋ฅผ ๊ฐ–๊ณ  OmniScientist์™€ ๋‹ค๋ฅธ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
Sakana์˜ AI Scientist ํ‰๊ฐ€ ๋…ผ๋ฌธ์€ ์ž๋™ํ™” ์—ฐ๊ตฌ์‹œ์Šคํ…œ์˜ ์„ฑ๋Šฅ ๊ฒ€์ฆ ํ†ต์ฐฐ์„ ์ œ๊ณตํ•˜์—ฌ, ์ธ๊ฐ„-์ธ๊ณต์ง€๋Šฅ ํ˜‘์—…ํ˜• ํ”„๋ ˆ์ž„์›Œํฌ์™€ ์ฐจ๋ณ„์ ์„ ๋น„๊ตํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
OmniScientist ๋…ผ๋ฌธ์€ ์ธ๊ฐ„-AI ์‚ฌํšŒ์  ์ƒํ˜ธ์ž‘์šฉ ๋ฐ ์žฅ๊ธฐ๊ฐ„ ์‹œ๋ฎฌ๋ ˆ์ด์…˜์—์„œ ์—์ด์ „ํŠธ ์ผ๊ด€์„ฑ๊ณผ ํ•™์Šต์„ ๋ชจ๋ธ๋งํ•˜์—ฌ Vending-Bench์˜ ์žฅ๊ธฐ ์—์ด์ „ํŠธ ์ผ๊ด€์„ฑ ๋ถ„์„๊ณผ ๊ด€๋ จ ๊นŠ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
3389๋Š” ์ธ๊ฐ„-์—์ด์ „ํŠธ ๊ณต์ง„ํ™”์  ํ‰๊ฐ€ ์ƒํƒœ๊ณ„๋ฅผ ๋…ผ์˜ํ•˜๋Š” ์ฐจ์„ธ๋Œ€ ๋ฒค์น˜๋งˆํ‚น ์ ‘๊ทผ์œผ๋กœ, 090์˜ ๋ฒค์น˜๋งˆํฌ์™€ ๋Œ€๋น„๋œ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
3389(OmniScientist)๋Š” ์ธ๊ฐ„-์—์ด์ „ํŠธ ํ˜‘๋ ฅํ˜• ์ƒํƒœ๊ณ„ ๋ฐ ๋™์  ๊ฒ€์ƒ‰-์ƒ์„ฑ ์—์ด์ „ํŠธ ์„ค๊ณ„๋ฅผ ๋‹ค๋ฃจ์–ด, 295์˜ ๋‹ค์ค‘ ์—์ด์ „ํŠธ ์กฐ์ • ๊ตฌํ˜„์„ ํ™•์žฅํ•ฉ๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
์ธ๊ฐ„-AI ์ง€์‹ ์ฐฝ์ถœ์˜ ์žฅ๊ธฐ์ , ์ƒํƒœ๊ณ„ ๊ด€์ ์—์„œ ํ˜‘๋ ฅ/๊ณต์ง„ํ™” ๋ชจ๋ธ์„ ์ž๋™ํ™” ์—ฐ๊ตฌ ํ”Œ๋žซํ’ˆ๊ณผ ์—ฐ๊ฒฐํ•˜์—ฌ ๊ตฌ์ฒดํ™”ํ•œ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
๊ณผํ•™ ํƒ๊ตฌ ๋ถ„์•ผ์—์„œ LLM ๊ธฐ๋ฐ˜ AI Scientist ์‹œ์Šคํ…œ ์ „๋ฐ˜์„ ์ฒด๊ณ„์ ์œผ๋กœ ๊ฒ€ํ† ํ•˜๊ณ , OmniScientist ํ”„๋ ˆ์ž„์›Œํฌ์™€ ์œ ์‚ฌํ•œ ์‚ฌ๋ก€๋ฅผ ์ •๋ฆฌํ•ฉ๋‹ˆ๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
Vending-Bench ๋…ผ๋ฌธ์€ ์žฅ๊ธฐ ๊ณผ์ œ ์ˆ˜ํ–‰์˜ ์ผ๊ด€์„ฑ ํ‰๊ฐ€๋ฅผ ์œ„ํ•œ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์‹œํ•ด, AI/์ธ๊ฐ„ ํ˜‘์—… ๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ ์‹œ์Šคํ…œ์˜ ์‹คํ–‰ํšจ๊ณผ๋ฅผ ์ธก์ •ํ•˜๋Š”๋ฐ ํ™œ์šฉ๋ฉ๋‹ˆ๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •