Interesting Scientific Idea Generation using Knowledge Graphs and LLMs: Evaluations with 100 Research Group Leaders

์ €์ž: Xuemei Gu, Mario Krenn | ๋‚ ์งœ: 2025-01-07 | DOI: 10.48550/arXiv.2405.17044 📄 PDF


Essence

Figure 1

Fig. 1. SciMuse suggests research ideas or collaborations using a knowledge graph and GPT-4. (a), Knowledge

SciMuse๋Š” 5,800๋งŒ ๊ฐœ์˜ ์—ฐ๊ตฌ๋…ผ๋ฌธ๊ณผ LLM์„ ํ™œ์šฉํ•˜์—ฌ ๊ฐœ์ธํ™”๋œ ์—ฐ๊ตฌ ์•„์ด๋””์–ด๋ฅผ ์ƒ์„ฑํ•˜๊ณ , 100๋ช… ์ด์ƒ์˜ ์—ฐ๊ตฌ ๊ทธ๋ฃน ๋ฆฌ๋”์˜ ํ‰๊ฐ€๋ฅผ ํ†ตํ•ด AI ์ƒ์„ฑ ์•„์ด๋””์–ด์˜ ํฅ๋ฏธ๋„๋ฅผ ์˜ˆ์ธกํ•˜๋Š” ์‹œ์Šคํ…œ์„ ์ œ์‹œํ•œ๋‹ค.

Motivation

Achievement

Figure 2

Fig. 2. Large-scale human evaluation within the Max Planck Society. (a)-(b), The map of Germany, based on the

๋Œ€๊ทœ๋ชจ ํ‰๊ฐ€ ๋ฐ์ดํ„ฐ ๊ตฌ์ถ•: 54๊ฐœ ๋ง‰์Šคํ”Œ๋ž‘ํฌ ์—ฐ๊ตฌ์†Œ์˜ 110๋ช… ์—ฐ๊ตฌ ๊ทธ๋ฃน ๋ฆฌ๋”๊ฐ€ 4,451๊ฐœ์˜ ๊ฐœ์ธํ™”๋œ ์•„์ด๋””์–ด๋ฅผ ํ‰๊ฐ€ํ•˜์—ฌ ์•ฝ 25%๊ฐ€ ํฅ๋ฏธ๋„ 4-5๋ฅผ ๋ฐ›์Œ

ํฅ๋ฏธ๋„ ์˜ˆ์ธก ๋ชจ๋ธ ๊ฐœ๋ฐœ: supervised neural network์™€ unsupervised zero-shot LLM ranking์„ ํ†ตํ•ด ์ƒˆ๋กœ์šด ์•„์ด๋””์–ด์˜ ํฅ๋ฏธ๋„๋ฅผ ์ •ํ™•ํžˆ ์˜ˆ์ธก ๊ฐ€๋Šฅ

Knowledge graph ํŠน์„ฑ ๋ถ„์„: 8๊ฐ€์ง€ knowledge graph ํŠน์„ฑ(๋…ธ๋“œ ์ค‘์‹ฌ์„ฑ, ์ธ์šฉ ์ง€ํ‘œ, semantic distance ๋“ฑ)๊ณผ ์—ฐ๊ตฌ์ž ํฅ๋ฏธ๋„ ๊ฐ„์˜ ์ƒ๊ด€๊ด€๊ณ„ ๊ทœ๋ช…

ํ•™์ œ ๊ฐ„ ํ˜‘๋ ฅ ๊ธฐํšŒ ๋ฐœ๊ตด: ๊ฐ™์€ ๋ถ„์•ผ ๋‚ด ํ˜‘๋ ฅ(institutional collaboration)๋ณด๋‹ค ์„œ๋กœ ๋‹ค๋ฅธ ๋ถ„์•ผ ๊ฐ„ ํ˜‘๋ ฅ ์•„์ด๋””์–ด๊ฐ€ ๋†’์€ ํฅ๋ฏธ๋„๋ฅผ ๋ณด์ž„

How

Figure 1

Fig. 1. SciMuse suggests research ideas or collaborations using a knowledge graph and GPT-4. (a), Knowledge

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ๋ณธ ์—ฐ๊ตฌ๋Š” AI ์ƒ์„ฑ ์—ฐ๊ตฌ ์•„์ด๋””์–ด์˜ ๊ฐ€์น˜๋ฅผ ๋Œ€๊ทœ๋ชจ ์‹ค์ฆ ํ‰๊ฐ€๋ฅผ ํ†ตํ•ด ๊ฒ€์ฆํ•œ ํš๊ธฐ์ ์ธ ๋…ผ๋ฌธ์ด๋ฉฐ, dual prediction ๋ฐฉ์‹๊ณผ knowledge graph ๊ธฐ๋ฐ˜ ์ฒด๊ณ„ํ™”๋ฅผ ํ†ตํ•ด ์‹ค์šฉ์„ฑ์„ ๋†’์˜€์œผ๋‚˜, ํ‰๊ฐ€์ž ๊ตฌ์„ฑ์˜ ๋ถˆ๊ท ํ˜•๊ณผ ์‹œ๊ฐ„์  ์ œ์•ฝ์ด ์ผ๋ฐ˜ํ™” ๊ฐ€๋Šฅ์„ฑ์„ ์ œํ•œํ•œ๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
Improving Scientific Hypothesis Generation with Knowledge Graphs ๋…ผ๋ฌธ์€ SciMuse์™€ ์œ ์‚ฌํ•˜๊ฒŒ ์ง€์‹๊ทธ๋ž˜ํ”„ ๊ธฐ๋ฐ˜ ๊ณผํ•™์  ์•„์ด๋””์–ด ์ƒ์„ฑ ์ ‘๊ทผ๋ฐฉ๋ฒ•์„ ์ฒด๊ณ„์ ์œผ๋กœ ์ •๋ฆฌํ•˜์—ฌ, ๋ณธ ๋…ผ๋ฌธ์˜ ๋ฐฉ๋ฒ•๋ก ์  ๊ทผ๊ฑฐ๊ฐ€ ๋ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
InfiAgent-DABench๋Š” ๋ฐ์ดํ„ฐ ๋ถ„์„ ์ž‘์—…์— ์žˆ์–ด์„œ LLM ๊ธฐ๋ฐ˜ ์•„์ด๋””์–ด(ํ˜น์€ ๊ฐ€์„ค) ์ƒ์„ฑ ๋ฐฉ๋ฒ•๋ก  ๋ฒค์น˜๋งˆํฌ๋ฅผ ์ œ๊ณตํ•ด, 434์—์„œ ์ œ์‹œํ•œ ์•„์ด๋””์–ด ์˜ˆ์ธก ์„ฑ๋Šฅ ๊ฒ€์ฆ์— ์ฐธ๊ณ ํ•  ๋งŒํ•ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
434 ๋…ผ๋ฌธ์€ LLM+์ง€์‹๊ทธ๋ž˜ํ”„ ๊ธฐ๋ฐ˜ ๊ฐœ์ธํ™” ๊ณผํ•™์•„์ด๋””์–ด ์ƒ์„ฑ ๋ฐ ์ธ๊ฐ„ํ‰๊ฐ€ ๋ฐฉ๋ฒ•๋ก ์„ ์ œ์‹œํ•ด, 518์˜ ๋‹ค์ค‘ ์—์ด์ „ํŠธ ํ˜‘์—…์‹ ์•„์ด๋””์–ด ์ƒ์„ฑ ๋ชจ๋ธ์˜ ์ด๋ก ์  ์ถœ๋ฐœ์ ์ด๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
์ง€์‹๊ทธ๋ž˜ํ”„ ๋ฐ LLM, ์—์ด์ „ํŠธ ๊ธฐ๋ฐ˜์˜ ๊ณผํ•™์  ์•„์ด๋””์–ด ์ƒ์„ฑ ๋ฐฉ๋ฒ•์„ ๋ณธ ๋…ผ๋ฌธ์˜ SciAgents ํ”„๋ ˆ์ž„์›Œํฌ ๋ฐœ์ „ ๋ฐฉํ–ฅ๊ณผ ์—ฐ๊ฒฐํ•ด๋ณผ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
434๋Š” ์ง€์‹๊ทธ๋ž˜ํ”„ ๊ธฐ๋ฐ˜์œผ๋กœ ๊ณผํ•™ ์•„์ด๋””์–ด๋ฅผ ์ƒ์„ฑํ•˜๋Š” ๋ฐฉ๋ฒ•๋ก ์„ ๋‹ค๋ฃจ์–ด, 132์—์„œ LLM+KG ์กฐํ•ฉ์˜ ๊ทผ๊ฐ„์ด ๋œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
LLM๊ณผ ์ง€์‹ ๊ทธ๋ž˜ํ”„๋ฅผ ํ™œ์šฉํ•œ ๊ณผํ•™ ์•„์ด๋””์–ด ์ƒ์„ฑ ๊ด€๋ จ ๋ฐฉ๋ฒ•๋ก ์˜ ์ด๋ก ์  ๊ธฐ์ดˆ๋ฅผ ์ œ๊ณตํ•˜๋ฏ€๋กœ ๊ฐ™์ด ๋ณด๋ฉด ์ข‹์Šต๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
์ง€์‹ ๊ทธ๋ž˜ํ”„ ๊ธฐ๋ฐ˜ ๊ณผํ•™ ์•„์ด๋””์–ด ์ƒ์„ฑ ๋ฐฉ์‹์— ์ง‘์ค‘ํ•œ ๋…ผ๋ฌธ์œผ๋กœ, ResearchLink์˜ ๋ฐฉ๋ฒ•๋ก ์  ๊ธฐ๋ฐ˜ ์‚ฌ๋ก€๋ฅผ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
์ธ๊ณผ ๊ด€๊ณ„๋ฅผ ํ™œ์šฉํ•œ ํ…์ŠคํŠธ ์ƒ์„ฑ ํ’ˆ์งˆ ํ–ฅ์ƒ์„ ๋‹ค๋ฃจ๋Š” ๊ด€๋ จ ์—ฐ๊ตฌ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
Knowledge graph๋ฅผ ํ™œ์šฉํ•œ ๊ณผํ•™ ์•„์ด๋””์–ด ์—ฐ๊ฒฐ/์ƒ์„ฑ ์—ฐ๊ตฌ๋กœ ๋…ผ๋ฌธ ๊ฐ„ ๊ด€๊ณ„ ์„ค๋ช… ๊ธฐ๋Šฅ์˜ ํƒ€ ์ ‘๊ทผ๋ฒ•์„ ๋ณด์—ฌ์ค€๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
Spacer ๋…ผ๋ฌธ์€ deliberate decontextualization ๋ฐฉ์‹์œผ๋กœ ์‹ ๊ฐœ๋… ๊ณผํ•™ ์•„์ด๋””์–ด ์ž๋™์ƒ์„ฑ์„ ์‹คํ˜„ํ•ด, knowledge graph+LLM ๊ธฐ๋ฐ˜ SciMuse ์ ‘๊ทผ์˜ ๋Œ€์•ˆ์ด ๋ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
434 ๋…ผ๋ฌธ์€ LLM๊ณผ ์ง€์‹๊ทธ๋ž˜ํ”„๋ฅผ ๊ฒฐํ•ฉํ•œ ๊ณผํ•™์  ์•„์ด๋””์–ด ์ƒ์„ฑ ๋ฐฉ์‹์œผ๋กœ, ๊ฐœ๋… ์ฒด๊ณ„(ontological regime) ์ˆ˜์ • ์ธก๋ฉด์—์„œ ์ฐธ๊ณ ๋  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
์„œ์ง€๊ณ„๋Ÿ‰ํ•™์  ๋„คํŠธ์›Œํฌ ๋ถ„์„์„ ์ถ”๊ฐ€์ ์ธ ๊ด€์ ์—์„œ ๋ฐœ์ „์‹œํ‚จ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
AI ๊ฐœ๋… ์ง€์‹ ๊ทธ๋ž˜ํ”„์—์„œ ๋งํฌ ์˜ˆ์ธก๊ณผ ๊ณผํ•™์  ์•„์ด๋””์–ด ํ™•์žฅ์— ๊ด€ํ•œ ์—ฐ๊ตฌ๋กœ, ๋ฏธ๋ž˜ AI ์—ฐ๊ตฌ ๋ฐฉํ–ฅ ์˜ˆ์ธก์˜ ์ถ”๊ฐ€์ ์ธ ์ ‘๊ทผ์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
518์€ ๋‹ค์ค‘ LLM ์—์ด์ „ํŠธ ํ˜‘์—…(VIRSCI)์œผ ๋กœ, 434์˜ LLM+์ง€์‹๊ทธ๋ž˜ํ”„ ๊ธฐ๋ฐ˜ ๊ฐœ์ธํ™” ์•„์ด๋””์–ด ์ƒ์„ฑ์˜ ์ง‘๋‹จ์ ยท์ƒํ˜ธ์ฐธ์กฐ ํ™•์žฅ ๋ชจ๋ธ์ด๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
Interesting scientific idea generation using knowledge graph ๋…ผ๋ฌธ์€ ์ƒ์„ฑํ˜• ์ ‘๊ทผ๋ฒ•์„ ๊ณผํ•™ ์•„์ด๋””์–ด ์ƒ์„ฑ/์žฌ๊ตฌ์„ฑ์— ์ ์šฉํ•ด Graphusion์˜ ์‹ค์ œ ํ™œ์šฉ ๋ฐฉํ–ฅ์„ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
Interesting Scientific Idea Generation using Knowledge Graph(434)์€ ์ฆ๊ฑฐ ๊ธฐ๋ฐ˜ ์•„์ด๋””์–ด ์ถ”์ฒœ ์ ‘๊ทผ์„ ํ™•์žฅํ•˜์—ฌ, 420์—์„œ ์ œ์•ˆํ•œ ์ฆ๊ฑฐ ์ค‘์‹ฌ ์ธ์šฉ ์ถ”์ฒœ์˜ ํ•™์ˆ ์  ํ™œ์šฉ๋„๋ฅผ ๋ณด์—ฌ์ค€๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
์ง€์‹๊ทธ๋ž˜ํ”„ ๊ธฐ๋ฐ˜์˜ ์ž๋™ ๊ณผํ•™ ์•„์ด๋””์–ด ์ƒ์„ฑ ๋ฐฉ๋ฒ•์„ ํ†ตํ•ด, AstroAgents์˜ ๋ฌธํ—Œ ๊ฒ€ํ† ์™€ ๋ฐ์ดํ„ฐ ํ•ด์„ ๊ธฐ๋Šฅ์„ ๋ณด์™„ํ•˜๋Š” ๋ฐฉ์‹์œผ๋กœ ํ™•์žฅ๋  ์ˆ˜ ์žˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
Interesting Scientific Idea Generation using Knowledge Graph ๋…ผ๋ฌธ์€ ๊ฒฝ์ œํ•™ ์ด์™ธ ๋ถ„์•ผ์—์„œ ์ง€์‹ ๊ทธ๋ž˜ํ”„ ๊ธฐ๋ฐ˜ ๊ฐ€์„ค ์ƒ์„ฑ๊ณผ ํ‰๊ฐ€๋ฌธ์ œ๋ฅผ ์‹ฌ์ธต ๋ถ„์„ํ•˜๋ฏ€๋กœ 631 ์ฃผ์ œ๋ฅผ ๋„“ํž ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
์ง€์‹๊ทธ๋ž˜ํ”„ ๊ธฐ๋ฐ˜ ์•„์ด๋””์–ด ์ƒ์„ฑ ์‹œ์Šคํ…œ ์—ฐ๊ตฌ๋กœ, LLM ๊ธฐ๋ฐ˜ ์•„์ด๋””์—์ด์…˜์„ ๊ตฌ์ฒด์  ๋ฐฉ๋ฒ•๋ก ์œผ๋กœ ๊ตฌํ˜„ํ•œ ์‚ฌ๋ก€์—ฌ์„œ 019์˜ ํ”„๋ ˆ์ž„์›Œํฌ์™€ ์ƒํ˜ธ๋Œ€ํ‘œ์„ฑ ํ™•์ธ ๊ฐ€๋Šฅ.
ํ›„์† ์—ฐ๊ตฌ
434์€ ์•„์ด๋””์–ด ์ƒ์„ฑ์— ๋…ผ๋ฌธ๊ณผ LLM์„ ๊ฒฐํ•ฉํ•˜๋Š” ์‹œ์Šคํ…œ(SciMuse)์œผ๋กœ, 425์˜ ๋ฐ์ดํ„ฐ ๋ฐ ์ž๋™ ๊ฒ€์ฆ ํ™œ์šฉ ์•„์ด๋””์–ด ์ƒ์„ฑ ํ”„๋ ˆ์ž„์›Œํฌ์˜ ํ™•์žฅ ์‚ฌ๋ก€์ด๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
434๋ฒˆ ๋…ผ๋ฌธ์€ ์ง€์‹๊ทธ๋ž˜ํ”„ ๊ธฐ๋ฐ˜ ๊ณผํ•™์  ์•„์ด๋””์–ด ์ƒ์„ฑยท์ถ”์ฒœ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์‹œํ•˜์—ฌ, 216๋ฒˆ์˜ ์ง€์‹๋ฒ ์ด์Šค๋ฅผ ์‘์šฉํ•  ์ˆ˜ ์žˆ๋Š” ๊ตฌ์กฐ์  ๋Œ€์•ˆ์„ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
VASPilot ๋…ผ๋ฌธ์€ ๋ฉ€ํ‹ฐ์—์ด์ „ํŠธ ๊ธฐ๋ฐ˜ ๊ฐ€์„ค ํƒ์ƒ‰ ๋ฐ ์ถ”์ฒœ ์‹œ์Šคํ…œ์„ ์‹ค์ œ ํ™”ํ•™ยท์žฌ๋ฃŒ ๋ถ„์•ผ์— ์ ์šฉํ•˜์—ฌ, SciMuse์˜ ๋Œ€ํ˜• ๋…ผ๋ฌธ DB ๊ธฐ๋ฐ˜ ์•„์ด๋””์–ด ์ƒ์„ฑ ๊ตฌ์กฐ๋ฅผ ํ™•์žฅ ํ™œ์šฉํ•ฉ๋‹ˆ๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
434๋Š” ์ง€์‹ ๊ทธ๋ž˜ํ”„๋ฅผ ์ด์šฉํ•œ ๊ณผํ•™ ์•„์ด๋””์–ด ์ƒ์„ฑ์—์„œ ๋Šฅ๋™์  ์งˆ๋ฌธ ์„ ํƒ ๋ฐฉ์‹์„ ์‹ค์ œ ์ฐฝ์˜์  ์•„์ด๋””์–ด ์ƒ์„ฑ์— ์ ์šฉํ•ฉ๋‹ˆ๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
Interesting Scientific Idea Generation ๋…ผ๋ฌธ ์—ญ์‹œ LLM๊ณผ ์ง€์‹๊ทธ๋ž˜ํ”„๋ฅผ ๊ฒฐํ•ฉํ•˜์—ฌ ๊ณผํ•™์  ์•„์ด๋””์–ด ์ƒ์„ฑ์„ ์ง€์›ํ•˜๋ฏ€๋กœ, HypoChainer์˜ ํ˜‘์—…์  ํƒ์ƒ‰ ๋ฐ ๊ฐ€์„ค์‚ฌ์Šฌ ๋ฐฉ๋ฒ•๊ณผ ์ง์ ‘์ ์œผ๋กœ ๋งž๋‹ฟ์•„ ์žˆ์Šต๋‹ˆ๋‹ค.
๋ฐ˜๋ก /๋น„ํŒ
409 ๋…ผ๋ฌธ์€ AI ์•„์ด๋””์–ด๊ฐ€ ์ธ๊ฐ„ ์ฐฝ์˜์„ฑยท๋‹ค์–‘์„ฑยท์ง„ํ™”์— ๋ฏธ์น˜๋Š” ์˜ํ–ฅ์„ ์‹คํ—˜์ ์œผ๋กœ ๋ถ„์„ํ•ด, 434์™€ ๋Œ€๋น„๋˜๋Š” ์ธ๊ฐ„-AI ์ฐฝ์˜์„ฑ ๋…ผ์Ÿ์— ๊ธฐ์—ฌํ•œ๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •