Predicting new research directions in materials science using large language models and concept graphs

์ €์ž: Thomas Marwitz, Alexander Colsmann, Ben Breitung, Christoph Brabec, Christoph Kirchlechner, Eva Blasco, Gabriel Cadilha Marques, Horst Hahn, Michael Hirtz, Pavel A. Levkin, Yolita M. Eggeler, Tobias Schlรถder, Pascal Friederich | ๋‚ ์งœ: 2026-04-01 | DOI: 10.1038/s42256-026-01206-y 📄 PDF


Essence

Figure 1

Fig. 1 | Generation of labelled data. Manual labelling (concept extraction) of 100 abstracts, fine-tuning of an LLM-base

LLM์„ ์ด์šฉํ•ด ์žฌ๋ฃŒ๊ณผํ•™ ๋…ผ๋ฌธ ์ดˆ๋ก์—์„œ ์˜๋ฏธ๋ก ์  ๊ฐœ๋… ๊ทธ๋ž˜ํ”„๋ฅผ ๊ตฌ์ถ•ํ•˜๊ณ , ๋จธ์‹ ๋Ÿฌ๋‹์œผ๋กœ ์—ญ์‚ฌ์  ๋ฐ์ดํ„ฐ๋ฅผ ํ•™์Šตํ•˜์—ฌ ๋ฏธ๋ž˜์˜ ์ƒˆ๋กœ์šด ์—ฐ๊ตฌ ์ฃผ์ œ ์กฐํ•ฉ์„ ์˜ˆ์ธกํ•˜๋Š” ์‹œ์Šคํ…œ์„ ์ œ์•ˆํ•œ๋‹ค.

Motivation

Achievement

Figure 2

Fig. 2 | Map of materials science. Two-dimensional UMAP25 projection of all extracted concepts with the highest-degree c

How

Figure 1

Fig. 1 | Generation of labelled data. Manual labelling (concept extraction) of 100 abstracts, fine-tuning of an LLM-base

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 4/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: LLM ๊ธฐ๋ฐ˜ ๊ฐœ๋… ์ถ”์ถœ๊ณผ semantic embedding์„ ๊ฒฐํ•ฉํ•œ novelํ•œ ์ ‘๊ทผ์œผ๋กœ, ์žฌ๋ฃŒ๊ณผํ•™ ๋ถ„์•ผ์˜ ์ฐฝ์˜์  ์—ฐ๊ตฌ ๋ฐฉํ–ฅ ๋ฐœ๊ตด์„ ์ฒด๊ณ„์ ์œผ๋กœ ์ง€์›ํ•˜๋Š” ์‹ค์šฉ์ ์ด๊ณ  ์˜๋ฏธ ์žˆ๋Š” ์—ฐ๊ตฌ์ด๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
๋Œ€ํ˜•์–ธ์–ด๋ชจ๋ธ์„ ํ†ตํ•œ ์˜๋ฏธ๋ก ์  ๊ฐœ๋…์ง€๋„ ๊ตฌ์ถ• ๋ฐ ์‹ ๊ณผํ•™์  ๊ฐ€์„ค ์˜ˆ์ธก์˜ ์ด๋ก ์  ๋…ผ์˜๊ฐ€ ์žฌ๋ฃŒ๊ณผํ•™ ๋ฏธ๋ž˜ ์ฃผ์ œ ์˜ˆ์ธก ์‹œ์Šคํ…œ์˜ ๊ธฐ์ดˆ์™€ ์—ฐ๊ฒฐ๋ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
AI/LLM์„ ํ™œ์šฉํ•ด ํ•™๋ฌธ ๊ฐ„ ์ฐฝ์˜์  ์ฃผ์ œ ์กฐํ•ฉ ๋ฐ ์‹ ๊ทœ ์—ฐ๊ตฌ ๋ฐฉํ–ฅ ์ œ์•ˆ ๋ฉ”์ปค๋‹ˆ์ฆ˜์„ ์ƒ์„ธํžˆ ๋ถ„์„ํ•˜์—ฌ ๋ฏธ๋ž˜ ์—ฐ๊ตฌ ์ฃผ์ œ ์˜ˆ์ธก ์‹œ์Šคํ…œ์˜ ์ด๋ก  ํ† ๋Œ€๋ฅผ ์ œ๊ณตํ•œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
Forecasting the future of artificial intelligence with machine learning ๋…ผ๋ฌธ์€ AI ๊ธฐ๋ฐ˜ ๋ฏธ๋ž˜ ์—ฐ๊ตฌ ์ฃผ์ œ ์˜ˆ์ธก์„ ๋˜๋‹ค๋ฅธ ๋„๋ฉ”์ธ(๊ณผํ•™๋„ ์•„๋‹Œ AI ์ž์ฒด)์—์„œ ์‹ค์ฆ์ ์œผ๋กœ ๋‹ค๋ฃจ์–ด, LLM ์‘์šฉ์˜ ๊ฒฐ๊ณผ๋ฌผ ์˜ˆ์ธก ์ธก๋ฉด์„ ๋น„๊ตํ•  ์ˆ˜ ์žˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
AI ๊ธฐ๋ฐ˜ ๋…ผ๋ฌธยท์—ฐ๊ตฌ ๊ฒ€ํ†  ๊ฒฐ๊ณผ ์ž๋™ํ™” ๋ฐ ์žฌํ˜„์„ฑ ์ง€์› ์‹œ์Šคํ…œ์˜ ๋‹ค๋ฅธ ์‘์šฉ ์‚ฌ๋ก€๋กœ ๋ฏธ๋ž˜ ์—ฐ๊ตฌ ์ฃผ์ œ ์˜ˆ์ธก๊ณผ ํ‰๊ฐ€ ์ž๋™ํ™”์˜ ์ฐจ๋ณ„์„ฑ์„ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
LLM์˜ ์—ฐ๊ตฌ ์•„์ด๋””์–ด ์ƒ์„ฑ, ํ’ˆ์งˆ ํ‰๊ฐ€, ํ˜์‹ ์„ฑ ํƒ์ง€๊นŒ์ง€ ๋‹ค๋ฃจ์–ด ๋…ผ๋ฌธ์— ์ œ์‹œ๋œ ์ƒˆ๋กœ์šด ์กฐํ•ฉ ์˜ˆ์ธก์— ์ž๋™ํ™”๋œ ๋น„ํŒ์  ์‚ฌ๊ณ ๋ฅผ ๊ทผ๊ฑฐ๋กœ ๋ณด์™„ํ•  ์ˆ˜ ์žˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
Scientific hypothesis generation by large language models ๋…ผ๋ฌธ์€ LLM ๊ธฐ๋ฐ˜ ๊ณผํ•™์  ์—ฐ๊ตฌ ์ฃผ์ œ ๋ฐ ์กฐํ•ฉ ์˜ˆ์ธก์„ ์‹œ๋„ํ•œ ์—ฐ๊ตฌ๋กœ์„œ, 3212์˜ ์žฌ๋ฃŒ๊ณผํ•™ ํŠนํ™” ์•„์ด๋””์–ด ์˜ˆ์ธก๊ณผ ๋น„๊ตํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
3212๋ฒˆ ๋…ผ๋ฌธ์€ ์†Œ์žฌ๊ณผํ•™์—์„œ ์ƒˆ๋กœ์šด ์—ฐ๊ตฌ ๋ฐฉํ–ฅ์„ ์˜ˆ์ธกํ•˜๋Š” ๋ฐฉ๋ฒ•๋ก ์„ ๋‹ค๋ฃจ์–ด 962๋ฒˆ ๋…ผ๋ฌธ๊ณผ ์œ ์‚ฌ ๋ฌธ์ œ์— ๋Œ€ํ•œ ๋Œ€์•ˆ์  ์‹œ๊ฐ์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
3388์€ LLM ๊ธฐ๋ฐ˜ ๋ฏธ๋ž˜ ์—ฐ๊ตฌ ์ฃผ์ œ ์˜ˆ์ธก ๋ฐ ๋…ผ๋ฌธ์˜ ์ž„ํŒฉํŠธ ํ‰๊ฐ€ ๋ฐฉ๋ฒ•์„ ์†Œ๊ฐœํ•˜์—ฌ, 3212๊ฐ€ ์ œ์•ˆํ•˜๋Š” ์ƒˆ๋กœ์šด ์—ฐ๊ตฌ ๋ฐฉํ–ฅ ์˜ˆ์ธก ์‹œ์Šคํ…œ๊ณผ ๋ฐฉ๋ฒ•๋ก  ๋ฉด์—์„œ ๋น„๊ต๋ฉ๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
494๋Š” ์—ฐ๊ตฌ ์•„์ด๋””์–ด ์ƒ์„ฑ์˜ ์ฐฝ์˜์„ฑ๊ณผ ํ˜์‹ ์  ๋ฐฉํ–ฅ์„ฑ ํ‰๊ฐ€์—์„œ LLM์˜ ํ•œ๊ณ„์™€ ๋ณด์™„๋ฒ•์„ ์‹คํ—˜์ ์œผ๋กœ ๋‹ค๋ฃจ์–ด, 3212์˜ ์˜๋ฏธ๋ก ์  ๊ฐœ๋… ๊ทธ๋ž˜ํ”„ ๋ฐ ์˜ˆ์ธก ํ”„๋ ˆ์ž„ ํ™•์žฅ์— ์ธ์‚ฌ์ดํŠธ๋ฅผ ์ค๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
Enabling AI Scientists to Recognize Innovation ๋…ผ๋ฌธ์€ AI๊ฐ€ ์ƒ์„ฑํ•œ ์—ฐ๊ตฌ ์•„์ด๋””์–ด์˜ ํ˜์‹ ์„ฑ ์ž๋™ ํ‰๊ฐ€๋ฒ•์„ ์ œ์‹œํ•ด, 3212์˜ ๋ฏธ๋ž˜ ์—ฐ๊ตฌ ๋ฐฉํ–ฅ์„ฑ ์˜ˆ์ธก ๊ฒฐ๊ณผ ์ •๋Ÿ‰์  ํ‰๊ฐ€์— ์ถ”๊ฐ€ ๊ธฐ์—ฌํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
3212๋Š” ์žฌ๋ฃŒ ๊ณผํ•™ ๋ถ„์•ผ์—์„œ ์ƒˆ๋กœ์šด ์—ฐ๊ตฌ ๋ฐฉํ–ฅ ์˜ˆ์ธก์— AI๋ฅผ ํ™œ์šฉํ•œ ์‹ค์ œ ์‘์šฉ ๋…ผ๋ฌธ์ž…๋‹ˆ๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
์žฌ๋ฃŒ๊ณผํ•™ ๋ฐ ์ƒ๋ช…๊ณผํ•™์—์„œ ์ƒˆ๋กœ์šด ์—ฐ๊ตฌ ๋ฐฉํ–ฅ ์˜ˆ์ธก์— ๋ณธ ๋ฐฉ๋ฒ•๋ก ์„ ์ ์šฉ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•ฉ๋‹ˆ๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •