Scientific hypothesis generation by large language models: laboratory validation in breast cancer treatment

์ €์ž: Abbi Abdel-Rehim, Hector Zenil, Oghenejokpeme Orhobor, Marie Fisher, Ross J. Collins, Elizabeth Bourne, Gareth W. Fearnley, Emma Tate, Holly X. Smith, Larisa N. Soldatova, Ross King | ๋‚ ์งœ: 06/2025 | DOI: 10.1098/rsif.2024.0674 📄 PDF


Essence

Figure 1

Figure 1. The overall structure of our experiments. GPT4 was previously trained on data on a large fraction of the text

GPT-4๋ฅผ ์ด์šฉํ•˜์—ฌ ์œ ๋ฐฉ์•” ์น˜๋ฃŒ๋ฅผ ์œ„ํ•œ ์ƒˆ๋กœ์šด ์•ฝ๋ฌผ ์กฐํ•ฉ ๊ฐ€์„ค์„ ์ƒ์„ฑํ•˜๊ณ  ์‹คํ—˜์‹ค์—์„œ ๊ฒ€์ฆํ•˜์—ฌ, LLM(Large Language Model)์ด ๊ณผํ•™์  ๊ฐ€์„ค ์ƒ์„ฑ์˜ ๊ฐ€์น˜ ์žˆ๋Š” ๋„๊ตฌ์ž„์„ ์ž…์ฆํ–ˆ๋‹ค.

Motivation

Achievement

How

Figure 1

Figure 1. The overall structure of our experiments. GPT4 was previously trained on data on a large fraction of the text

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ๋ณธ ์—ฐ๊ตฌ๋Š” LLM์ด ๋‹จ์ˆœํ•œ ์ •๋ณด ๋„๊ตฌ๋ฅผ ๋„˜์–ด ๊ณผํ•™์  ๋ฐœ๊ฒฌ์˜ ์‹ค์งˆ์  ํŒŒํŠธ๋„ˆ๊ฐ€ ๋  ์ˆ˜ ์žˆ์Œ์„ ์ตœ์ดˆ๋กœ ์—„๊ฒฉํ•˜๊ฒŒ ์ž…์ฆํ•œ ํš๊ธฐ์  ์—ฐ๊ตฌ์ด๋‹ค. ์•ฝ๋ฌผ ์กฐํ•ฉ ๊ฐœ๋ฐœ์ด๋ผ๋Š” ์ž„์ƒ์ ์œผ๋กœ ์ค‘์š”ํ•œ ๋ถ„์•ผ์—์„œ ์‹คํ˜„ ๊ฐ€๋Šฅ์„ฑ์„ ๋ณด์—ฌ์ฃผ์—ˆ์œผ๋‚˜, ํ†ต๊ณ„์  ๊ฒฌ๊ณ ์„ฑ๊ณผ ๋ฉ”์ปค๋‹ˆ์ฆ˜ ๊ทœ๋ช…์„ ์œ„ํ•œ ์ถ”๊ฐ€ ์—ฐ๊ตฌ๊ฐ€ ํ•„์š”ํ•˜๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
719๋Š” LLM ๊ธฐ๋ฐ˜ ๊ณผํ•™์  ๊ฐ€์„ค ํƒ์ƒ‰์˜ ์›๋ฆฌ๋ฅผ ์ œ์‹œํ•˜์—ฌ AlphaFold์˜ ํ˜์‹ ์  ๋ฐœ๊ฒฌ์—์„œ ๋‚˜ํƒ€๋‚œ AI-์ฃผ๋„ ๊ณผํ•™๋ฐœ๊ฒฌ ํ”„๋ ˆ์ž„์›Œํฌ์˜ ์ด๋ก ์  ํ† ๋Œ€๋ฅผ ์ œ๊ณตํ•œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
419 ๋…ผ๋ฌธ์€ LLM ๊ธฐ๋ฐ˜ ๊ณผํ•™์  ๊ฐ€์„ค ์ƒ์„ฑ์˜ ๊ฐœ๋… ๋ฐ ๊ธฐ๋ฒ•์„ ์ฒด๊ณ„์ ์œผ๋กœ ์ •๋ฆฌํ•˜์—ฌ, 719์˜ ๊ฐ€์„ค ์ƒ์„ฑ ์‹ค์ฆ ์—ฐ๊ตฌ์˜ ์ด๋ก ์  ๊ธฐ๋ฐ˜์ด ๋ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
Scientific hypothesis generation by large language models ๋…ผ๋ฌธ์€ LLM์„ ๊ณผํ•™์  ๊ฐ€์„ค ์ƒ์„ฑ์— ํ™œ์šฉํ•˜๋Š” ๊ธฐ๋ณธ ํ•œ๊ณ„์™€ ๋ฌธ์ œ๋ฅผ ์ข…ํ•ฉ์ ์œผ๋กœ ๊ณ ์ฐฐ, KG-CoI ์‹œ์Šคํ…œ์˜ ์ด๋ก ์  ๋ฐฐ๊ฒฝ์„ ํ˜•์„ฑํ•ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
LLM์˜ ๊ณผํ•™์  ๊ฐ€์„ค ์ƒ์„ฑ ๋ฐ ์ถ”๋ก  ์„ฑ๋Šฅ ๋ฉ”์ปค๋‹ˆ์ฆ˜์ด ๋ฐ์ดํ„ฐ ๊ธฐ๋ฐ˜ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ๋ฐฉ์‹๊ณผ ๊ทผ๋ณธ์ ์œผ๋กœ ์—ฐ๊ณ„๋œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
579 ๋…ผ๋ฌธ์€ ๊ณผํ•™์  ์ฃผ์žฅ ์ƒ์„ฑ ๋ฐ ๊ฒ€์ฆ์„ ์œ„ํ•œ ๋ฐ์ดํ„ฐ๋งˆ์ด๋‹ ๋ฐฉ์‹์„ ์ œ์‹œํ•ด 719์˜ LLM ๊ธฐ๋ฐ˜ ๊ฐ€์„ค ์ƒ์„ฑ์˜ ๊ธฐ์ดˆ๊ฐ€ ๋ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
์‹ ๋ขฐํ•  ์ˆ˜ ์žˆ๋Š” ๊ณผํ•™์  ๊ฐ€์„ค ์ƒ์„ฑ ๋ฐ ํ‰๊ฐ€ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ํ†ตํ•ด ์‹คํ—˜์‹ค ๊ฒ€์ฆ ์—ฐ๊ตฌ์˜ ์‹ ๋ขฐ์„ฑ ๊ทผ๊ฑฐ๋ฅผ ์ œ๊ณตํ•œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
Scientific hypothesis generation by large language models ๋…ผ๋ฌธ์€ LLM ๊ธฐ๋ฐ˜์˜ ๊ณผํ•™์  ๊ฐ€์„ค ๋ฐ ์‹คํ—˜ ์˜ˆ์ธก์˜ ์ด๋ก ์  ๊ทผ๊ฑฐ์™€ ์—ฐ๊ตฌ๋™ํ–ฅ์„ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
๊ณผํ•™์  ๊ฐ€์„ค ์ƒ์„ฑ์— LLM์„ ์ ์šฉํ•œ ์‚ฌ๋ก€๋ฅผ ์ค‘์‹ฌ์œผ๋กœ, 002์˜ ๋‹ค์–‘ํ•œ ์‘์šฉ์ด ๊ณผํ•™์  ๋ฐœ๊ฒฌ์— ์–ด๋–ป๊ฒŒ ๊ธฐ์—ฌํ•˜๋Š”์ง€ ์ด๋ก ์  ๋ฐฐ๊ฒฝ์„ ์ดํ•ดํ•  ์ˆ˜ ์žˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
Scientific hypothesis generation by large language models(719)์€ LLM ๊ธฐ๋ฐ˜ ์•„์ด๋””์–ด ๋ฐ ๊ฐ€์„ค ์ƒ์„ฑ์˜ ์›๋ฆฌ์™€ ํ•œ๊ณ„๋ฅผ ๋…ผ์˜ํ•˜๋ฉฐ, 425์˜ ๋ฐ์ดํ„ฐ ๊ธฐ๋ฐ˜ ์•„์ด๋””์–ด ํ–ฅ์ƒ ๊ธฐ๋ฒ•์— ์ด๋ก ์  ๊ทผ๊ฑฐ๋ฅผ ์ œ๊ณตํ•œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
719๋Š” LLM ๊ธฐ๋ฐ˜ ๊ฐœ๋ฐฉํ˜• ๊ณผํ•™ ๊ฐ€์„ค์ƒ์„ฑ์˜ ์ž ์žฌ๋ ฅ๊ณผ ํ•œ๊ณ„์— ๋Œ€ํ•œ ๋น„๊ต ํ‰๊ฐ€๋ฅผ ํ†ตํ•ด, 473 ๋…ผ๋ฌธ์˜ ์ž๋™ํ™” ์‹œ์Šคํ…œ๊ณผ ์ƒํ˜ธ ๋ณด์™„์  ๋…ผ์˜๊ฐ€ ๊ฐ€๋Šฅํ•˜๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
SciMON์€ ์ž๋™์œผ๋กœ ๋ฌธํ—Œ์„ ๊ทผ๊ฑฐ๋กœ ์•„์ด๋””์–ด๋ฅผ ์ƒ์„ฑํ•˜๋ฉฐ, LLM ๊ธฐ๋ฐ˜ ๊ฐ€์„ค ์ƒ์„ฑ์˜ ๋˜ ๋‹ค๋ฅธ ํ”„๋ ˆ์ž„์›Œํฌ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
763 ๋…ผ๋ฌธ์€ ๊ฐ€์„ค์ƒ์„ฑ์„ ๊ตฌ์กฐํ™”๋œ ๋…ผ๋ฌธ ๋ฐ์ดํ„ฐ ๊ธฐ๋ฐ˜์œผ๋กœ ์ ‘๊ทผํ•˜๋Š” ๋ฐ˜๋ฉด, 719๋Š” ์ž„์ƒ ํ™œ์šฉ์„ ์œ„ํ•œ ์‹คํ—˜์  ๊ฒ€์ฆ์„ ๊ฐ•์กฐํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
668๋ฒˆ ๋…ผ๋ฌธ์€ LLM ๊ธฐ๋ฐ˜์˜ ๋ฐ˜๋ณต์ ์ธ ์•„์ด๋””์–ด ์ƒ์„ฑ ๋ฐฉ์‹์„ ์ œ์•ˆํ•˜์—ฌ, LLM์˜ ๊ณผํ•™์  ๊ฐ€์„ค ์ƒ์„ฑ ๋Šฅ๋ ฅ์„ ๋‹ค๋ฅธ ๋ฌธ์ œ ์˜์—ญ์—์„œ ํ‰๊ฐ€ํ•œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
โ€˜Scientific hypothesis generation by large language modelsโ€™๋Š” LLM์˜ ๊ฐ€์„ค ์ƒ์„ฑ ๋Šฅ๋ ฅ์„ ํ‰๊ฐ€ํ•˜๋ฏ€๋กœ, ๋‚ด๋ถ€ ๊ตฌ์กฐํ•™์Šต๊ณผ ๊ฐ€์„ค ํ‰๊ฐ€ ๊ด€์ ์—์„œ ์ƒํ˜ธ๋ณด์™„์ ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
Scientific hypothesis generation by large language models ๋…ผ๋ฌธ์€ LLM ๊ธฐ๋ฐ˜ ๊ณผํ•™์  ์—ฐ๊ตฌ ์ฃผ์ œ ๋ฐ ์กฐํ•ฉ ์˜ˆ์ธก์„ ์‹œ๋„ํ•œ ์—ฐ๊ตฌ๋กœ์„œ, 3212์˜ ์žฌ๋ฃŒ๊ณผํ•™ ํŠนํ™” ์•„์ด๋””์–ด ์˜ˆ์ธก๊ณผ ๋น„๊ตํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
์ง€์‹ ๊ทธ๋ž˜ํ”„ ๊ธฐ๋ฐ˜ LLM ์•„์ด๋””์–ด ์ƒ์„ฑ ๋ฐ ๊ฐ€์„ค ์ƒ์„ฑ ๋ฐฉ๋ฒ•์„ ์‹ค์ œ ์‹คํ—˜๊ฒ€์ฆ ๋‹จ๊ณ„๋กœ ํ™•์žฅํ•˜์˜€๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
418๋ฒˆ ๋…ผ๋ฌธ์€ LLM์„ ํ™œ์šฉํ•œ ์†Œ์žฌ ๋ฐœ๊ฒฌ ๊ฐ€์„ค ์ƒ์„ฑ ๋ฐฉ๋ฒ•์„ ๋‹ค๋ฃจ์–ด, 719๋ฒˆ๊ณผ ๋น„์Šทํ•œ AI ๊ธฐ๋ฐ˜ ๊ฐ€์„ค ์ƒ์„ฑ์„ ์†Œ์žฌ ๊ณผํ•™์— ์‘์šฉํ•œ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
777 ๋…ผ๋ฌธ์€ LLM ๊ธฐ๋ฐ˜ ๊ฐ€์„ค ํƒ์ƒ‰ ๋ฐ ํ˜์‹  ์„ผ์‹ฑ ๋ฐฉ์‹์„ ๊ตฌ์กฐ์  ๋ถ„์„๊ณผ ์ •๋Ÿ‰์  ์ง€ํ‘œ ๋„์ž…์œผ๋กœ ํ™•์žฅํ–ˆ์Šต๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
LLM ๊ธฐ๋ฐ˜ ๊ณผํ•™์  ๊ฐ€์„ค ์ƒ์„ฑ ๋ฐ ๋งคํ•‘ ๋Šฅ๋ ฅ์˜ ํ‰๊ฐ€๋ฅผ ํ†ตํ•ด, 621์˜ PINN-Kalman ์œตํ•ฉ๋ฐฉ๋ฒ•์ด ์‹ค์„ธ๊ณ„ ๋ฌธ์ œ ํ•ด๊ฒฐ์— ๋ฏธ์น˜๋Š” ์˜ํ–ฅ์„ ๋ถ„์„ํ•œ๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •