Scientific hypothesis generation by large language models: laboratory validation in breast cancer treatment

์ €์ž: Abbi Abdel-Rehim, Hector Zenil, Oghenejokpeme Orhobor, Marie Fisher, Ross J. Collins, Elizabeth Bourne, Gareth W. Fearnley, Emma Tate, Holly X. Smith, Larisa N. Soldatova, Ross King | ๋‚ ์งœ: 06/2025 | DOI: 10.1098/rsif.2024.0674 📄 PDF


Essence

Figure 1. The overall structure of our experiments. GPT4 was previously trained on data on a large fraction of the text

๋ณธ ๋…ผ๋ฌธ์€ GPT4์™€ ๊ฐ™์€ ๋Œ€๊ทœ๋ชจ ์–ธ์–ด๋ชจ๋ธ(LLM)์ด ๊ณผํ•™์  ๊ฐ€์„ค ์ƒ์„ฑ์— ํ™œ์šฉ๋  ์ˆ˜ ์žˆ์Œ์„ ๋ณด์—ฌ์ค€๋‹ค. LLM์˜ hallucination์„ ๋ถ€์ •์ ์œผ๋กœ๋งŒ ๋ณด์ง€ ์•Š๊ณ , ์‹คํ—˜์  ๊ฒ€์ฆ์ด ๊ฐ€๋Šฅํ•œ ์ƒˆ๋กœ์šด ๊ณผํ•™์  ๊ฐ€์„ค๋กœ ํ™œ์šฉํ•  ์ˆ˜ ์žˆ์Œ์„ ์œ ๋ฐฉ์•” ์น˜๋ฃŒ์ œ ์กฐํ•ฉ ๋ฐœ๊ฒฌ์„ ํ†ตํ•ด ์ž…์ฆํ–ˆ๋‹ค.

Motivation

Achievement

First-round results: 3 of the 12 hypotheses (itraconazole + atenolol, simvastatin + disulfiram, dipyridamole + mebendazole) achieved synergy scores exceeding those of the positive controls.
Second-round results: 3 of the 4 combinations that GPT4 generated after being given the initial results achieved positive synergy scores.
Combined results: synergistic areas were found in 10 of the 12 hypothesized combinations, and 8 achieved a higher HSA score in MCF7 than in MCF10A.
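
The HSA (highest single agent) score reported above is commonly defined as the observed effect of a combination minus the larger of the two single-drug effects at the same doses, so a positive value means the pair did better than its best component alone. The following is a minimal sketch of that definition, assuming responses are expressed as percent inhibition. The function names and all numbers are hypothetical illustrations, not the authors' analysis pipeline or their data.

```python
from statistics import mean


def hsa_excess(inhib_a: float, inhib_b: float, inhib_combo: float) -> float:
    """Observed combination inhibition minus the best single-agent inhibition."""
    expected = max(inhib_a, inhib_b)  # HSA reference: the stronger single drug
    return inhib_combo - expected


def mean_hsa(single_a, single_b, combo) -> float:
    """Average HSA excess over a dose matrix.

    single_a[i]  : inhibition of drug A alone at dose i
    single_b[j]  : inhibition of drug B alone at dose j
    combo[i][j]  : inhibition of A at dose i combined with B at dose j
    """
    return mean(
        hsa_excess(a, b, combo[i][j])
        for i, a in enumerate(single_a)
        for j, b in enumerate(single_b)
    )


# Hypothetical 2x2 dose matrix (percent inhibition), for illustration only.
single_a = [10.0, 30.0]
single_b = [20.0, 40.0]
combo = [[25.0, 50.0],
         [45.0, 70.0]]

score = mean_hsa(single_a, single_b, combo)
print(score)  # 15.0
```

Computing the same score separately for a cancer line and a non-tumorigenic line (MCF7 and MCF10A in the paper) and comparing the two values is one simple way to read the "higher HSA score in MCF7 than in MCF10A" result as a selectivity measure.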

How

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 | Technical Soundness: 4/5 | Significance: 4/5 | Clarity: 4/5 | Overall: 4/5

์ดํ‰: ๋ณธ ๋…ผ๋ฌธ์€ LLM์˜ hallucination์„ ๊ณผํ•™์  ์ž์‚ฐ์œผ๋กœ ์žฌํ•ด์„ํ•˜๊ณ  ์‹คํ—˜์  ๊ฒ€์ฆ์„ ํ†ตํ•ด ๊ทธ ๊ฐ€์น˜๋ฅผ ์ž…์ฆํ•œ ์ฐฝ์˜์ ์ด๊ณ  ์‹ค์ฆ์ ์ธ ์—ฐ๊ตฌ์ด๋‹ค. ์•ฝ๋ฌผ ๋ฐœ๊ฒฌ ํŒŒ์ดํ”„๋ผ์ธ์˜ ์ดˆ๊ธฐ ๋‹จ๊ณ„์—์„œ LLM ํ™œ์šฉ์˜ ์‹ค์งˆ์  ์ž ์žฌ๋ ฅ์„ ๋ณด์—ฌ์ฃผ๋‚˜, ์ œํ•œ๋œ ์Šค์ฝ”ํ”„์™€ ๋ฉ”์ปค๋‹ˆ์ฆ˜ ๋ถ„์„ ๋ถ€์žฌ ๋“ฑ์ด ๊ฐœ์„ ์ ์ด๋‹ค. ํ•™์ œ ๊ฐ„ ํ˜์‹  ์—ฐ๊ตฌ๋กœ์„œ์˜ ๊ฐ€์น˜์™€ ๊ณผํ•™์—์„œ์˜ AI ํ™œ์šฉ ํŒจ๋Ÿฌ๋‹ค์ž„ ์ „ํ™˜์— ๋Œ€ํ•œ ๊ธฐ์—ฌ๋„๊ฐ€ ๋†’๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๋‹ค๋ฅธ ์ ‘๊ทผ
์žฌ๋ฃŒ๊ณผํ•™ ๋ถ„์•ผ์˜ ์ง€์‹ ๋ฐœ๊ฒฌ์„ ์œ„ํ•œ ๋‹ค๋ฅธ NLP ์ ‘๊ทผ๋ฒ•์„ ์‚ฌ์šฉํ•œ ๊ด€๋ จ ์—ฐ๊ตฌ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
AI๋ฅผ ํ™œ์šฉํ•œ ๊ณผํ•™์  ๋ฐœ๊ฒฌ๊ณผ ์—ฐ๊ตฌ ์ž๋™ํ™”๋ฅผ ๋‹ค๋ฃจ๋Š” ์œ ์‚ฌํ•œ ๋งฅ๋ฝ์˜ ์—ฐ๊ตฌ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
LLM์˜ ๊ณผํ•™์  ์—ฐ๊ตฌ ์ง€์› ๋Šฅ๋ ฅ์„ ํƒ๊ตฌํ•˜๋Š” ์œ ์‚ฌํ•œ ์ฃผ์ œ๋ฅผ ๋‹ค๋ฃจ๋Š” ์—ฐ๊ตฌ์ด๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
์ƒ์˜ํ•™ ๋ฌธํ—Œ ๋ถ„์„์„ ํ†ตํ•œ ์ง€์‹ ๋ฐœ๊ฒฌ ๋ฐฉ๋ฒ•๋ก ์„ ํ™•์žฅํ•˜๋Š” ์œ ์‚ฌ ์—ฐ๊ตฌ์ด๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
LLM์„ ํ™œ์šฉํ•œ ๊ณผํ•™์  ๊ฐ€์„ค ์ƒ์„ฑ ๋ฐ ๋ฐœ๊ฒฌ์„ ๋‹ค๋ฃจ๋Š” ์ง์ ‘์ ์ธ ์—ฐ์žฅ ์—ฐ๊ตฌ๋กœ ๋†’์€ ์œ ์‚ฌ๋„๋ฅผ ๋ณด์ธ๋‹ค.