On the Reliability of AI Methods in Drug Discovery: Evaluation of Boltz-2 for Structure and Binding Affinity Prediction

Authors: | Date: 2026-03-02 | URL: https://arxiv.org/abs/2603.05532


Essence

Figure 4

Fig. 4 Correlation between binding free energies predicted by Boltz-2 (ΔG_Boltz) and calculated via

Boltz-2 is an AI-based foundation model that rapidly predicts protein-ligand structures and binding affinities. Using two large datasets (38,482 compounds), this study finds that its speed is excellent for early-stage screening, but that it lacks the energy resolution needed for lead identification.

Motivation

Achievement

How

Figure 3

Fig. 3 The distribution of confidence scores from the Boltz-2 predictions of both 3CLPro and TNKS2

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 | Technical Soundness: 3/5 | Significance: 4/5 | Clarity: 4/5 | Overall: 4/5

Overall assessment: This is the first large-scale study to evaluate Boltz-2 against a rigorous physics-based benchmark (ESMACS), and it offers an important reality check on how applicable AI foundation models are to drug discovery. Its academic and practical value is high: it shows that Boltz-2 is useful for early-stage screening, while stressing that physics-based methods remain necessary at the lead-identification stage.

Related Papers Worth Reading

Foundational work
Boltz-1 provides the prototype for using foundation models (F.M.) in protein-ligand interaction prediction, and is essential companion reading for understanding how Boltz-2 developed.
Foundational work
The "Foundation models in bioinformatics" paper gives a comprehensive discussion of the strengths, weaknesses, and reliability of foundation models on large-scale molecular and biological data.
Alternative approach
Takes a similar research direction, developing an open-source deep learning model for biomolecular structure prediction.
Alternative approach
Approaches a similar problem differently, in the context of building a chemical database of transition-metal complexes.
Alternative approach
Takes a different, geometric deep learning approach to ligand binding-site prediction.
Alternative approach
Presents a different AI-based approach for functional nucleic acid design.
Alternative approach
Presents a similar computational approach for designing protein-small molecule interactions.
Alternative approach
Presents a similar approach to assessing protein structure consistency and quality.
Alternative approach
Empirically compares screening algorithms across the board, including docking, docking-rescoring, and sampling, which helps gauge the current resolution and performance of AI-based screening.
Alternative approach
The Bolek paper uses a multimodal model for molecular and ligand reasoning, which helps in comparing the state of the art (SOTA) in binding-affinity prediction against the limitations of Boltz-2.
Alternative approach
Presents an alternative computational method for protein-ligand binding prediction.
Counterpoint/Critique
The "On the Reliability of AI Methods in Drug Discovery" paper critically examines reliability problems in automated protein design.
Counterpoint/Critique
Demonstrates that large AI models do not necessarily outperform smaller models in real-world drug-discovery performance, and points out the limitations of foundation models.
Counterpoint/Critique
Paper 3194 addresses the reliability limits and validation problems of AI-based drug and binding prediction, and discusses the practical limitations of the hybrid ML/physics-based strategies that paper 3027 sets out to compare.