Benchmarking Single-Pose Docking, Consensus Rescoring, and Supervised ML on the LIT-PCBA Library

์ €์ž: Youssef Abo-Dahab, Xiaoiang Xiang, Joanne Chun, Liang Zhao | ๋‚ ์งœ: 2026-05-03 | URL: https://arxiv.org/abs/2605.01681 📄 PDF


Essence

Figure 1

Figure 1. illustrates an example of EF1% calculation, where the percentage of actives in the entire library is 5%.

๋ณธ ๋…ผ๋ฌธ์€ LIT-PCBA ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ์˜ 15๊ฐœ ํƒ€๊นƒ๊ณผ 578,000+ ๋ฆฌ๊ฐ„๋“œ-ํƒ€๊นƒ ์Œ์—์„œ AutoDock-GPU, DiffDock, GNINA, DiffDock-NMDN ๋“ฑ ์—ฌ๋Ÿฌ ๋„ํ‚น ๋ฐ ์žฌ๋žญํ‚น ๋ฐฉ๋ฒ•์˜ ์„ฑ๋Šฅ์„ ์ฒด๊ณ„์ ์œผ๋กœ ๋ฒค์น˜๋งˆํ‚นํ•œ๋‹ค. ์ง€๋„ํ•™์Šต ์žฌ๋žญํ‚น์ด ๊ฐ€์žฅ ์šฐ์ˆ˜ํ•œ ์„ฑ๋Šฅ์„ ๋‹ฌ์„ฑํ–ˆ์œผ๋ฉฐ, ์–ด๋А ํ•œ ๊ฐ€์ง€ ๋„ํ‚น ๊ธฐ๋ฒ•๋„ ๋ชจ๋“  ํƒ€๊นƒ์—์„œ ์ผ๊ด€๋˜๊ฒŒ ์ž‘๋™ํ•˜์ง€ ์•Š์Œ์„ ๋ณด์—ฌ์ค€๋‹ค.

Motivation

Achievement

Figure 4

Figure 4. Comparison of EF1% performance across scoring methods for the DiffDock and AutoDock pathways.

How

Figure 3

Figure 3. The flowchart of consensus docking.

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 4/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ๋ณธ ๋…ผ๋ฌธ์€ ์‹คํ—˜ ๊ธฐ๋ฐ˜ ๋Œ€๊ทœ๋ชจ ๋ฒค์น˜๋งˆํฌ์—์„œ ์ตœ์‹  ๋„ํ‚น ๋ฐฉ๋ฒ•๋“ค์„ ์—„๋ฐ€ํ•˜๊ฒŒ ํ‰๊ฐ€ํ•œ ์˜๋ฏธ ์žˆ๋Š” ์—ฐ๊ตฌ๋‹ค. AutoDock-GNINA๊ฐ€ ๊ฐ€์žฅ ์‹ ๋ขฐํ•  ์ˆ˜ ์žˆ์œผ๋ฉฐ, ๋‹จ์ˆœํ•œ ML ๊ธฐ๋ฐ˜ ์ตœ์‹  ๋ฐฉ๋ฒ•์ด ํ•ญ์ƒ ์šฐ์ˆ˜ํ•˜์ง€ ์•Š์Œ์„ ์‹ค์ฆ์ ์œผ๋กœ ๋ณด์—ฌ์ค€๋‹ค. ์ง€๋„ํ•™์Šต ์žฌ๋žญํ‚น์˜ ์‹ค์งˆ์  ๊ฐ€์น˜๋ฅผ ์ž…์ฆํ•˜๋˜, ์–ด๋А ๊ธฐ๋ฒ•๋„ ๋ชจ๋“  ํƒ€๊นƒ์—์„œ ์ผ๊ด€๋˜๊ฒŒ ์ž‘๋™ํ•˜์ง€ ์•Š์Œ์„ ๊ฐ•์กฐํ•œ๋‹ค. ์ œํ•œ๋œ ํƒ€๊นƒ ์ˆ˜์™€ ์ ˆ๋Œ€ ์„ฑ๋Šฅ์˜ ๋‚ฎ์Œ์ด ์•ฝ์ ์ด์ง€๋งŒ, ์‹ค๋ฌด์  ํ†ต์ฐฐ๋ ฅ์ด ๋†’๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
870 ๋…ผ๋ฌธ์€ ํ•™์ˆ  ๋…ผ๋ฌธ ํ‰๊ฐ€(ํ…์ŠคํŠธ ๋งค์นญ)์˜ ํ•œ๊ณ„์™€ ์ทจ์•ฝ์ ์„ ์ง€์ ํ•˜์—ฌ, 3037์˜ ๋„๊ตฌ ํ‰๊ฐ€ ์‹ ๋ขฐ๋„ ๋ฐ ๋„์ถœ ์ง€ํ‘œ ํ•ด์„ ์‹œ ์ฐธ๊ณ ํ•  ์ˆ˜ ์žˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
๋ถ„์ž ๋„ํ‚น ๋ฐ ์žฌ๋žญํ‚น ๋ฐฉ๋ฒ• ๋ฒค์น˜๋งˆํ‚น์— ์ดˆ์ ์„ ๋งž์ถ˜ ๋…ผ๋ฌธ์œผ๋กœ, ๊ฐ™์€ ํ‘œ์ค€ ํ‰๊ฐ€ ์ ‘๊ทผ๋ฐฉ์‹์„ ๋…ผ์˜ํ•˜๋ฏ€๋กœ ํ•จ๊ป˜ ๋น„๊ตํ•ด๋ณด๋ฉด ์œ ์šฉํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋‹จ๋ฐฑ์งˆ-๋ฆฌ๊ฐ„๋“œ ๋ชจ๋ธ๋ง์—์„œ ์ง€์˜ค๋ฉ”ํŠธ๋ฆญ ํŒŒ์šด๋ฐ์ด์…˜ ๋ชจ๋ธ์„ ์ œ์‹œํ•˜์—ฌ ๋ณธ ๋…ผ๋ฌธ์˜ ๋„ํ‚น ๋ฒค์น˜๋งˆํฌ์™€ ๊ตฌ์กฐ์  ๋‹ค์–‘์„ฑ ์ ‘๊ทผ๋ฒ•์„ ๋น„๊ตํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
3203 ๋…ผ๋ฌธ์€ ๋‹จ๋ฐฑ์งˆ-๋ฆฌ๊ฐ„๋“œ ๊ฒฐํ•ฉ ์นœํ™”๋„ ์˜ˆ์ธก์—์„œ ์ƒˆ๋กœ์šด ๋ชจ๋ธ๋ง ๊ธฐ๋ฒ•์„ ์ œ์•ˆํ•˜์—ฌ, 3037์˜ ๋‹ค์–‘ํ•œ ๋„ํ‚น ๊ธฐ๋ฒ• ๋ฒค์น˜๋งˆํ‚น๊ณผ ๋น„๊ตํ•ด๋ณผ ๋งŒํ•˜๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
Benchmarking Single-Pose Docking, Consensus Rescoring, and Small Molecule Conformational Search for Accurate Binding Affinity Prediction์€ ๋‹ค๋ฅธ ๊ตฌ์กฐ๊ธฐ๋ฐ˜(ํ•ฉ์„ฑ ์ ‘๊ทผ) ๋‹จ๋ฐฑ์งˆ-๋ฆฌ๊ฐ„๋“œ ๊ฒฐํ•ฉ ์˜ˆ์ธก์„ ๋‹ค๋ค„, 3139์™€ ๋น„๊ตํ•  ์ˆ˜ ์žˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
Docking, docking-rescoring, sampling ๋“ฑ ์Šคํฌ๋ฆฌ๋‹ ์•Œ๊ณ ๋ฆฌ์ฆ˜ ์ „๋ฐ˜์„ ์‹คํ—˜์ ์œผ๋กœ ๋น„๊ตํ•˜์—ฌ, AI ๊ธฐ๋ฐ˜ ์Šคํฌ๋ฆฌ๋‹ ํ•ด์ƒ๋„ยท์„ฑ๋Šฅ์˜ ํ˜„์ฃผ์†Œ๋ฅผ ํ‰๊ฐ€ํ•˜๋Š” ๋ฐ ๋„์›€์„ ์ค€๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
ํ™”ํ•™ ๋„๊ตฌ์™€ LLM์„ ๊ฒฐํ•ฉํ•œ ์ ‘๊ทผ๋ฒ•์ด ๋„ํ‚นยท์žฌ๋žญํ‚น์— ์–ด๋–ป๊ฒŒ ํ™œ์šฉ๋  ์ˆ˜ ์žˆ์„์ง€ ๋…ผ์˜ํ•˜๋Š” ์‹ค์šฉ์  ๋งฅ๋ฝ์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
์ƒ๋ฌผํ•™์  ํŒŒ์šด๋ฐ์ด์…˜ ๋ชจ๋ธ์˜ ์œ„์ƒยท๊ธฐํ•˜์  ๊ตฌ์กฐ ํ‰๊ฐ€๊ฐ€ ๋ฆฌ๊ฐ„๋“œ-ํƒ€๊นƒ ๊ฒฐํ•ฉ ๋ฒค์น˜๋งˆํฌ์™€ ์ง์ ‘ ์—ฐ๊ด€๋œ๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
de novo ๋‹จ๋ฐฑ์งˆ ์„ค๊ณ„์˜ ์‹ ๋ขฐ์„ฑ ํ‰๊ฐ€ ๋ฌธ๋งฅ์—์„œ ๋‹ค์ค‘ ํƒ€๊นƒ ๋ฒค์น˜๋งˆํฌ ๊ฒฐ๊ณผ๋ฅผ ์ ์šฉํ•ด๋ณผ ์ˆ˜ ์žˆ๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •