A deep learning predictor of bindable protein surfaces to guide generative synthetic biology (IARA)

์ €์ž: | ๋‚ ์งœ: 2026-04-16 | URL: https://www.biorxiv.org/content/10.64898/2026.04.16.718848v1 📄 PDF


Essence

Figure 1

Figure 1 โ€“ Building a graph neural network to predict bindable protein regions. a, Dataset generation strategy. b, Resid

๋ณธ ๋…ผ๋ฌธ์€ de novo ๋‹จ๋ฐฑ์งˆ ๊ฒฐํ•ฉ์ œ ์„ค๊ณ„ ํŒŒ์ดํ”„๋ผ์ธ์˜ ๊ณ„์‚ฐ ๋น„์šฉ์„ ํš๊ธฐ์ ์œผ๋กœ ๊ฐ์†Œ์‹œํ‚ค๊ธฐ ์œ„ํ•ด Graph Attention Network ๊ธฐ๋ฐ˜์˜ IARA ๋ชจ๋ธ์„ ์ œ์•ˆํ•œ๋‹ค. ์ด๋Š” ์ƒ์„ฑ ๋ชจ๋ธ(RFdiffusion, BindCraft, BoltzGen)์˜ ๊ฒฐ๊ณผ๋ฅผ ์‹ ์†ํ•˜๊ฒŒ ์‚ฌ์ „ ํ‰๊ฐ€ํ•˜์—ฌ ๊ฐ€์žฅ ์œ ๋งํ•œ ๊ฒฐํ•ฉ ๋ถ€์œ„๋ฅผ ์ˆ˜์ดˆ ๋‚ด์— ์‹๋ณ„ํ•œ๋‹ค.

Motivation

Achievement

Figure 2

Figure 2 โ€“ IARA predictions for BindCraft. a, Frequency distribution of the predicted interface accuracies for

์„ฑ๊ณต์ ์ธ ํ‘œ๋ฉด ๊ฒฐํ•ฉ์„ฑ ์˜ˆ์ธก: ๊ฒ€์ฆ ์„ธํŠธ์˜ 107๊ฐœ ํ‘œ์ ์—์„œ IARA > 50 ์ž„๊ณ„๊ฐ’์—์„œ 100%, > 60์—์„œ 96%, > 70์—์„œ 92% ์ •ํ™•๋„๋กœ ๊ฒฐํ•ฉ ์ธํ„ฐํŽ˜์ด์Šค ์‹๋ณ„. ์šฐ์ˆ˜ํ•œ ๊ธฐ์ค€์„  ์„ฑ๋Šฅ: MaSIF ๋Œ€๋น„ 100% vs 46.7%์˜ ํ‘œ์  ๊ฐ์ง€์œจ๋กœ ์œ ์˜๋ฏธํ•˜๊ฒŒ ์šฐ์ˆ˜ํ•œ ์„ฑ๋Šฅ. ๋ฒ”์šฉ์„ฑ ๊ฒ€์ฆ: BindCraft, RFdiffusion, BoltzGen ์„ธ ๊ฐ€์ง€ ์ƒ์„ฑ ๋„๊ตฌ์—์„œ ๋ชจ๋‘ ์„ฑ๊ณต์ ์œผ๋กœ ์ตœ์  ๊ฒฐํ•ฉ ์ธํ„ฐํŽ˜์ด์Šค ์‹๋ณ„. ์ ‘๊ทผ์„ฑ ๊ฐœ์„ : ์ดˆ ๋‹จ์œ„ ์‹คํ–‰ ์‹œ๊ฐ„, GPU ๋ถˆํ•„์š”, Python ์Šคํฌ๋ฆฝํŠธ/PyMol/ChimeraX ํ”Œ๋Ÿฌ๊ทธ์ธ/Google Colab์œผ๋กœ ์ œ๊ณต๋˜์–ด ์—ฐ๊ตฌ์ž ์ ‘๊ทผ์„ฑ ๊ทน๋Œ€ํ™”.

How

Figure 1

Figure 1 โ€“ Building a graph neural network to predict bindable protein regions. a, Dataset generation strategy. b, Resid

Originality

Limitation & Further Study

์ œํ•œ ์‚ฌํ•ญ: ํ•ฉ์„ฑ RFdiffusion ํ‘œ์ ์œผ๋กœ๋งŒ ํ›ˆ๋ จ๋˜์–ด ์ž์—ฐ ๋‹จ๋ฐฑ์งˆ ๊ตฌ์กฐ์— ๋Œ€ํ•œ ์ผ๋ฐ˜ํ™” ๊ฐ€๋Šฅ์„ฑ ๋ฏธํ™•์ธ. ๊ฒ€์ฆ์ด ์ฃผ๋กœ ํ•ฉ์„ฑ ๋ฐ์ดํ„ฐ์…‹ ๋‚ด์—์„œ ์ˆ˜ํ–‰๋˜์–ด ์‹ค์ œ ์‹คํ—˜์  ๊ฒ€์ฆ ๋ถ€์กฑ. 7๊ฐœ ํŠน์„ฑ ์„ ํƒ์˜ ์ •๋‹น์„ฑ์— ๋Œ€ํ•œ ablation study ๋ฏธ์ œ์‹œ. ํ›„์† ์—ฐ๊ตฌ: ์ž์—ฐ ๋‹จ๋ฐฑ์งˆ ํ‘œ์ ์— ๋Œ€ํ•œ ์„ฑ๋Šฅ ํ‰๊ฐ€, ์‹ค์ œ ์‹คํ—˜์  ๊ฒ€์ฆ์„ ํ†ตํ•œ ์ƒ์„ฑ ๊ฒฐํ•ฉ์ œ์˜ ํ‘œํ˜„ ๊ฐ€๋Šฅ์„ฑ ํ™•์ธ, ์ถ”๊ฐ€ ํŠน์„ฑ์˜ ๋„์ž…์œผ๋กœ ์„ฑ๋Šฅ ํ–ฅ์ƒ ๊ฐ€๋Šฅ์„ฑ ํƒ์ƒ‰, ๋‹ค์–‘ํ•œ GNN ์•„ํ‚คํ…์ฒ˜ ๋น„๊ต.

Evaluation

Novelty: 4/5 Technical Soundness: 4/5 Significance: 5/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: IARA๋Š” ์‹ค์งˆ์ ์ธ ๊ณ„์‚ฐ ๋ณ‘๋ชฉ์„ ํ•ด๊ฒฐํ•˜๋Š” ํšจ์œจ์ ์ด๊ณ  ์ ‘๊ทผ ๊ฐ€๋Šฅํ•œ ๋„๊ตฌ๋กœ์„œ ํ•ฉ์„ฑ ์ƒ๋ฌผํ•™์„ ๋ฏผ์ฃผํ™”ํ•˜๋Š” ๊ฐ•๋ ฅํ•œ ๊ธฐ์—ฌ๋ฅผ ํ•œ๋‹ค. ํ•ฉ์„ฑ ๋ฐ์ดํ„ฐ์…‹ ๊ธฐ๋ฐ˜ ํ›ˆ๋ จ์˜ ํ•œ๊ณ„์™€ ์‹ค์ œ ์‹คํ—˜์  ๊ฒ€์ฆ ๋ถ€์กฑ์ด ์žˆ์œผ๋‚˜, ์„ธ ๊ฐ€์ง€ ์ƒ์„ฑ ๋„๊ตฌ์— ๊ฑธ์นœ ๊ฐ•๋ ฅํ•œ ๊ฒ€์ฆ ์„ฑ๋Šฅ๊ณผ ๋†’์€ ์‹ค์šฉ์  ๊ฐ€์น˜๊ฐ€ ์ด๋ฅผ ์ƒ์‡„ํ•œ๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
256๋ฒˆ ๋…ผ๋ฌธ์€ RFdiffusion ๊ธฐ๋ฐ˜ de novo ๋‹จ๋ฐฑ์งˆ ์„ค๊ณ„ ํŒŒ์ดํ”„๋ผ์ธ์„ ์ œ์‹œ, IARA๊ฐ€ ์‚ฌ์ „ํ‰๊ฐ€ํ•  ์ƒ์„ฑ ๊ฒฐ๊ณผ์˜ ๋Œ€ํ‘œ์  ์˜ˆ์‹œ๋กœ ์ฐธ๊ณ  ๊ฐ€๋Šฅํ•ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
๋‹จ๋ฐฑ์งˆ ๊ฒฐํ•ฉ ํ‘œ๋ฉด ์˜ˆ์ธก ๋“ฑ ๊ตฌ์กฐ ์˜ˆ์ธก ๊ธฐ๋ฐ˜ ์‹ค์šฉ์  ์ƒ์„ฑ AI ์„ค๊ณ„๊ฐ€ ๋ฐ€์ ‘ํ•˜๊ฒŒ ์—ฐ๊ฒฐ๋˜์–ด ์žˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
AlphaFold2 ๊ธฐ๋ฐ˜์œผ๋กœ ๋‹จ๋ฐฑ์งˆ ํ‘œ๋ฉด ๋ฐ”์ธ๋”ฉ ์˜ˆ์ธก์„ ๋‹ค๋ฃฌ ๋…ผ๋ฌธ์œผ๋กœ, PISCO์˜ ๊ตฌ์กฐ ์ •๋ณด ๊ธฐ๋ฐ˜ ๋ฐ”์ธ๋”ฉ ์˜ˆ์ธก ๊ธฐ์ˆ ๊ณผ ๊ฐœ๋…์ ์œผ๋กœ ์—ฐ๊ณ„๋œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
ํ•ญ์ฒด/๋‹จ๋ฐฑ์งˆ ๊ฒฐํ•ฉ ์นœํ™”๋„ ์˜ˆ์ธก์˜ ์ตœ์ข… ๊ฒ€์ฆ ๋‹จ๊ณ„์—์„œ IARA์™€ Rosetta ๊ธฐ๋ฐ˜ ํ‰๊ฐ€๋ฅผ ๊ฐ๊ธฐ ํ™œ์šฉํ•˜๋Š” ์‚ฌ๋ก€๋กœ, ๊ณ„์‚ฐ-์‹คํ—˜ ์—ฐ๊ณ„ ํ‰๊ฐ€ ๋ฐฉ์‹์„ ๋‹ค์–‘ํ™”ํ•œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
PhaSeMotif์™€ ์œ ์‚ฌํ•˜๊ฒŒ ๋‹จ๋ฐฑ์งˆ ๊ฒฐํ•ฉ ๋ถ€์œ„ ์˜ˆ์ธก์„ ์ˆ˜ํ–‰ํ•˜์ง€๋งŒ, ๋ณด๋‹ค ๋‹จ์ˆœํ•œ ๋”ฅ๋Ÿฌ๋‹ ๊ธฐ๋ฐ˜ ์˜ˆ์ธก ๋ชจ๋ธ์„ ๋‹ค๋ฃฌ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋‹จ๋ฐฑ์งˆ ํ‘œ๋ฉด์˜ ๊ฒฐํ•ฉ ๊ฐ€๋Šฅ ๋ถ€์œ„ ์˜ˆ์ธก์„ ์œ„ํ•ด ๋”ฅ๋Ÿฌ๋‹ ๋ชจ๋ธ์„ ์‚ฌ์šฉํ•˜๋Š” ๋Œ€์‹ , ๊ฒฐํ•ฉ์„ฑ ์˜ˆ์ธก์„ ์œ„ํ•œ ์ƒˆ๋กœ์šด ์˜ˆ์ธก๊ธฐ๋ฅผ ์ œ์•ˆํ•ด ์ƒํ˜ธ๋ณด์™„์ ์ž…๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
2988๋ฒˆ ๋…ผ๋ฌธ์€ graph attention network๋กœ ๊ฒฐํ•ฉ๋ถ€์œ„ ์˜ˆ์ธก์„ ์ˆ˜ํ–‰ํ•ด, AlphaFold2 ๊ธฐ๋ฐ˜ ํ†ต๊ณ„๋ชจ๋ธ ์ ‘๊ทผ์ธ AF2BIND์™€ ์ฐจ๋ณ„์  ๋น„๊ต๊ฐ€ ๊ฐ€๋Šฅํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋จธ์‹ ๋Ÿฌ๋‹ ๊ธฐ๋ฐ˜ ๋‹จ๋ฐฑ์งˆ ํ‘œ๋ฉด ๋ฐ”์ธ๋”ฉ ์˜ˆ์ธก ๋ฐ ๊ฐ€์ด๋“œ ์—ฐ๊ตฌ๋กœ, NTF2 ๋„๋ฉ”์ธ ๋‹จ๋ฐฑ์งˆ ๊ฐ€๊ณ„ ์„ค๊ณ„์™€ ๋‹ค๋ฅธ ๋ฐฉ๋ฒ•์„ ๋น„๊ตํ•ด๋ณผ ์ˆ˜ ์žˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
3245๋ฒˆ ๋…ผ๋ฌธ์€ ์„œ์—ด ๋ฐ ๊ตฌ์กฐ ์ •๋ณด๋ฅผ ํ™œ์šฉํ•œ deep learning ๊ธฐ๋ฐ˜ ๋‹จ๋ฐฑ์งˆ ๊ฒฐํ•ฉ ์˜ˆ์ธก์œผ๋กœ, IARA ๋ชจ๋ธ์˜ binding site ์˜ˆ์ธก๊ณผ ๋‹ค๋ฅธ ๋ฐฉํ–ฅ์ž…๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
2988 ๋…ผ๋ฌธ์€ GAT ๊ธฐ๋ฐ˜ bindable surface ์˜ˆ์ธก, 2991์€ physics ๋”ฅ๋Ÿฌ๋‹ ๊ฒฐํ•ฉํ˜• ํŒŒ์ดํ”„๋ผ์ธ์œผ๋กœ, de novo ๋‹จ๋ฐฑ์งˆ-๋ฆฌ๊ฐ„๋“œ ์ƒํ˜ธ์ž‘์šฉ ์„ค๊ณ„์—์„œ ์ „๋žต ์ฐจ์ด๋ฅผ ๋ณด์ž…๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๊ธฐ๊ณ„ํ•™์Šต ๊ธฐ๋ฐ˜ ๊ฒฐํ•ฉ ๋ถ€์œ„ ์˜ˆ์ธก์— ๋Œ€ํ•œ ๋˜ ๋‹ค๋ฅธ ์ ‘๊ทผ๋ฒ•์œผ๋กœ, ์ผ๋ฐ˜ํ™” ๋Šฅ๋ ฅ ๋น„๊ตยท๋ถ„์„์ด ๊ฐ€๋Šฅํ•ฉ๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
์ƒ์„ฑ ๋‹จ๋ฐฑ์งˆ ์„ค๊ณ„ ํŒŒ์ดํ”„๋ผ์ธ์—์„œ ๊ฒฐํ•ฉ ๋ถ€์œ„ ๊ฒ€์ฆ ๋ฐ ์‚ฌ์ „ ์„ ๋ณ„์„ ํ†ตํ•ด ํšจ์œจ์ ์ธ ์‹คํ—˜ ์ „์‚ฐ ๊ฒ€์ฆ์œผ๋กœ ํ™•์žฅ๋œ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
2988 ๋…ผ๋ฌธ์€ RFdiffusion ๋“ฑ ์ƒ์„ฑ ๋‹จ๋ฐฑ์งˆ ๋ชจ๋ธ์˜ ๊ฒฐํ•ฉ ๋ถ€์œ„ ์˜ˆ์ธก ์ •ํ™•๋„ ๊ฐœ์„ ์„ ๋ชฉ์ ์œผ๋กœ ์‚ฌ์ „ ํ‰๊ฐ€ํ•˜๋Š” GAT ๊ธฐ๋ฐ˜ ๋ชจ๋ธ์„ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
FlashPPI์˜ ๋Œ€๊ทœ๋ชจ ๋‹จ๋ฐฑ์งˆ ์ƒํ˜ธ์ž‘์šฉ ์˜ˆ์ธก๊ณผ ์œ ์‚ฌํ•˜๊ฒŒ, bindable protein surface ์˜ˆ์ธก์— ๋Œ€ํ•œ ํ™•์žฅ ๋ฐฉ๋ฒ•์„ ์ œ์•ˆํ•œ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
2991๋ฒˆ ๋…ผ๋ฌธ์€ deep learning๊ณผ ๋ฌผ๋ฆฌ ๊ธฐ๋ฐ˜ ์ ‘๊ทผ์„ ๊ฒฐํ•ฉํ•ด ๊ฒฐํ•ฉ ๋ถ€์œ„ ์˜ˆ์ธก์˜ ์ •๋ฐ€๋„๋ฅผ ๋†’์ด๊ณ  IARA์˜ ํ‰๊ฐ€๋ชจ๋“ˆ ๋“ฑ ํŒŒ์ดํ”„๋ผ์ธ ํšจ์œจํ™”๋ฅผ ํ™•์žฅํ•ฉ๋‹ˆ๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •