Graph Neural Networks (GNNs) for Protein-Ligand Interaction Prediction

์ €์ž: | ๋‚ ์งœ: 2026-04-23 | URL: https://www.biorxiv.org/content/10.64898/2026.04.23.720519v1 📄 PDF


Essence

๋ณธ ๋…ผ๋ฌธ์€ Graph Neural Networks (GNNs)๋ฅผ ํ™œ์šฉํ•˜์—ฌ ๋‹จ๋ฐฑ์งˆ-๋ฆฌ๊ฐ„๋“œ ์ƒํ˜ธ์ž‘์šฉ์„ ์˜ˆ์ธกํ•˜๋Š” ์‹œ์Šคํ…œ์„ ์ œ์‹œํ•œ๋‹ค. ๋‹จ๋ฐฑ์งˆ ์–ธ์–ด๋ชจ๋ธ๊ณผ์˜ ๊ฒฐํ•ฉ์„ ํ†ตํ•œ ํ•˜์ด๋ธŒ๋ฆฌ๋“œ ์•„ํ‚คํ…์ฒ˜๋ฅผ ํ†ตํ•ด ํˆฌ๋ช…์„ฑ๊ณผ ์—๋„ˆ์ง€ ํšจ์œจ์„ฑ์„ ๋™์‹œ์— ํ™•๋ณดํ•˜๊ณ , ์•ฝ๋ฌผ ๋ฐœ๊ฒฌ ํŒŒ์ดํ”„๋ผ์ธ์˜ ๊ฐ€์†ํ™”๋ฅผ ๋ชฉํ‘œ๋กœ ํ•œ๋‹ค.

Motivation

Achievement

์‹ค์ œ ์ ์šฉ ์‹œ์Šคํ…œ ๊ฐœ๋ฐœ: ์‚ฌ์šฉ์ž ์นœํ™”์  ์ธํ„ฐํŽ˜์ด์Šค(UI)๋ฅผ ํฌํ•จํ•œ end-to-end ์‹œ์Šคํ…œ ๊ตฌํ˜„์œผ๋กœ, ์—ฐ๊ตฌ์ž, ํ•™์ƒ, ์˜์‚ฌ, ์ œ์•ฝ ์‚ฐ์—…์ด ํ™œ์šฉ ๊ฐ€๋Šฅํ•œ ์‹ค๋ฌด ๋„๊ตฌ ์ œ๊ณต. ํ•ด์„๊ฐ€๋Šฅ์„ฑ ๊ฐ•ํ™”: Explainable GNNs (XGNNs)๋ฅผ ํ†ตํ•ด deep learning๊ณผ ์ƒํ™”ํ•™ ์ง€์‹์˜ ์—ฐ๊ฒฐ์„ ์‹œ๋„ํ•˜์—ฌ ๋ชจ๋ธ ์‹ ๋ขฐ๋„ ํ–ฅ์ƒ. ํšจ์œจ์„ฑ ํ™•๋ณด: ๊ณ -์ฒ˜๋ฆฌ๋Ÿ‰ ์Šคํฌ๋ฆฌ๋‹์„ ํ†ตํ•œ ๊ณ„์‚ฐ ๋น„์šฉ ์ ˆ๊ฐ ๋ฐ ์—๋„ˆ์ง€ ํšจ์œจ์„ฑ ๋‹ฌ์„ฑ. ์ƒ๋ฌผํ•™์  ํƒ€๋‹น์„ฑ: ์ˆ˜์†Œ๊ฒฐํ•ฉ, ์†Œ์ˆ˜์„ฑ ์ƒํ˜ธ์ž‘์šฉ, ์ •์ „๊ธฐ๋ ฅ ๋“ฑ ๋ณต์žกํ•œ ๋ถ„์ž ์ƒํ˜ธ์ž‘์šฉ์˜ ํฌ์ฐฉ.

How

โ€ข ๋‹จ๋ฐฑ์งˆ๊ณผ ๋ฆฌ๊ฐ„๋“œ ๊ตฌ์กฐ๋ฅผ ๋ถ„์ž ์ˆ˜์ค€์˜ node-edge ํŠน์„ฑ์œผ๋กœ ํ‘œํ˜„\nโ€ข Message passing ๋ฉ”์ปค๋‹ˆ์ฆ˜์„ ํ†ตํ•œ ๊ทธ๋ž˜ํ”„ ์ •๋ณด ์ „ํŒŒ\nโ€ข attention mechanism๊ณผ hierarchical feature extraction์œผ๋กœ ์ค‘์š” ๋ถ€๋ถ„ ๊ฐ•์กฐ\nโ€ข Self-supervised learning๊ณผ transfer learning์„ ํ†ตํ•œ ํ‘œํ˜„ ํ•™์Šต ํ–ฅ์ƒ\nโ€ข Molecular docking simulation๊ณผ์˜ ํ†ตํ•ฉ์œผ๋กœ ๋ฌผ๋ฆฌํ™”ํ•™์  ์‹ ๋ขฐ์„ฑ ํ™•๋ณด\nโ€ข ์‚ฌ์šฉ์ž ์ธํ„ฐํŽ˜์ด์Šค ๊ฐœ๋ฐœ์„ ํ†ตํ•œ ์‹ค๋ฌด ์ ์šฉ์„ฑ ํ™•๋ณด

Originality

โ€ข Protein language model๊ณผ GNN์˜ ํ•˜์ด๋ธŒ๋ฆฌ๋“œ ์•„ํ‚คํ…์ฒ˜ ์„ค๊ณ„\nโ€ข ๋‹จ์ˆœ ๋ชจ๋ธ ๊ฐœ๋ฐœ์„ ๋„˜์–ด ์‹ค์ œ ์ ์šฉ ๊ฐ€๋Šฅํ•œ UI ์‹œ์Šคํ…œ ๊ตฌํ˜„์œผ๋กœ ์ฐจ๋ณ„ํ™”\nโ€ข Explainable GNNs (XGNNs)๋ฅผ ํ†ตํ•œ ํ•ด์„๊ฐ€๋Šฅ์„ฑ ๊ฐ•ํ™” ์‹œ๋„\nโ€ข Molecular docking๊ณผ deep learning์˜ ๊ฒฐํ•ฉ์œผ๋กœ ์ƒ๋ฌผํ•™์  ํƒ€๋‹น์„ฑ ์ถ”๊ตฌ\nโ€ข ์—๋„ˆ์ง€ ํšจ์œจ์„ฑ๊ณผ ํˆฌ๋ช…์„ฑ์„ ๋™์‹œ์— ๋‹ฌ์„ฑํ•˜๋ ค๋Š” ์‹œ๋„

Limitation & Further Study

โ€ข ๊ตฌ์ฒด์ ์ธ ์‹คํ—˜ ๊ฒฐ๊ณผ ๋ฐ ๋ฒค์น˜๋งˆํฌ ๋ฐ์ดํ„ฐ ๋ถ€์žฌ: ๋…ผ๋ฌธ์€ ์ฃผ๋กœ ๋ฐฉ๋ฒ•๋ก ๊ณผ ์„ค๊ณ„๋ฅผ ์„ค๋ช…ํ•˜๋ฉฐ ์ •๋Ÿ‰์  ์„ฑ๊ณผ ๊ฒฐ๊ณผ๊ฐ€ ์ œ์‹œ๋˜์ง€ ์•Š์Œ\nโ€ข ๋ชจ๋ธ ํ‰๊ฐ€ ๋ฐ ๊ฒ€์ฆ ๋ฐฉ๋ฒ• ๋ฏธํก: ๊ธฐ์กด ๋ฐฉ๋ฒ•๊ณผ์˜ ๋น„๊ต, cross-validation, ํ†ต๊ณ„์  ์œ ์˜์„ฑ ๊ฒ€์ฆ ๋ถ€์žฌ\nโ€ข Dataset bias ์ฒ˜๋ฆฌ ๋ฐฉ์•ˆ์ด ์ถฉ๋ถ„ํžˆ ์ƒ์„ธํ•˜์ง€ ์•Š์Œ\nโ€ข ํ›„์† ์—ฐ๊ตฌ: ๋Œ€๊ทœ๋ชจ ์‹ค์ œ ๋ฐ์ดํ„ฐ์…‹์—์„œ์˜ ์„ฑ๋Šฅ ๊ฒ€์ฆ, ๋‹ค์–‘ํ•œ ๋‹จ๋ฐฑ์งˆ-๋ฆฌ๊ฐ„๋“œ ๋ณตํ•ฉ์ฒด์— ๋Œ€ํ•œ ์ผ๋ฐ˜ํ™” ๋Šฅ๋ ฅ ํ‰๊ฐ€, UI์˜ ์‚ฌ์šฉ์„ฑ ๋ฐ ์ž„์ƒ ์‹ค์šฉ์„ฑ ๊ฒ€์ฆ ํ•„์š”\nโ€ข ๊ณ„์‚ฐ ๋ณต์žก๋„ ๋ฐ ํ™•์žฅ์„ฑ์— ๋Œ€ํ•œ ๋ถ„์„ ๋ถ€์กฑ\nโ€ข GNN์˜ ๊ณผ๋‹ค ํ‰ํ™œํ™”(over-smoothing) ๋ฌธ์ œ ํ•ด๊ฒฐ ๋ฐฉ์•ˆ ๋ฏธ์ œ์‹œ

Evaluation

Novelty: 3/5 Technical Soundness: 2/5 Significance: 3/5 Clarity: 2/5 Overall: 2/5

์ดํ‰: ๋…ผ๋ฌธ์€ ๋‹จ๋ฐฑ์งˆ-๋ฆฌ๊ฐ„๋“œ ์ƒํ˜ธ์ž‘์šฉ ์˜ˆ์ธก์„ ์œ„ํ•œ GNN ๊ธฐ๋ฐ˜ ์‹ค๋ฌด ์‹œ์Šคํ…œ ๊ฐœ๋ฐœ์˜ ์ค‘์š”์„ฑ์„ ๊ฐ•์กฐํ•˜๊ณ  ํ•˜์ด๋ธŒ๋ฆฌ๋“œ ์•„ํ‚คํ…์ฒ˜์˜ ์„ค๊ณ„ ๊ฐœ๋…์„ ์ œ์‹œํ•˜๋‚˜, ๊ตฌ์ฒด์ ์ธ ์‹คํ—˜ ๊ฒฐ๊ณผ, ์ •๋Ÿ‰์  ์„ฑ๋Šฅ ํ‰๊ฐ€, ๊ธฐ์ˆ ์  ์ƒ์„ธ์„ฑ์ด ๋งค์šฐ ๋ถ€์กฑํ•˜๋‹ค. ๋ฐฉ๋ฒ•๋ก ์˜ ์‹ ์„ฑํ•จ์€ ์ œํ•œ์ ์ด๋ฉฐ ๊ธฐ์ˆ ์  ํƒ€๋‹น์„ฑ์„ ์ž…์ฆํ•˜๋Š” ์‹คํ—˜ ๊ฒ€์ฆ์ด ์ „ํ˜€ ์ œ์‹œ๋˜์ง€ ์•Š์•„, ํ˜„์žฌ ์ƒํƒœ๋กœ๋Š” ํ•™์ˆ  ๋…ผ๋ฌธ์˜ ๊ธฐ์ค€์„ ์ถฉ์กฑํ•˜๊ธฐ ์–ด๋ ต๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
GNN์„ ๊ธฐ๋ฐ˜์œผ๋กœ ๋ถ„์ž ์ˆ˜์ค€ ํŠน์„ฑ์„ ์˜ˆ์ธกํ•˜๋Š” ํšจ์œจ์  ๋ฐ ๋“ฑ๋ณ€(equivariant) ๊ทธ๋ž˜ํ”„ ๋ชจ๋ธ๋ง ๊ธฐ๋ฒ•์„ ์ œ์‹œํ•ด, [3123]์˜ ์•„ํ‚คํ…์ฒ˜ ์„ ํƒ์— ์ด๋ก ์  ๊ทผ๊ฑฐ๋ฅผ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
3123 ๋…ผ๋ฌธ์€ ๋‹จ๋ฐฑ์งˆ-๋ฆฌ๊ฐ„๋“œ ์ƒํ˜ธ์ž‘์šฉ ์˜ˆ์ธก์„ ์œ„ํ•ด ๋‹ค์–‘ํ•œ GNN ๊ตฌ์กฐ๋ฅผ ์ฒด๊ณ„์ ์œผ๋กœ ๋ฆฌ๋ทฐํ•˜์—ฌ, 3009์—์„œ ํŽ˜์–ด ํ‘œํ˜„์„ feature๋กœ ์„ ํƒํ•œ ์—ฐ๊ตฌ์˜ ์ด๋ก ์  ๊ทผ๊ฑฐ๋ฅผ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
Graph Neural Networks (GNNs) for Protein-Ligand Interaction ๋…ผ๋ฌธ์€ PLL์—์„œ ์ ์šฉ๋œ ๋‹ค์–‘ํ•œ GNN ์ ‘๊ทผ๋ฒ•์˜ ์ด๋ก ์  ๋ฐ ์„ฑ๋Šฅ์  ๊ธฐ๋ฐ˜์„ ์ •๋ฆฌํ•œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
๋‹จ๋ฐฑ์งˆ-๋ฆฌ๊ฐ„๋“œ ์ƒํ˜ธ์ž‘์šฉ ๊ทธ๋ž˜ํ”„ ์‹ ๊ฒฝ๋ง์˜ ๊ธฐ๋ณธ ๊ตฌ์กฐ์™€ ํ‘œํ˜„๋ ฅ์„ ๋‹ค๋ฃธ์œผ๋กœ์จ 3217์˜ ๊ณ„์ธต์  ๋‹จ๋ฐฑ์งˆ ๊ตฌ์กฐ ๋ถ„์„๊ณผ ๋งž๋‹ฟ์•„ ์žˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
3244์˜ SR-CGCNN์˜ ํŒŒ๋ผ๋ฏธํ„ฐ ํšจ์œจ์  ๊ตฌ์กฐ๋Š” 3123์—์„œ ์ œ์•ˆํ•œ ํ”„๋กœํ‹ด-๋ฆฌ๊ฐ„๋“œ ์˜ˆ์ธก ๋ชจ๋ธ์—์„œ๋„ ๋ชจ๋ธ ๊ฒฝ๋Ÿ‰ํ™”์™€ ํšจ์œจ์„ฑ ์ฆ๋Œ€์— ์ด๋ก ์  ๋””์ž์ธ ์ธ์‚ฌ์ดํŠธ๋ฅผ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋ถ„์ž ๊ตฌ์กฐ ํ‘œํ˜„์—์„œ ๊ธฐํ•˜ํ•™์  ์ •๋ณด๋ฅผ ํ™œ์šฉํ•˜๋Š” ๋ฐฉ๋ฒ•์„ ํƒ๊ตฌํ•˜์—ฌ, [3123]์˜ ๋‹จ๋ฐฑ์งˆ-๋ฆฌ๊ฐ„๋“œ ์ธํ„ฐ๋ž™์…˜ ์˜ˆ์ธก ๋ฐฉ์‹๊ณผ ๋น„๊ต๋  ์ˆ˜ ์žˆ๋Š” ๋Œ€์•ˆ์  ์ ‘๊ทผ๋ฒ•์„ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
856์€ ๊ณ„์ธต์  ๋งค์นญ ๊ธฐ๋ฐ˜์˜ ๋ถ„์ž-์•ฝ๋ฌผ ๊ตฌ์กฐ ๋ฐ ๊ณผ์ œ ์˜ˆ์ธก์„ ๋‹ค๋ฃจ๋ฉฐ, 3123์˜ GNN-๊ธฐ๋ฐ˜ ๋‹จ๋ฐฑ์งˆ-๋ฆฌ๊ฐ„๋“œ ์˜ˆ์ธก ๋ฐฉ์‹๊ณผ ๋‹ค๋ฅธ meta-learning ์ ‘๊ทผ์„ ์“ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
GNN์„ ํ™œ์šฉํ•œ ๋‹จ๋ฐฑ์งˆ-๋ฆฌ๊ฐ„๋“œ ์ƒํ˜ธ์ž‘์šฉ ์˜ˆ์ธก์„ ๋‹ค๋ฃจ๋ฉฐ, BOS-Lig ๋ฐ์ดํ„ฐ์…‹์ด ML ๋ชจ๋ธ ์„ฑ๋Šฅ ๋ฒค์น˜๋งˆํฌ์— ์œ ์šฉํ•˜๊ฒŒ ์‚ฌ์šฉ๋  ์ˆ˜ ์žˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
Graph Neural Networks (GNNs) for Protein-Ligand Interaction ๋…ผ๋ฌธ์€ ๋ฐ”์ด์˜ค-ํ™”ํ•™ ์ƒํ˜ธ์ž‘์šฉ์„ ๋ชจ๋ธ๋งํ•˜๋Š” GNN ์ ‘๊ทผ์„ ํญ๋„“๊ฒŒ ๋‹ค๋ฃจ๋ฏ€๋กœ MolX์™€ ๋น„๊ตํ•ด๋ณผ ๋งŒํ•˜๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋‹จ๋ฐฑ์งˆ-๋ฆฌ๊ฐ„๋“œ ์ƒํ˜ธ์ž‘์šฉ ์˜ˆ์ธก์˜ ๋ฒค์น˜๋งˆํ‚น ๋ฐฉ์‹์„ ์ œ๊ณตํ•˜๋ฉฐ, ๊ธฐ๋Šฅ ํ‰๊ฐ€ ํ”„๋ ˆ์ž„์›Œํฌ์ธ LAFA์™€ ๋น„๊ต๊ฐ€ ๊ฐ€๋Šฅํ•ฉ๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
์ƒ๋ฌผ์ •๋ณดํ•™ ๋ถ„์•ผ์—์„œ ํŒŒ์šด๋ฐ์ด์…˜ ๋ชจ๋ธ ํ™œ์šฉ ํ˜„ํ™ฉ์„ ์ •๋ฆฌํ•˜์—ฌ, [3123]์˜ GNN ๊ธฐ๋ฐ˜ ์˜ˆ์ธก ๋ชจ๋ธ๊ณผ LLM์„ ์—ฐ๊ฒฐ์ง“๋Š” ์—ฐ๊ตฌ ๋ฒ”์œ„๋ฅผ ๋„“ํž™๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
3123์€ GNN ๊ธฐ๋ฐ˜ ๋‹จ๋ฐฑ์งˆ-๋ฆฌ๊ฐ„๋“œ ์ƒํ˜ธ์ž‘์šฉ ์˜ˆ์ธก์œผ๋กœ 3017์˜ ๋ฉด์—ญํ‘œ์  ์„ค๊ณ„ ๋ฐ in silico ํ‰๊ฐ€ ๊ณผ์ •์˜ ๊ตฌ์กฐ์  ํˆฌ๋ช…์„ฑ์„ ํ•œ ๋‹จ๊ณ„ ๋” ๋ฐœ์ „์‹œ์ผœ ์‹ค์ œ ์•ฝ๋ฌผ-๋‹จ๋ฐฑ์งˆ ๊ฒฐํ•ฉ ์˜ˆ์ธก ๋“ฑ์— ๋ฐ”๋กœ ์ ์šฉํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
๋‹จ๋ฐฑ์งˆ-๋ฆฌ๊ฐ„๋“œ ์ƒํ˜ธ์ž‘์šฉ ์˜ˆ์ธก์˜ ์‹ฌ์ธต์ /์ง๊ต์  ํ‰๊ฐ€๋กœ ๋ณธ ๋…ผ๋ฌธ์˜ ๋ถˆํ™•์‹ค์„ฑ ์ „๋žต์˜ ์‹ค์ œ ์ ํ•ฉ๋„๋ฅผ ํ‰๊ฐ€ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
3017์˜ AI ๊ธฐ๋ฐ˜ ๋‹จ๋ฐฑ์งˆ-๋ฆฌ๊ฐ„๋“œ ์—ฐ๊ตฌ์— ๋Œ€ํ•ด, 3123์˜ GNN ๊ธฐ๋ฐ˜ ์ƒํ˜ธ์ž‘์šฉ ์˜ˆ์ธก ์—ฐ๊ตฌ๋Š” ์ด๋Ÿฌํ•œ ์„ค๊ณ„ ๊ฒฐ๊ณผ๋ฅผ ๋” ์ •๊ตํ•˜๊ฒŒ ํ‰๊ฐ€ํ•˜๊ณ  ํŒŒ์ดํ”„๋ผ์ธ์— ํˆฌ๋ช…์„ฑ๊ณผ ํšจ์œจ์„ฑ์„ ๋”ํ•ด์ค„ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
๋‹จ๋ฐฑ์งˆ-๋ฆฌ๊ฐ„๋“œ ์ธํ„ฐ๋ž™์…˜ ์˜ˆ์ธก ๋ถ„์•ผ์—์„œ ์–ธ์–ด๋ชจ๋ธ ๊ธฐ๋ฐ˜ ํšจ์†Œ ๊ธฐ๋Šฅ ํƒ์ƒ‰ ๊ฒฐ๊ณผ๋ฅผ ์ ์šฉํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •