Structural bias in machine learning-guided peptide design

์ €์ž: | ๋‚ ์งœ: 2026-05-08 | URL: https://www.biorxiv.org/content/10.64898/2026.05.06.721805v1 📄 PDF


Essence

Figure 1

Figure 1. Predicted structural landscapes of (A) GRAMPA and (B) non-GRAMPA.

๋ณธ ์—ฐ๊ตฌ๋Š” ๋จธ์‹ ๋Ÿฌ๋‹ ๊ธฐ๋ฐ˜ ํŽฉํƒ€์ด๋“œ ์„ค๊ณ„์˜ ์‹ ๋ขฐ์„ฑ ๋ฌธ์ œ๋ฅผ ์ฒด๊ณ„์ ์œผ๋กœ ๋ถ„์„ํ•œ๋‹ค. ํ•ญ๊ท  ํ™œ์„ฑ ์˜ˆ์ธก ๋ชจ๋ธ 16์ข…์„ ๊ฐ์‚ฌํ•˜์—ฌ ํ•™์Šต ๋ฐ์ดํ„ฐ์˜ ๋ถˆ๊ท ํ˜•ํ•œ fold ํ‘œํ˜„๊ณผ data leakage๊ฐ€ ๊ตฌ์กฐ์  ํŽธํ–ฅ์„ ์œ ๋ฐœํ•˜๋ฉฐ, ์ด๊ฒƒ์ด ๋ชจ๋ธ ์˜ˆ์ธก ์„ฑ๋Šฅ์„ ์‹ฌ๊ฐํ•˜๊ฒŒ ์™œ๊ณกํ•จ์„ ์ •๋Ÿ‰ํ™”ํ•œ๋‹ค.

Motivation

Achievement

Figure 1

Figure 1. Predicted structural landscapes of (A) GRAMPA and (B) non-GRAMPA.

How

Figure 4

Figure 4. Structural diversity of GRAMPA and non-GRAMPA external validation

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 4/5 Significance: 5/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ๋ณธ ์—ฐ๊ตฌ๋Š” ๋จธ์‹ ๋Ÿฌ๋‹ ๊ธฐ๋ฐ˜ ํŽฉํƒ€์ด๋“œ ์„ค๊ณ„์˜ ์‹ ๋ขฐ์„ฑ์„ ์œ„ํ˜‘ํ•˜๋Š” structural bias๋ฅผ ์ฒด๊ณ„์ ์œผ๋กœ ๊ทœ๋ช…ํ•œ ์ค‘์š”ํ•œ ๊ฐ์‚ฌ ์—ฐ๊ตฌ๋‹ค. 16๊ฐœ ๋ชจ๋ธ์˜ ๊ด‘๋ฒ”์œ„ํ•œ ๋ฒค์น˜๋งˆํ‚น๊ณผ ์—„๊ฒฉํ•œ ์™ธ๋ถ€ ๊ฒ€์ฆ์„ ํ†ตํ•ด ํ•™์Šต ๋ฐ์ดํ„ฐ์˜ fold ๋ถˆ๊ท ํ˜•์ด ๋ชจ๋“  ์•„ํ‚คํ…์ฒ˜์˜ ์˜ˆ์ธก ์„ฑ๋Šฅ์„ ์ผ๊ด€๋˜๊ฒŒ ์™œ๊ณกํ•จ์„ ์ •๋Ÿ‰์ ์œผ๋กœ ์ž…์ฆํ–ˆ๋‹ค. ๋‹ค๋งŒ ๊ตฌ์ฒด์ ์ธ ์™„ํ™” ์ „๋žต์˜ ๋ถ€์žฌ์™€ AMP ์ค‘์‹ฌ์˜ ์ผ€์ด์Šค ์Šคํ„ฐ๋””๋กœ ์ธํ•ด ์˜ํ–ฅ๋ ฅ์€ ์ œํ•œ์ ์ด๋‚˜, ๋‹จ๋ฐฑ์งˆ ๋จธ์‹ ๋Ÿฌ๋‹ ๋ถ„์•ผ์˜ ์ฑ…์ž„ ์žˆ๋Š” ๊ฐœ๋ฐœ์„ ์œ„ํ•œ ํ‘œ์ค€ ๊ฐ์‚ฌ ๊ด€ํ–‰์„ ์ œ์‹œํ•œ ์ ์—์„œ ํ•™์ˆ ์ ยท์‹ค๋ฌด์  ๊ฐ€์น˜๊ฐ€ ํฌ๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
๊ธฐ๊ณ„ํ•™์Šต ๊ธฐ๋ฐ˜ ์•ฝ๋ฌผ์„ค๊ณ„์˜ ๋ฉ”์ปค๋‹ˆ์ฆ˜ ๋ฐ ํ•ด์„๊ฐ€๋Šฅ์„ฑ์— ๋Œ€ํ•œ ๊ธฐ์ดˆ์  ์ ‘๊ทผ์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
๋ถ„์ž ์„ค๊ณ„, ์•ฝ๋ฌผ ๋ฐ ํŽฉํƒ€์ด๋“œ ์ƒ์„ฑ์— ์žˆ์–ด ๋ฐ์ดํ„ฐ ๋ถ„ํฌ ๋ฐ bias์˜ ์˜ํ–ฅ, ๋ฒค์น˜๋งˆํ‚น ๋ฐฉ๋ฒ•์— ๋Œ€ํ•œ ์ „๋ฐ˜์  ๋…ผ์˜๋ฅผ ์ œ๊ณตํ•œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
ํŠธ๋žœ์Šคํฌ๋จธ ํ•ด์„ ํ”„๋ ˆ์ž„์›Œํฌ์—์„œ bias, ๋ฐ์ดํ„ฐ ํŽธํ–ฅ, model reliability์— ๋Œ€ํ•œ ์‹ฌ์ธต์  ๋ถ„์„์œผ๋กœ, 3248์˜ ํŽฉํƒ€์ด๋“œ ์„ค๊ณ„ ํŽธํ–ฅ ๋ฌธ์ œ ์„ค๋ช…์— ๋Œ€์กฐ์  ํ•ด์„์„ ์ œ๊ณตํ•œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
Structural bias in machine learning-guided peptide design ๋…ผ๋ฌธ์€ ๋‹จ๋ฐฑ์งˆ ์„ค๊ณ„์—์„œ ๋‹ค์–‘ํ•œ ๊ตฌ์กฐ์ /๋ถ„ํฌ์  ํŽธํ–ฅ ๋ฌธ์ œ๋ฅผ ์ œ์‹œํ•˜์—ฌ DDS์˜ density-adaptive ์ ‘๊ทผ๊ณผ ๋Œ€์กฐ๋œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
3248 ๋…ผ๋ฌธ์€ ๋จธ์‹ ๋Ÿฌ๋‹ ๊ธฐ๋ฐ˜ ํŽฉํƒ€์ด๋“œ ๋””์ž์ธ์— ๊ตฌ์กฐ์  bias ๋ฌธ์ œ๊ฐ€ ์žˆ์Œ์„ ํ†ต๊ณ„์ ์œผ๋กœ ๋ถ„์„ํ•˜์—ฌ, 3028์˜ ์‹ ๋ขฐ์„ฑยท์‹คํŒจ์ผ€์ด์Šค ๋ถ„์„๊ณผ ํ•จ๊ป˜ ์ฝ์œผ๋ฉด ํ˜„๋Œ€ ๊ตฌ์กฐ์ƒ์„ฑ์˜ ํ•œ๊ณ„๋ฅผ ๋‹ค์–‘ํ•œ ๊ด€์ ์—์„œ ์กฐ๋ช…ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋จธ์‹ ๋Ÿฌ๋‹ ๊ธฐ๋ฐ˜ ๋‹จ๋ฐฑ์งˆ ์„ค๊ณ„ bias ๋ฌธ์ œ์™€ ๊ตฌ์กฐ ํ‰๊ฐ€ ์ง€ํ‘œ ๊ฐœ๋ฐœ์—์„œ ์ƒํ˜ธ ๋ณด์™„์  ๊ด€์ ์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
GNN ๊ธฐ๋ฐ˜ ๋‹จ๋ฐฑ์งˆ ๊ตฌ์กฐยท๊ธฐ๋Šฅ ์˜ˆ์ธก์—์„œ ๋ฐ์ดํ„ฐ ๋ถˆ๊ท ํ˜• ๋ฐ ํŽธํ–ฅ ๋ฌธ์ œ๊ฐ€ ์‹ค์ „ ์„ค๊ณ„์— ๋ฏธ์น˜๋Š” ์˜ํ–ฅ์„ ๋น„๊ตํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
๋จธ์‹ ๋Ÿฌ๋‹ ์œ ๋„ ํŽฉํƒ€์ด๋“œ ๋””์ž์ธ์˜ ๊ตฌ์กฐ์  ํŽธํ–ฅ ๋ฌธ์ œ๋ฅผ ๋‹ค๋ฃธ์œผ๋กœ์จ, ๊ตฌ์กฐ ๊ธฐ๋ฐ˜ ํ† ํฐํ™”๊ฐ€ ์‹ค์ œ ๋‹จ๋ฐฑ์งˆ/์•ฝ๋ฌผ ๋””์ž์ธ ์„ฑ๋Šฅ์— ๋ฏธ์น˜๋Š” ์˜ํ–ฅ์„ ํ™•์žฅํ•œ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
ํŽฉํƒ€์ด๋“œ ์„ค๊ณ„ ์‹œ ๊ตฌ์กฐ์  ํŽธํ–ฅ๊ณผ ๋ฐ์ดํ„ฐ ์ธก๋ฉด ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๋Š” ๊ตฌ์กฐ ๊ธฐ๋ฐ˜ ์ƒ์„ฑํ˜• ์„ค๊ณ„ ์ ‘๊ทผ๋ฒ•์˜ ์‚ฌ๋ก€์ž…๋‹ˆ๋‹ค.
๋ฐ˜๋ก /๋น„ํŒ
๋จธ์‹ ๋Ÿฌ๋‹ ๊ธฐ๋ฐ˜ ํŽฉํƒ€์ด๋“œ ๋””์ž์ธ์—์„œ ๊ตฌ์กฐ์  ๋ฐ”์ด์–ด์Šค ๋ฌธ์ œ๋ฅผ ๋ถ„์„ํ•˜์—ฌ, RFdiffusion ํ•ญ์ฒด ์„ค๊ณ„์˜ ํ•œ๊ณ„์™€ ๊ฐœ์„ ์ ์„ ๋น„ํŒ์ ์œผ๋กœ ์‚ดํŽด๋ณผ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
๋ฐ˜๋ก /๋น„ํŒ
๋จธ์‹ ๋Ÿฌ๋‹ ์ฃผ๋„ ํŽฉํƒ€์ด๋“œ ์„ค๊ณ„์—์„œ์˜ ๊ตฌ์กฐ์  ํŽธํ–ฅ ๋ฌธ์ œ๋ฅผ ์ง€์ ํ•˜๋Š” ๋…ผ๋ฌธ์œผ๋กœ, AlphaFold๋ฅ˜ ๋ชจ๋ธ์˜ ํ•œ๊ณ„๋ฅผ ๋น„ํŒํ•ฉ๋‹ˆ๋‹ค.
๋ฐ˜๋ก /๋น„ํŒ
๋ฆฌ๋”๋ณด๋“œ ๊ณผ์ ํ•ฉ ๋ฐ ์ผ๋ฐ˜ํ™” ํ•œ๊ณ„๋ฅผ ๋…ผ์˜ํ•˜์—ฌ ๋จธ์‹ ๋Ÿฌ๋‹ ๊ธฐ๋ฐ˜ ํŽฉํƒ€์ด๋“œ ์„ค๊ณ„์˜ ์‹ ๋ขฐ์„ฑ ๋ฌธ์ œ๋ฅผ ๋ณด์™„ํ•ฉ๋‹ˆ๋‹ค.
๋ฐ˜๋ก /๋น„ํŒ
Structural bias in machine learning-guided peptide design ๋…ผ๋ฌธ์€ ๊ตฌ์กฐ์  ํŽธํ–ฅ์˜ ํ•œ๊ณ„๋ฅผ ์ œ์‹œํ•˜์—ฌ MOGP-MMF๊ฐ€ multi-view feature ์œตํ•ฉ์„ ํ†ตํ•ด ์ด ๋ฌธ์ œ๋ฅผ ๋ณด์™„ํ•  ์ˆ˜ ์žˆ์Œ์„ ์‹œ์‚ฌํ•œ๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •