Linear-time prediction of proteome-scale microbial protein interactions

์ €์ž: | ๋‚ ์งœ: 2026-03-01 | URL: https://www.biorxiv.org/content/10.64898/2026.03.01.708874v1 📄 PDF


Essence

Figure 1

Figure 1: The FlashPPI framework for scalable protein-protein interaction prediction.

๋ณธ ๋…ผ๋ฌธ์€ ๋‹จ๋ฐฑ์งˆ-๋‹จ๋ฐฑ์งˆ ์ƒํ˜ธ์ž‘์šฉ(PPI) ์˜ˆ์ธก์˜ ์ด์ฐจ ๊ณ„์‚ฐ ๋ณต์žก๋„ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด FlashPPI๋ผ๋Š” ๋Œ€์กฐ ํ•™์Šต ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์•ˆํ•œ๋‹ค. gLM2 ๊ฒŒ๋†ˆ ์–ธ์–ด ๋ชจ๋ธ๋กœ๋ถ€ํ„ฐ ๊ณต์ง„ํ™” ์‹ ํ˜ธ๋ฅผ ํ™œ์šฉํ•˜์—ฌ ์„ ํ˜• ์‹œ๊ฐ„ ๋ณต์žก๋„๋กœ ๋ฏธ์ƒ๋ฌผ ํ”„๋กœํ…Œ์˜ด ๊ทœ๋ชจ์˜ ์ƒํ˜ธ์ž‘์šฉ์„ ์˜ˆ์ธกํ•˜๋ฉฐ, ์ž”๊ธฐ ์ˆ˜์ค€์˜ ์ ‘์ด‰ ๋งต ์˜ˆ์ธก์„ ํ†ตํ•ด ํ•ด์„ ๊ฐ€๋Šฅ์„ฑ์„ ํ™•๋ณดํ•œ๋‹ค.

Motivation

Achievement

Figure 2

Figure 2: Benchmarking FlashPPIโ€™s predictive performance and speed. (A) Precision-

How

Figure 1

Figure 1: The FlashPPI framework for scalable protein-protein interaction prediction.

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 4/5 Significance: 5/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ๋ณธ ๋…ผ๋ฌธ์€ PPI ์˜ˆ์ธก์˜ ๊ณ„์‚ฐ ๋ณต์žก๋„ ๋ฌธ์ œ๋ฅผ ํšจ์œจ์ ์œผ๋กœ ํ•ด๊ฒฐํ•˜๋ฉฐ ํ˜„์ €ํ•œ ์„ฑ๋Šฅ ํ–ฅ์ƒ์„ ๋‹ฌ์„ฑํ•œ๋‹ค. ์„ ํ˜• ์‹œ๊ฐ„ ๋ณต์žก๋„ ๋‹ฌ์„ฑ, ๊ฒŒ๋†ˆ ์–ธ์–ด ๋ชจ๋ธ ํ™œ์šฉ, ์ž”๊ธฐ ์ˆ˜์ค€ ํ•ด์„ ๊ฐ€๋Šฅ์„ฑ์˜ ๊ฒฐํ•ฉ์€ ๋†’์€ ํ•™์ˆ ์  ๊ธฐ์—ฌ๋ฅผ ๋ณด์—ฌ์ค€๋‹ค. ์‹ค์šฉ์  ํ”Œ๋žซํผ ์ œ๊ณต๊ณผ ๊ตฌ์กฐ ์˜ˆ์ธก ๋ชจ๋ธ ๋Œ€๋น„ ๊ฒฝ์Ÿ๋ ฅ ์žˆ๋Š” ์„ฑ๋Šฅ์€ ์ฆ‰์‹œ ์‘์šฉ ๊ฐ€์น˜๋ฅผ ์ž…์ฆํ•œ๋‹ค. ๋‹ค๋งŒ k๊ฐ’ ์„ ํƒ ์ตœ์ ํ™”, ๋‹ค์–‘ํ•œ ์œ ๊ธฐ์ฒด ํ™•์žฅ์„ฑ, ๋น„-๋ฏธ์ƒ๋ฌผ ์‹œ์Šคํ…œ ์ ์šฉ์„ฑ์— ๋Œ€ํ•œ ์ถ”๊ฐ€ ๋ถ„์„์ด ๊ฐœ์„ ์ ์ด๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
459์˜ Langauge ๋ชจ๋ธ ๊ธฐ๋ฐ˜ sequence ์„ค๊ณ„ ํ”„๋ ˆ์ž„์›Œํฌ๋Š” 3155์˜ LLM ํ™œ์šฉ PPI ์ถ”๋ก  ๋ฐฉ์‹์˜ ์ด๋ก ์  ๋ฐ”ํƒ•์ด ๋ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
๋‹จ๋ฐฑ์งˆ๊ณผ ANN ๊ตฌ์กฐ-ํ™œ์„ฑ ๋Œ€์‘์„ฑ ๋ถ„์„์—์„œ ๊ตฌ์กฐ์  ๋‹ค์–‘์„ฑ ๋ฐ ์˜ˆ์ธก ๋ถˆ๊ฐ€ ์ฐจ์›์ด ์–ด๋–ป๊ฒŒ ๋ชจ๋ธ๋ง๋˜๋Š”์ง€ ๊ธฐ์ดˆ๋ฅผ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
ํ”„๋กฌํ”„ํŠธ ํšจ์œจํ™” ๋ฐ robust prompt selection์œผ๋กœ ๋Œ€๊ทœ๋ชจ ์˜ˆ์ธก ๋ฌธ์ œ์—์„œ ํšจ์œจ์„ฑ์„ ์ถ”๊ตฌํ•œ ๋‹ค๋ฅธ ์ ‘๊ทผ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋ฏธ์ƒ๋ฌผ ๋‹จ๋ฐฑ์งˆ ๊ตฌ์กฐ ์˜ˆ์ธก์—์„œ ์‹ ์† ๋ณ€ํ˜•๊ฐ€๋Šฅํ•œ ๊ทธ๋ž˜ํ”„ ๊ธฐ๋ฒ•์„ ํ™œ์šฉํ•˜๋ฏ€๋กœ, ๊ตฌ์กฐ์  ์ œ์•ฝ์— ๋‚ด์žฌ๋œ ์ƒ์„ฑ ๋ฐฉ์‹์˜ ์ฐจ์ด์ ์„ ํ˜ผํ•ฉ์ ์œผ๋กœ ์ดํ•ดํ•  ์ˆ˜ ์žˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
Linear-time prediction of proteome-scale ๋‹จ๋ฐฑ์งˆ ๊ตฌ์กฐ ์˜ˆ์ธก ๋…ผ๋ฌธ์€ multimodal representation์ด ์•„๋‹Œ ์‹œํ€€์Šค ๊ธฐ๋ฐ˜ ์˜ˆ์ธก๋ฒ•์„ ์ œ์‹œํ•ด M2UMol์˜ multi-to-uni modal ์ „์ด์— ๋Œ€ํ•œ ๋‹ค๋ฅธ ๊ด€์ ์„ ์ œ๊ณตํ•œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
3059๋Š” de novo ํšจ์†Œ ์„ค๊ณ„์— PPI ์˜ˆ์ธก์„ ์ ์šฉํ•˜๋Š” ๋”ฅ๋Ÿฌ๋‹ ํ”„๋ ˆ์ž„์›Œํฌ์—ฌ์„œ, ๋Œ€๊ทœ๋ชจ PPI ์˜ˆ์ธก์„ ์ฃผ์ œ๋กœ ํ•˜๋Š” 3155์™€ ์ƒํ˜ธ ๋ณด์™„์ ์ž…๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋‹จ๋ฐฑ์งˆ ํŒŒ์šด๋ฐ์ด์…˜ ๋ชจ๋ธ์„ ํ™œ์šฉํ•œ ๋Œ€๊ทœ๋ชจ ๊ธฐ๋Šฅ ์˜ˆ์ธก์—์„œ PPI ์˜ˆ์ธก์˜ ๋‹ค์–‘ํ•œ ์ ‘๊ทผ๋ฒ•์„ ๋น„๊ตํ•ด๋ณผ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
FlashPPI์˜ ๋Œ€๊ทœ๋ชจ ๋‹จ๋ฐฑ์งˆ ์ƒํ˜ธ์ž‘์šฉ ์˜ˆ์ธก๊ณผ ์œ ์‚ฌํ•˜๊ฒŒ, bindable protein surface ์˜ˆ์ธก์— ๋Œ€ํ•œ ํ™•์žฅ ๋ฐฉ๋ฒ•์„ ์ œ์•ˆํ•œ๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
FlashPPI์˜ ์„ ํ˜• ์‹œ๊ฐ„ ๋ณต์žก๋„ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์‹ค์ œ ํ™”ํ•™/์ƒ๋ฌผ ์ •๋ณด ์˜ˆ์ธก์— ์ ์šฉํ•œ ์—ฐ๊ตฌ๋กœ ์‹œ๋„ˆ์ง€๊ฐ€ ์žˆ์Šต๋‹ˆ๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
487์˜ ๋‹ค์ค‘๋ชจ๋‹ฌ ์ƒ๋ฌผ๋ถ„์ž ํ‘œํ˜„ ํ•™์Šต์ด ์‹ค์ œ ๋‹จ๋ฐฑ์งˆ ์ƒํ˜ธ์ž‘์šฉ ์˜ˆ์ธก ๋ฐ ๊ธฐ๋Šฅ๋ชจ๋ธ๋ง(3155)์— ์–ด๋–ป๊ฒŒ ์ ์šฉ๋˜๋Š”์ง€๋ฅผ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •