ProSSpeC: an interpretable probabilistic model predicting protease specificity

์ €์ž: Medel B. Lim Suan, Cheyenne Ziegler, Zain Syed, Arjun Sai Yedavalli, Jaimahesh Nagineni, Rodrigo Raposo, Ajay Tunikipati, Jaideep Kaur, Faruck Morcos, P. C. Dave P. Dingal | ๋‚ ์งœ: 2026-02-26 | DOI: 10.1038/s41467-026-69961-5 📄 PDF


Essence

Figure 1

Fig. 1 | Overview of ProSSpeC and experimental design. a Phylogeny of the

Direct coupling analysis(DCA)๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ํ•œ ProSSpeC ๋ชจ๋ธ์ด ํ”„๋กœํ…Œ์•„์ œ-๊ธฐ์งˆ ์ƒํ˜ธ์ž‘์šฉ์˜ ๊ณต์ง„ํ™” ํŠน์ง•์„ ํ•™์Šตํ•˜์—ฌ ๋‹จ์ผ ์•„๋ฏธ๋…ธ์‚ฐ ํ•ด์ƒ๋„์—์„œ ํฌํ‹ฐ๋ฐ”์ด๋Ÿฌ์Šค ํ”„๋กœํ…Œ์•„์ œ์˜ ํŠน์ด์„ฑ๊ณผ ์ ˆ๋‹จ ํšจ์œจ์„ ์˜ˆ์ธกํ•˜๊ณ  ์กฐ์ž‘ํ•œ๋‹ค.

Motivation

Achievement

Figure 2

Fig. 2 | Experimental validation of protease-substrate interactions. a Hspec dis-

How

Figure 1

Fig. 1 | Overview of ProSSpeC and experimental design. a Phylogeny of the

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ProSSpeC๋Š” ๊ณต์ง„ํ™” ๊ธฐ๋ฐ˜์˜ ์ •๋Ÿ‰์  ๋ชจ๋ธ๋กœ ๋ฏธ๊ทœ๋ช… ํ”„๋กœํ…Œ์•„์ œ์˜ ๊ธฐ๋Šฅ์„ ์˜ˆ์ธกํ•˜๊ณ  ๋‹จ์ผ ์•„๋ฏธ๋…ธ์‚ฐ ํ•ด์ƒ๋„์—์„œ ์กฐ์ž‘ํ•  ์ˆ˜ ์žˆ๋Š” ๊ฐ•๋ ฅํ•œ ๋„๊ตฌ๋ฅผ ์ œ์‹œํ•˜๋ฉฐ, 225์Œ ๊ฒ€์ฆ๊ณผ ํ•ฉ์„ฑ ์„ธํฌ ์‚ฌ๋ฉธ ํšŒ๋กœ ์‹œํ˜„์œผ๋กœ ์‹ค์šฉ์„ฑ์„ ์ž…์ฆํ–ˆ๋‹ค. ๋‹ค๋งŒ ํ™•์žฅ ๊ธฐ์งˆ ๋งฅ๋ฝ๊ณผ ์ž๊ฐ€์–ต์ œ ์˜ˆ์ธก์˜ ํ†ตํ•ฉ, ๊ทธ๋ฆฌ๊ณ  ํƒ€ ๋‹จ๋ฐฑ์งˆ ๋ถ„ํ•ดํšจ์†Œ๊ณ„ ํ™•๋Œ€ ์ ์šฉ์ด ํ›„์† ๊ณผ์ œ์ด๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
Direct coupling analysis ๊ธฐ๋ฐ˜ ๊ณต์ง„ํ™” ํŠน์ง• ํ•™์Šต์˜ ์ด๋ก ์  ๊ธฐ๋ฐ˜์ด ๋˜๋Š” ์—ฐ๊ตฌ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋‹จ๋ฐฑ์งˆ-๋ฆฌ๊ฐ„๋“œ ์ƒํ˜ธ์ž‘์šฉ ํŠน์ด์„ฑ ์˜ˆ์ธก์— ๋Œ€ํ•œ ์œ ์‚ฌํ•œ ๋ฌธ์ œ๋ฅผ ๋‹ค๋ฅธ ๋ฐฉ๋ฒ•์œผ๋กœ ์ ‘๊ทผํ•˜๋Š” ์—ฐ๊ตฌ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
์•ฝ๋™ํ•™์˜ ํŠน์ • ๊ณ„์ˆ˜ ์˜ˆ์ธก์ด ์•„๋‹Œ, ๋‹จ๋ฐฑ์งˆ ๊ตฌ์กฐ ๋ฐ ํŠน์ด์„ฑ ํ™•๋ฅ  ๋ชจ๋ธ ์ ‘๊ทผ์œผ๋กœ ๋‹ค์ค‘ ํŠน์ง• ์˜ˆ์ธก ์‹คํ—˜์„ ์ˆ˜ํ–‰ํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋‘˜ ๋‹ค ๋‹จ๋ฐฑ์งˆ์˜ ๊ฒฐํ•ฉ ๋ถ€์œ„ ๋ฐ ๋Œ์—ฐ๋ณ€์ด ๋ถ„์„์„ ์ž๋™ํ™”ํ•˜์ง€๋งŒ, ProSSpeC์€ probabilistic DCA ๊ธฐ๋ฐ˜์ด๊ณ  VARIANT๋Š” ๋ฐ”์ด๋Ÿฌ์Šค ๋ณ€์ด ์ž๋™ ๋ถ„์„์— ์ดˆ์ ์„ ๋‘ก๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
ํŽฉํƒ€์ด๋“œ ๋ฐ ๋‹จ๋ฐฑ์งˆํ‘œ์  ๋งž์ถค ์„ค๊ณ„ ์—ฐ๊ตฌ๋กœ, ๊ธฐ์งˆ ํŠน์ด์„ฑ ๋ฐ ์ ˆ๋‹จ ์œ„์น˜ ์˜ˆ์ธก์— ๋Œ€ํ•œ ML ์‘์šฉ๋ฒ•์ด ์ƒ์ดํ•˜๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
Protein-RNA ์ƒํ˜ธ์ž‘์šฉ ๋ฐ ํ™”ํ•™์  grammar ํ•™์Šต ์‹œ DCA์™€ sequence ๊ธฐ๋ฐ˜ ๋ชจ๋ธ๋ง์ด ์–ด๋–ป๊ฒŒ ์ƒํ˜ธ์ž‘์šฉ ๊ฐ€๋Šฅํ•œ์ง€ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
DCA ๊ธฐ๋ฐ˜ ๋ชจ๋ธ์„ ํŠน์ • ๋‹จ๋ฐฑ์งˆ ์ƒํ˜ธ์ž‘์šฉ ์˜ˆ์ธก์— ์ ์šฉํ•œ ๊ด€๋ จ ์‘์šฉ ์—ฐ๊ตฌ์ด๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
3218์˜ ๋‹จ๋ฐฑ์งˆ ๊ฒฐํ•ฉ ๋ถ€์œ„ ์˜ˆ์ธก ๋ชจ๋ธ์„ 3223์ฒ˜๋Ÿผ structure-informed deep learning ๋ฐฉ์‹์œผ๋กœ ์ข…๊ฐ„ ํŠน์ด์„ฑ ์ ์šฉ ๋“ฑ ์‹ค์ œ ๋ฌธ์ œ์— ์‘์šฉํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •