Cross-Attention Over RNA And Protein Sequences Enables Generalizable Interaction Prediction

์ €์ž: | ๋‚ ์งœ: 2026-04-22 | URL: https://www.biorxiv.org/content/10.64898/2026.04.22.720174v1 📄 PDF


Essence

Figure 3

Figure 3 | Cross-attention scores from CORAL recapitulate structural contact interfaces in the TetRโ€“RNA aptamer K1 compl

๋ณธ ์—ฐ๊ตฌ๋Š” pretrained protein ์–ธ์–ด๋ชจ๋ธ(ESM-2)๊ณผ RNA ์–ธ์–ด๋ชจ๋ธ(DNABERT2)์„ bidirectional cross-attention์œผ๋กœ ํ†ตํ•ฉํ•œ ๋”ฅ๋Ÿฌ๋‹ ํ”„๋ ˆ์ž„์›Œํฌ CORAL์„ ์ œ์‹œํ•˜๋ฉฐ, RNA-protein ์ƒํ˜ธ์ž‘์šฉ ์˜ˆ์ธก์—์„œ ๋ฐ์ดํ„ฐ ์ค‘๋ณต์„ฑ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ณ  ์ผ๋ฐ˜ํ™” ์„ฑ๋Šฅ์„ ํ–ฅ์ƒ์‹œํ‚จ๋‹ค.

Motivation

Achievement

Figure 2

Figure 2 | Redundancy definitions for training-test partitioning of RNA-protein interaction datasets. (a) Two operationa

How

Figure 1

Figure 1 | Overview of CORAL Architecture. The model employs a dual-encoder framework with pretrained language models: D

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 4/5 Significance: 5/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ๋ณธ ์—ฐ๊ตฌ๋Š” RNA-๋‹จ๋ฐฑ์งˆ ์ƒํ˜ธ์ž‘์šฉ ์˜ˆ์ธก์˜ ์ผ๋ฐ˜ํ™” ๋ฌธ์ œ๋ฅผ ๋ฐ์ดํ„ฐ ์ค‘๋ณต์„ฑ ๊ด€์ ์—์„œ ์ฒ˜์Œ ์ฒด๊ณ„์ ์œผ๋กœ ๊ทœ๋ช…ํ•˜๊ณ , cross-attention ๊ธฐ๋ฐ˜ ์•„ํ‚คํ…์ฒ˜์™€ ์—„๊ฒฉํ•œ ๋ฒค์น˜๋งˆํ‚น์œผ๋กœ ์‹ ๋ขฐํ•  ์ˆ˜ ์žˆ๋Š” ์˜ˆ์ธก ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์‹œํ•œ ์˜๋ฏธ ์žˆ๋Š” ๊ธฐ์—ฌ์ด๋ฉฐ, ์ƒ๋ฌผํ•™์  ํ•ด์„ ๊ฐ€๋Šฅ์„ฑ ๋ถ„์„์€ ๋ชจ๋ธ์˜ ํˆฌ๋ช…์„ฑ์„ ๊ฐ•ํ™”ํ•œ๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
Multi-Modal Foundation Models์˜ ๋ถ„์ž ๊ตฌ์กฐ ๋ฐ ์„œ์—ด ํ‘œํ˜„ ํ†ตํ•ฉ์— ๊ด€ํ•œ ์ด๋ก ์  ํ”„๋ ˆ์ž„์ด RNA-Protein ์ƒํ˜ธ์ž‘์šฉ ์˜ˆ์ธก์˜ ๊ทผ๊ฐ„์ด ๋œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ๋ฐ”์ด์˜ค ํŒŒ์šด๋ฐ์ด์…˜ ๋ชจ๋ธ์˜ ์›๋ฆฌ์™€ ๋ฐ์ดํ„ฐ ์ค‘๋ณต์„ฑ, ์ผ๋ฐ˜ํ™” ๋ฌธ์ œ ๋“ฑ ๋Œ€๊ทœ๋ชจ ๋ชจ๋ธ ์„ค๊ณ„ ์ฒ ํ•™์„ ํ•จ๊ป˜ ์ดํ•ดํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
๋‹จ๋ฐฑ์งˆยทRNA ์–ธ์–ด๋ชจ๋ธ ๋‚ด๋ถ€ ๋ฉ”์ปค๋‹ˆ์ฆ˜ ๋ฐ early-exit ๋“ฑ ํšจ์œจํ™” ๊ธฐ๋ฒ•์ด, Cross-Attention ๊ธฐ๋ฐ˜ ํ†ตํ•ฉ๋ชจ๋ธ์˜ ๊ธฐ๋ฐ˜์ด ๋œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
CORAL์€ cross-attention ๊ธฐ๋ฐ˜ RNA-protein ์˜ˆ์ธก ๋”ฅ๋Ÿฌ๋‹ ํ”„๋ ˆ์ž„์›Œํฌ๋กœ, CrossLLM-Mamba์˜ state-space ์„ค๊ณ„์™€ ์ ‘๊ทผ๋ฒ• ์ฐจ๋ณ„์„ฑ์„ ์‚ดํŽด๋ณผ ์ˆ˜ ์žˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
์ƒ๋ฌผํ•™์  ์‹œ์Šคํ…œ ํ•ด์„์— ์žˆ์–ด ์ธํ„ฐํ”„๋ฆฌํ„ฐ๋ธ” ๋จธ์‹ ๋Ÿฌ๋‹ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์‹œํ•˜์—ฌ, RNA-๋‹จ๋ฐฑ์งˆ ์ƒํ˜ธ๊ด€๊ณ„ ์˜ˆ์ธก์˜ ๋‹ค๋ฅธ ์ ‘๊ทผ๋ฒ•์„ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
RNA์™€ ๋‹จ๋ฐฑ์งˆ ๊ฐ„ multi-modal ๊ด€๊ณ„๋ฅผ cross-attention์œผ๋กœ ๋ชจ๋ธ๋งํ•˜๋ฉฐ, RNA ๊ธฐ๋ฐ˜ ํŒŒ์šด๋ฐ์ด์…˜ ๋ชจ๋ธ์˜ ๋Œ€์ฒด ์—ฐ๊ตฌ์ž…๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
CrossLLM-Mamba๋Š” RNA-๋‹จ๋ฐฑ์งˆ ์ƒํ˜ธ์ž‘์šฉ ์˜ˆ์ธก์˜ ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ๋ชจ๋ธ์„ ์ œ์‹œํ•˜์—ฌ, CORAL ํ”„๋ ˆ์ž„์›Œํฌ์˜ bidirectional cross-attention ๊ฐœ๋…์„ ์‹ค์งˆ์ ์œผ๋กœ ํ™•์žฅํ•œ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
Cross-attention ๊ธฐ๋ฐ˜ RNA-๋‹จ๋ฐฑ์งˆ ์ƒํ˜ธ์ž‘์šฉ ์˜ˆ์ธก์„ ์—ฌ๋Ÿฌ ์ด๊ธฐ์ข… ์ƒ๋ฌผ์ •๋ณด ๋ชจ๋‹ฌ๋ฆฌํ‹ฐ(๋„คํŠธ์›Œํฌ ๋“ฑ)๋ฅผ ์œตํ•ฉํ•˜๋Š” ๋‹ค์ค‘๋ชจ๋‹ฌ ํ”„๋ ˆ์ž„์›Œํฌ๋กœ ํ™•์žฅํ•œ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
RNA-๋‹จ๋ฐฑ์งˆ ์„œ์—ด ๊ฐ„์˜ cross-attention์„ ํ™œ์šฉํ•œ ์ƒ์„ฑ์  ์˜ˆ์ธก ๋ชจ๋ธ๋กœ, ViraHinter์˜ ๊ตฌ์กฐ+์„œ์—ด ์œตํ•ฉ์ด๋ผ๋Š” ์ด์ค‘๋ชจ๋‹ฌ ํ”„๋ ˆ์ž„์›Œํฌ ํ™•์žฅ์— ๋„์›€์ด ๋ฉ๋‹ˆ๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •