Orthrus: toward evolutionary and functional RNA foundation models

์ €์ž: Philip Fradkin, Ruian โ€œIanโ€ Shi, Taykhoom Dalal, Keren Isaev, Brendan J. Frey, Leo J. Lee, Quaid Morris, Bo Wang | ๋‚ ์งœ: 2026-04-17 | DOI: 10.1038/s41592-026-03064-3 📄 PDF


Essence

Figure 1

Fig. 1 | Overview of Orthrus. a, Contrastive dataset construction: we treated

Mamba ๊ธฐ๋ฐ˜ ์„ฑ์ˆ™ RNA ๊ธฐ๋ฐ˜๋ชจ๋ธ Orthrus๋ฅผ ์ œ์‹œํ•˜๋ฉฐ, ์ƒ๋ฌผํ•™์  ๋Œ€์กฐํ•™์Šต(contrastive learning)์œผ๋กœ ์‚ฌ์ „ํ›ˆ๋ จํ•˜์—ฌ RNA ์†์„ฑยท๊ธฐ๋Šฅ ์˜ˆ์ธก์—์„œ ๊ธฐ์กด ๊ฒŒ๋†ˆ ๊ธฐ๋ฐ˜๋ชจ๋ธ์„ ๋Šฅ๊ฐ€ํ•˜๋Š” ์„ฑ๋Šฅ์„ ๋‹ฌ์„ฑ.

Motivation

Achievement

Figure 2

Fig. 2 | mRNA property prediction using Orthrus. a, Evaluation of Orthrusโ€™s

How

Figure 1

Fig. 1 | Overview of Orthrus. a, Contrastive dataset construction: we treated

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 4/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: Orthrus๋Š” ์ƒ๋ฌผํ•™์  ์›๋ฆฌ์— ๊ธฐ์ดˆํ•œ ๋Œ€์กฐํ•™์Šต์„ ํ†ตํ•ด ๊ธฐ์กด ๊ฒŒ๋†ˆ ๊ธฐ์ดˆ๋ชจ๋ธ์˜ ํ•œ๊ณ„๋ฅผ ๊ทน๋ณตํ•˜๋ฉฐ, ํŒŒ๋ผ๋ฏธํ„ฐ ํšจ์œจ์„ฑ๊ณผ ์ €๋ฐ์ดํ„ฐ ํ™˜๊ฒฝ์—์„œ์˜ ๊ฐ•๋ ฅํ•œ ์„ฑ๋Šฅ์œผ๋กœ ์‹ค์šฉ์  ๊ฐ€์น˜๊ฐ€ ๋†’๋‹ค. ๊ฐœ๋ณ„ RNA isoform ๊ธฐ๋Šฅ ์ฃผ์„์ด๋ผ๋Š” ์˜ค๋žœ ๊ณผ์ œ์— ์ง„์ „์„ ์ œ์‹œํ•˜๋Š” ์ ์—์„œ RNA ์ƒ๋ฌผํ•™ ๋ฐ ๋ถ„์ž์˜ํ•™ ๋ถ„์•ผ์— ์ƒ๋‹นํ•œ ๊ธฐ์—ฌ๊ฐ€ ๊ธฐ๋Œ€๋œ๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
๊ณผํ•™์  ๋จธ์‹ ๋Ÿฌ๋‹ ํŒŒ์šด๋ฐ์ด์…˜ ๋ชจ๋ธ์˜ ๋ถ„์•ผ๋ณ„ ํ˜„ํ™ฉ์„ ์ฒด๊ณ„์ ์œผ๋กœ ์ •๋ฆฌํ•˜์—ฌ, RNA-๊ธฐ๋ฐ˜ ํŒŒ์šด๋ฐ์ด์…˜ ๋ชจ๋ธ ์—ฐ๊ตฌ์˜ ํฐ ํ‹€ ๊ทผ๊ฑฐ๋ฅผ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
BioInformatics Agent๋Š” ์ƒ๋ฌผ์ •๋ณดํ•™&์œ ์ „์ฒด ๊ธฐ๋ฐ˜ LLM agent์˜ ์„ค๊ณ„์™€ ์‹คํ™œ์šฉ case๋ฅผ ์ œ์‹œํ•˜์—ฌ RNA Foundation Model์˜ agentๅŒ–, ์ ์šฉ ๋ฐฉํ–ฅ ์ฐธ๊ณ ๊ฐ€ ๋œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
์ƒ๋ช…๊ณผํ•™์—์„œ์˜ ๋Œ€ํ˜• ์–ธ์–ด ๋ฐ ์ƒ๋ฌผํ•™ ๋ชจ๋ธ์„ ํฌ๊ด„์ ์œผ๋กœ ๋น„๊ต ๋ถ„์„ํ•˜์—ฌ, RNA foundation model Orthrus์˜ ์œ„์น˜์™€ ์ฐจ๋ณ„์ ์„ ์ดํ•ดํ•˜๋Š” ๋ฐ ๊ธฐ์ดˆ๊ฐ€ ๋œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
Bioinformatics foundation model ์„œ๋ฒ ์ด๋กœ Orthrus์˜ RNA ๊ธฐ๋ฐ˜ foundation model ๊ฐœ๋ฐœ ๋ฐ ๋ฒค์น˜๋งˆํฌ ๋ฐฉํ–ฅ์„ฑ์˜ ์ด๋ก ์  ํ† ๋Œ€๋ฅผ ์ œ๊ณตํ•œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
RNA ์ƒํ˜ธ์ž‘์šฉ ์˜ˆ์ธก์„ ์œ„ํ•œ ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ๋˜๋Š” LLM ๊ธฐ๋ฐ˜ ๋Œ€์•ˆ์  ์ ‘๊ทผ๋ฒ•์„ ๋‹ค๋ฃจ๋Š” ์—ฐ๊ตฌ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
์ƒ๋ถ„๋ฆฌ ๊ด€๋ จ ๋‹จ๋ฐฑ์งˆ ์„œ์—ด ๋ชจํ‹ฐํ”„๋ฅผ ์˜ˆ์ธกํ•˜๋Š” ๋Œ€์•ˆ์  ๊ณ„์‚ฐ ๋ฐฉ๋ฒ•์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
NMR ๋‹จ๋ฐฑ์งˆ ๊ตฌ์กฐ ๋ถ„์„ ๋ฐ ๊ฒ€์ฆ์„ ์œ„ํ•œ ๋Œ€์•ˆ์  ๋ฐฉ๋ฒ•๋ก ์„ ์ œ์‹œํ•˜๋Š” ์—ฐ๊ตฌ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
RNA์™€ ๋‹จ๋ฐฑ์งˆ ๊ฐ„ multi-modal ๊ด€๊ณ„๋ฅผ cross-attention์œผ๋กœ ๋ชจ๋ธ๋งํ•˜๋ฉฐ, RNA ๊ธฐ๋ฐ˜ ํŒŒ์šด๋ฐ์ด์…˜ ๋ชจ๋ธ์˜ ๋Œ€์ฒด ์—ฐ๊ตฌ์ž…๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
CrossLLM-Mamba๋Š” RNAยท๋‹จ๋ฐฑ์งˆ ์‹œํ€€์Šค ๊ฒฐํ•ฉ multi-modal state space ๋ชจ๋ธ์„ ๊ฐ•ํ™”ํ•œ ๋ฐฉ์‹์œผ๋กœ Orthrus์˜ ์ง„ํ™”์ /๊ธฐ๋Šฅ์  ํŠน์„ฑ ์˜ˆ์ธก๋ ฅ ํ™•์žฅ ๊ฐ€๋Šฅ์„ฑ์„ ์‹œ์‚ฌํ•œ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
RNA ํŒŒ์šด๋ฐ์ด์…˜ ๋ชจ๋ธ์˜ ์ง„ํ™”์ ยท๊ธฐ๋Šฅ์  ์˜ˆ์ธก ์—ญ๋Ÿ‰์„ ์‹œ์Šคํ…œ์ ์œผ๋กœ ์ œ์‹œํ•ด, ํŒŒ์šด๋ฐ์ด์…˜ ๋ชจ๋ธ์˜ OOD ํ‰๊ฐ€ ๋ฐ ํ™•์žฅ ๊ฐ€๋Šฅ์„ฑ์„ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
Orthrus: toward evolutionary and functional RNA foundation models๋Š” ์ง„ํ™” ๋ฐ ๋ณ‘์›์„ฑ ๋ณ€์ด ์˜ˆ์ธก์„ RNA(๋‹จ๋ฐฑ์งˆ ์™ธ ์ƒ์ฒด๊ณ ๋ถ„์ž)๋กœ ํ™•์žฅํ•จ์œผ๋กœ์จ 3121์„ ๋ฒ”์šฉ์˜ค๋ฏน์Šค์— ์‘์šฉํ•œ๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •