Proteo-R1: Reasoning Foundation Models for De Novo Protein Design

์ €์ž: Fang Wu, Weihao Xuan, Heli Qi, Hanqun Cao, Heng-Jui Chang, Zeqi Zhou, Haokai Zhao, Ma Jian, Carl Ma, Yu-Chi Cheng, Kuan Pang, Xiangru Tang, Zehong Wang, Guanlue Li, Hanchen Wang, Kejun Ying, Pan Lu, Chiho Im, Seungju Han, Peng Xia, Tinson Xu, Yinxi Li, Deyao Zhu, Pheng-Ann Heng, Naoto Yokoya, Masashi Sugiyama, Li Erran Li, Jure Leskovec, Yejin Choi | ๋‚ ์งœ: 2026-05-01 | URL: https://arxiv.org/abs/2605.02937 📄 PDF


Essence

Figure 1

Figure 1. Proteo-R1 couples a multimodal reasoning expert with a

์ด ๋…ผ๋ฌธ์€ ๋‹จ๋ฐฑ์งˆ ์„ค๊ณ„์—์„œ ์ถ”๋ก ๊ณผ ์ƒ์„ฑ์„ ๋ช…์‹œ์ ์œผ๋กœ ๋ถ„๋ฆฌํ•˜๋Š” dual-expert ํ”„๋ ˆ์ž„์›Œํฌ์ธ Proteo-R1์„ ์ œ์•ˆํ•œ๋‹ค. multimodal LLM์ด ๊ธฐ๋Šฅ์ ์œผ๋กœ ์ค‘์š”ํ•œ ์ž”๊ธฐ๋ฅผ ์‹๋ณ„ํ•˜๊ณ  ์ด๋ฅผ hard constraint๋กœ diffusion ๊ธฐ๋ฐ˜ ์ƒ์„ฑ ๋ชจ๋ธ์— ์ „๋‹ฌํ•จ์œผ๋กœ์จ ํ•ด์„๊ฐ€๋Šฅํ•˜๊ณ  ๋ชจ๋“ˆ์‹์ธ ๋‹จ๋ฐฑ์งˆ ์„ค๊ณ„๋ฅผ ์‹คํ˜„ํ•œ๋‹ค.

Motivation

Achievement

Figure 2

Figure 2. Three-stage training diagram of Proteo-R1. In Stage I (Multimodal Alignment), the framework uses general prote

How

Figure 2

Figure 2. Three-stage training diagram of Proteo-R1. In Stage I (Multimodal Alignment), the framework uses general prote

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 4/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: Proteo-R1์€ ๋‹จ๋ฐฑ์งˆ ์„ค๊ณ„์— ๋ช…์‹œ์  ์ถ”๋ก  ๋‹จ๊ณ„๋ฅผ ๋„์ž…ํ•˜์—ฌ ํ•ด์„๊ฐ€๋Šฅ์„ฑ, ์ œ์–ด์„ฑ, ๋ชจ๋“ˆ์„ฑ์„ ํš๊ธฐ์ ์œผ๋กœ ๊ฐœ์„ ํ•˜๋Š” ์ค‘์š”ํ•œ ๋ฐฉ๋ฒ•๋ก ์„ ์ œ์‹œํ•œ๋‹ค. dual-expert architecture์™€ residue-level constraint ๋ฉ”์ปค๋‹ˆ์ฆ˜์€ ์ฐฝ์˜์ ์ด๋ฉฐ ๊ธฐ์ˆ ์ ์œผ๋กœ ๊ฒฌ๊ณ ํ•˜๋‹ค. ๋‹ค๋งŒ ํ‰๊ฐ€๊ฐ€ antibody CDR์— ๊ตญํ•œ๋˜๊ณ  LLM ์˜ค๋ฅ˜ ์ „ํŒŒ๋‚˜ ๊ณ„์‚ฐ ํšจ์œจ์„ฑ ๋ถ„์„์ด ๋ถ€์กฑํ•œ ์ ์ด ์•„์‰ฝ๋‹ค. ํ–ฅํ›„ ๋‹ค์–‘ํ•œ ๋‹จ๋ฐฑ์งˆ ์„ค๊ณ„ ์ž‘์—…์œผ๋กœ ์ผ๋ฐ˜ํ™”ํ•˜๊ณ  ์ด๋Ÿฌํ•œ ์‹ค๋ฌด์  ์ธก๋ฉด๋“ค์„ ๋ณด์™„ํ•œ๋‹ค๋ฉด ๋‹จ๋ฐฑ์งˆ ๊ณตํ•™ ๋ถ„์•ผ์—์„œ ๋งค์šฐ ์˜ํ–ฅ๋ ฅ ์žˆ๋Š” ๋ฐฉ๋ฒ•๋ก ์ด ๋  ์ˆ˜ ์žˆ์„ ๊ฒƒ์œผ๋กœ ํŒ๋‹จ๋œ๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
์ƒ๋ฌผ์ •๋ณดํ•™ ํŒŒ์šด๋ฐ์ด์…˜ ๋ชจ๋ธ ํ™œ์šฉ ์„œ๋ฒ ์ด ๋…ผ๋ฌธ์œผ๋กœ, multi-modal LLM ๊ธฐ๋ฐ˜ ๋‹จ๋ฐฑ์งˆ ์„ค๊ณ„ ์ „๋žต์˜ ์ „๋ฐ˜์  ๋ฐฐ๊ฒฝ์„ ์ œ๊ณตํ•œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
BioLLM ์ž„๋ฒ ๋”ฉ ๋ฐ RNA-interaction ์˜ˆ์ธก ์˜์—ญ์˜ foundation model์„ ๊ฒฐํ•ฉํ•œ ์—ฐ๊ตฌ๋กœ CrossLLM-Mamba ๊ฐœ๋…์˜ ์ด๋ก ์  ๊ธฐ๋ฐ˜์„ ์ œ๊ณตํ•œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
3181์˜ ๋‹ค์ค‘ ๊ด€์  ๋‹ค๋ชฉ์  ๋‹จ๋ฐฑ์งˆ์˜ˆ์ธก ํ”„๋ ˆ์ž„์›Œํฌ ๊ฐœ๋…์ด 3224์˜ reasoning-generative dual-expert ๊ตฌ์กฐ์— ์ด๋ก ์  ๊ธฐ๋ฐ˜์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
dual expert ๊ตฌ์กฐ์™€ diffusion ์ƒ์„ฑ ๋ชจ๋ธ์„ ๋ช…์‹œ์ ์œผ๋กœ ๊ฒฐํ•ฉํ•จ์œผ๋กœ์จ 3228์˜ RL-๊ฐ€์ด๋“œ PLM ๊ธฐ๋ฐ˜ ์„œ์—ด ์ƒ์„ฑ์˜ ์ด๋ก ์  ํ† ๋Œ€๋ฅผ ์ œ๊ณตํ•œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
๋‹จ๋ฐฑ์งˆ ๊ธฐ๋Šฅ ์˜ˆ์ธก์—์„œ foundation model ํ™œ์šฉ ๋ฐ ๋„๋ฉ”์ธ ํŠนํ™” autoregressive ์ถ”๋ก ์ด๋ก ์„ BioReason-Pro์˜ ๊ธฐ๋ฐ˜์œผ๋กœ ์ œ๊ณตํ•œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
๋‹จ๋ฐฑ์งˆ ์„ค๊ณ„ ๋ฐ ๋‹จ์œ„ ๋ฐœ๊ฒฌ๊ณผ์ •์—์„œ foundation model ๋„์ž…์„, PUFFIN์˜ ๊ตฌ์กฐ-๊ธฐ๋Šฅ ์ง€๋„ ํ•™์Šต ํ”„๋ ˆ์ž„์›Œํฌ์˜ ๊ธฐ์ € ์•„์ด๋””์–ด๋กœ ์‚ผ์„ ์ˆ˜ ์žˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
de novo ๋‹จ๋ฐฑ์งˆ ์„ค๊ณ„ ๋ฐ ์ƒ์„ฑ AI ๋ชจ๋ธ์˜ ํŒŒ์šด๋ฐ์ด์…˜ ์•„ํ‚คํ…์ฒ˜, ํ•™์Šต ํ”„๋ ˆ์ž„ ๋“ฑ์ด ๊ธฐ์ดˆ๋กœ ์ œ์‹œ๋œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
์ƒ์„ฑ์  ๊ธฐ๋ฐ˜ ๋””๋…ธ๋ณด ๋‹จ๋ฐฑ์งˆ ์„ค๊ณ„ ๋ชจ๋ธ๋กœ, ๋‹จ๋ฐฑ์งˆ ๋™์—ญํ•™ยท์ง„๋™ ํŠน์„ฑ์„ ๋‹ค๋ฅด๊ฒŒ ๊ณ ๋ คํ•˜๋Š” ์ ‘๊ทผ ๋ฐฉ์‹์„ ๋น„๊ตํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
de novo ๋‹จ๋ฐฑ์งˆ ์„ค๊ณ„๋ฅผ ์‹ฌ์ธต์ ์œผ๋กœ ๋‹ค๋ฃจ๋Š” Reasoning Foundation Model ์‚ฌ๋ก€๋กœ, ๊ตฌ์กฐ ์ •์ œ์˜ ์‘์šฉ๊ณผ ํ”„๋ ˆ์ž„์›Œํฌ์˜ ์ฐจ์ด์ ์„ ๋น„๊ตํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
3224 ๋…ผ๋ฌธ์€ de novo ๋‹จ๋ฐฑ์งˆ ๋””์ž์ธ์„ ์œ„ํ•œ reasoning ๊ธฐ๋ฐ˜ ๊ธฐ์ดˆ๋ชจ๋ธ์„ ์ œ์‹œํ•ด, 3041๊ณผ ๋‹ค๋ฅธ ํฌ๋กœ์Šค์Šค์ผ€์ผ ํ‘œํ˜„ ํ†ตํ•ฉ ์ „๋žต์„ ๋น„๊ตํ•  ์ˆ˜ ์žˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
de novo ๋‹จ๋ฐฑ์งˆ ์„ค๊ณ„ ๋ฐ ํŒŒ์šด๋ฐ์ด์…˜ ๋ชจ๋ธ ๊ธฐ๋ฐ˜ ์ƒˆ๋กœ์šด ๋‹จ๋ฐฑ์งˆ ์ƒ์„ฑ์„ ์ฃผ์ œ๋กœ, ๋‹ค์–‘ํ•œ ์ž๋™ ์„ค๊ณ„ ๋ฐฉ์‹์˜ ๋น„๊ต์™€ ์ฐจ๋ณ„์ ์„ ํŒŒ์•…ํ•  ์ˆ˜ ์žˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋‘˜ ๋‹ค ๋‹จ๋ฐฑ์งˆ sequence design ๋ฌธ์ œ๋ฅผ ์‹ ๊ฒฝ-์‹ฌ๋ณผ๋ฆญ ๋ฐ ์ƒ์„ฑ AI ๊ด€์ ์—์„œ ํ’€์ง€๋งŒ, 2990์€ Neuro-Symbolic AI, 3224๋Š” LLM ๊ธฐ๋ฐ˜ dual-expert ๋ฐฉ์‹์ž…๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
์‹ฌ์ธต ํ•™์Šต ๊ธฐ๋ฐ˜ de novo ๋‹จ๋ฐฑ์งˆ ์„ค๊ณ„ ๋ฐ ๊ฒฐํ•ฉ ๋ถ€์œ„ ์˜ˆ์ธก์„ ํ•œ์ธต ํ™•์žฅํ•ด, AF2BIND์˜ ์ ์šฉ์„ฑ๊ณผ ๋ฏธ๋ž˜ ๋ฐœ์ „์— ์‹œ์‚ฌ์ ์„ ์ค๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
3181์—์„œ ์ œ๊ธฐ๋œ ํ•ด์„๊ฐ€๋Šฅ์„ฑ๊ณผ ํŠน์„ฑ ์œตํ•ฉ ๋ฌธ์ œ๋ฅผ 3224๊ฐ€ Reasoning ๋ฐ Generative dual-expert ํ”„๋ ˆ์ž„์›Œํฌ๋กœ ๋” ๋ฐœ์ „์‹œ์ผœ ํ’‰๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
BioReason-Pro์™€ ์œ ์‚ฌํ•˜๊ฒŒ ๋‹จ๋ฐฑ์งˆ ์„ค๊ณ„์™€ ๊ธฐ๋Šฅ์˜ˆ์ธก์—์„œ foundation models์™€ ์ƒ๋ฌผ์ •๋ณด ์ž„๋ฒ ๋”ฉ์„ ํ™œ์šฉํ•œ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
Proteo-R1 ๋…ผ๋ฌธ์€ ์ƒ์„ฑํ˜• ๋‹จ๋ฐฑ์งˆ ์„ค๊ณ„์˜ reasoning evaluation ๋ฐ ํ‰๊ฐ€๋ฐฉ๋ฒ•์„ ๋‹ค๋ฃจ์–ด, 3025 ์—ฐ๊ตฌ์˜ ์‘์šฉ์„ฑ๊ณผ ํ•œ๊ณ„ ๋ณด์™„์„ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•ด์ค๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
de novo ๋‹จ๋ฐฑ์งˆ ์„ค๊ณ„์™€ ๊ธฐ๋Šฅ ์˜ˆ์ธก์— ๊ด€ํ•œ ํŒŒ์šด๋ฐ์ด์…˜ ๋ชจ๋ธ์˜ ์‘์šฉ ์‚ฌ๋ก€ ์—ฐ๊ตฌ๋กœ ํ™•์žฅ์„ฑ์„ ๋ณด์ผ ์ˆ˜ ์žˆ๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
์—์ด์ „ํŠธ ๊ธฐ๋ฐ˜ de novo ๋‹จ๋ฐฑ์งˆ ์„ค๊ณ„ ํ”„๋ ˆ์ž„์›Œํฌ๋กœ ์‹ค์ œ ์„ค๊ณ„ ์ „๋žต ์ ์šฉ์— ๊ด€ํ•œ ์—ฐ๊ฒฐ๊ณ ๋ฆฌ๋ฅผ ์ œ๊ณตํ•œ๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
de novo ๋‹จ๋ฐฑ์งˆ ์„ค๊ณ„์˜ ์‹ ๋ขฐ์„ฑ ํ‰๊ฐ€ ๋ฌธ๋งฅ์—์„œ ๋‹ค์ค‘ ํƒ€๊นƒ ๋ฒค์น˜๋งˆํฌ ๊ฒฐ๊ณผ๋ฅผ ์ ์šฉํ•ด๋ณผ ์ˆ˜ ์žˆ๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •