General Multimodal Protein Design Enables DNA-Encoding of Chemistry

์ €์ž: Jarrid Rector-Brooks, Thรฉophile Lambert, Marta Skreta, Daniel Roth, Yueming Long, Zi-Qi Li, Xi Zhang, Miruna Cretu, Francesca-Zhoufan Li, Tanvi Ganapathy, Emily Jin, Avishek Joey Bose, Jason Yang, Kirill Neklyudov, Yoshua Bengio, Alexander Tong, Frances H. Arnold, Cheng-Hao Liu | ๋‚ ์งœ: 2026-04-06 | URL: https://arxiv.org/abs/2604.05181 📄 PDF


Essence

Figure 1

Figure 1. Multimodal protein design workflow with DISCO. (A) Inference overview, highlighting

DISCO๋Š” ๋‹จ๋ฐฑ์งˆ ์„œ์—ด๊ณผ 3D ๊ตฌ์กฐ๋ฅผ ๋™์‹œ์— ๊ณต๋™์„ค๊ณ„ํ•˜๋Š” multimodal diffusion ๋ชจ๋ธ๋กœ, ์‚ฌ์ „ ์ง€์ •๋œ ํ™œ์„ฑ๋ถ€์œ„ ์—†์ด ์ž์—ฐ๊ณ„์— ์กด์žฌํ•˜์ง€ ์•Š๋Š” carbene ์ „์ด ๋ฐ˜์‘์„ ์ˆ˜ํ–‰ํ•˜๋Š” ํ—ด ํšจ์†Œ๋ฅผ de novo๋กœ ์„ค๊ณ„ํ•  ์ˆ˜ ์žˆ๋‹ค.

Motivation

Achievement

Figure 2

Figure 2. DISCO generates diverse, designable protein sequences and structures across a wide

How

Figure 1

Figure 1. Multimodal protein design workflow with DISCO. (A) Inference overview, highlighting

Originality

Limitation & Further Study

Evaluation

Novelty: 5/5 Technical Soundness: 4/5 Significance: 5/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: DISCO๋Š” multimodal diffusion์„ ํ†ตํ•œ ์ง„์ •ํ•œ sequence-structure ๊ณต๋™์„ค๊ณ„์™€ ์‚ฌ์ „ motif ์ •์˜ ์—†๋Š” de novo enzyme ์„ค๊ณ„๋กœ, ์ƒ๋ฌผ์ด‰๋งค ์„ค๊ณ„์˜ ํŒจ๋Ÿฌ๋‹ค์ž„์„ ๊ทผ๋ณธ์ ์œผ๋กœ ์ „ํ™˜ํ•˜๋Š” ํš๊ธฐ์  ๊ธฐ์—ฌ๋ฅผ ํ•œ๋‹ค. ์‹คํ—˜ ๊ฒ€์ฆ์ด carbene transfer์— ์ œํ•œ๋˜์–ด ์žˆ์œผ๋‚˜, ๋ฐฉ๋ฒ•๋ก ์˜ ํ˜์‹ ์„ฑ๊ณผ ๊ธฐ์ˆ ์  ์™„์„ฑ๋„, ๊ทธ๋ฆฌ๊ณ  DNA-encoded ํ™”ํ•™ ํ™•์žฅ์˜ ์ž ์žฌ๋ ฅ์œผ๋กœ ์ธํ•ด ๋งค์šฐ ๋†’์€ ๊ฐ€์น˜๋ฅผ ์ง€๋‹Œ ์—ฐ๊ตฌ์ด๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
749๋ฒˆ ๋…ผ๋ฌธ์€ ๋‹ค๋ชจ๋‹ฌ ์„ค๊ณ„ ๋ฐ ๊ตฌ์กฐ-์„œ์—ด ๊ณต๋™์ƒ์„ฑ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ํฌ๊ด„์ ์œผ๋กœ ๋‹ค๋ฃจ๋ฏ€๋กœ, 3112์˜ multimodal diffusion ๋ชจ๋ธ ๊ธฐ๋ฐ˜ de novo ๋‹จ๋ฐฑ์งˆ ์„ค๊ณ„์˜ ์ด๋ก ์  ๋ฐฐ๊ฒฝ์ด ๋ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
MolHIT ๋…ผ๋ฌธ์€ ๋ถ„์ž ๊ทธ๋ž˜ํ”„ ์ƒ์„ฑ๊ณผ ๊ณ„์ธต์  ๊ตฌ์กฐ ๋ชจ๋ธ๋ง์„ ๋…ผ์˜ํ•˜์—ฌ, DISCO์—์„œ ํ™œ์šฉ๋  ์ˆ˜ ์žˆ๋Š” ์ƒ์„ฑํ˜• ๊ตฌ์กฐ-์„œ์—ด ์„ค๊ณ„ ๋ฐฉ๋ฒ•์˜ ํ† ๋Œ€๋ฅผ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋‹จ๋ฐฑ์งˆ ๊ตฌ์กฐ ์˜ˆ์ธก ๋ฐ ๋ณตํ•ฉ์ฒด ๋ชจ๋ธ๋ง ๋ฌธ์ œ๋ฅผ ๋‹ค๋ฅธ ์ ‘๊ทผ๋ฒ•์œผ๋กœ ํ•ด๊ฒฐํ•œ ์—ฐ๊ตฌ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋‹จ๋ฐฑ์งˆ ์„ค๊ณ„์— ์—์ด์ „ํ‹ฑ ๋˜๋Š” ๋‹ค์ค‘ ๋ชจ๋ธ ํ˜‘๋ ฅ ์ ‘๊ทผ๋ฒ•์„ ์ ์šฉํ•˜๋Š” ์œ ์‚ฌํ•œ ์—ฐ๊ตฌ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
์ „์ด๊ธˆ์† ํ™”ํ•ฉ๋ฌผ์˜ ์ „ํ•˜ ๋ถ„์„์—์„œ ์œ ์‚ฌํ•œ ์›Œํฌํ”Œ๋กœ์šฐ๋ฅผ ์ทจํ•˜๋Š” ์—ฐ๊ตฌ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
3249๋ฒˆ ๋…ผ๋ฌธ์€ ๋‹จ๋ฐฑ์งˆ-๋ฆฌ๊ฐ„๋“œ ๊ฒฐํ•ฉ ํ‘œ์  ๊ธฐ๋ฐ˜ ํŽฉํƒ€์ด๋“œ ๊ณต๋™์ƒ์„ฑ ์ ‘๊ทผ์ด๋ฏ€๋กœ, 3112๊ฐ€ ๋‹ค๋ฃจ๋Š” ์ง€์ •๋˜์ง€ ์•Š์€ ํ™œ์„ฑ๋ถ€์œ„์—์„œ์˜ de novo ๋””์ž์ธ ์ ‘๊ทผ๋ฒ•๊ณผ ๋น„๊ต ์šฐ์œ„๋ฅผ ํŒŒ์•…ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
de novo ํšจ์†Œ ์„ค๊ณ„์—์„œ ์„œ์—ด-๊ตฌ์กฐ ๋™์‹œ ์„ค๊ณ„ ๋ฐฉ๋ฒ•์ด FLIP2 ๋“ฑ๊ณผ ๋‹ฌ๋ฆฌ ๊ตฌ์ฒด์  ์‹ ๊ฒฝ๋ง ์ตœ์ ํ™” ๋ฐฉ๋ฒ•์„ ์ ์šฉํ•˜๋ฉฐ, multimodal ๋‹จ๋ฐฑ์งˆ ์„ค๊ณ„ ์ ‘๊ทผ๋ฒ• ๋น„๊ต์— ์œ ์šฉํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
ํŠน์ • ํ‘œ์ ์— ๋Œ€ํ•œ ๋‹จ๋ฐฑ์งˆ ๋ฐ”์ธ๋” ์„ค๊ณ„ ๋ฐ ์ตœ์ ํ™”์— ๋Œ€ํ•œ ๋Œ€์•ˆ์  ๋ฐฉ๋ฒ•์„ ์ œ์‹œํ•˜๋Š” ์—ฐ๊ตฌ์ด๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
Agentic End-to-End De Novo Protein Design์€ ์—์ด์ „ํŠธ ๊ธฐ๋ฐ˜์œผ๋กœ ๋‹จ๋ฐฑ์งˆ ๋‹ค์ด๋‚˜๋ฏน์Šค ์ƒ์„ฑํ˜• ์„ค๊ณ„ ์ž๋™ํ™” ๋ฐฉ๋ฒ•์„ ์ œ์‹œํ•˜์—ฌ, 3112์™€ ์ƒํ˜ธ๋ณด์™„์ ์ด๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
๋ฉ”์‹œ์ง€ ์ „๋‹ฌ ์‹ ๊ฒฝ๋ง์„ ํญ๋ฐœ๋ฌผ ๋˜๋Š” ์—๋„ˆ์ง€ ์žฌ๋ฃŒ์˜ ์„ฑ์งˆ ์˜ˆ์ธก์— ํ™•์žฅ ์ ์šฉํ•˜๋Š” ์—ฐ๊ตฌ์ด๋‹ค
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •