InstructNA leverages nucleic acid large language models with HT-SELEX for de novo generation of functional nucleic acids

Authors: Zhiming Zhang, Meng Jiang, Axin He, Youyuan Zhu, Ercheng Wang, Liqi Wan, Jiezhong Qiu, Pei Guo, Guangyong Chen, Da Han | Date: 2026-03-11 | DOI: 10.1038/s43588-026-00965-3


Essence

Figure 1

Fig. 1 | Overview of the InstructNA framework. a, Architecture of the InstructNA

InstructNA is a framework that integrates a nucleic acid LLM (NA-LLM) with HT-SELEX data to generate functional nucleic acids (aptamers and transcription-factor-binding DNA) de novo, without requiring 3D structural information. Its HC-HEBO algorithm performs iterative optimization in the latent space, yielding 100 to 200% more strong-binding aptamers than existing methods.

Motivation

Achievement

Figure 2

Fig. 2 | Generating aptamer binders with InstructNA. a, Schematic showing

How


Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 | Technical Soundness: 3/5 | Significance: 4/5 | Clarity: 4/5 | Overall: 4/5

Overall assessment: InstructNA proposes a new paradigm for functional nucleic acid design by creatively combining an NA-LLM with HT-SELEX data. It demonstrates a 100 to 200% performance improvement through the HC-HEBO algorithm and closed-loop optimization, though validation on external datasets and an assessment of clinical applicability would further strengthen its impact.

Related Papers

Foundational work
Provides the foundational methodology for aptamer design using SELEX data.
Foundational work
Recent work on controlled generation of nucleic acid and protein sequences; provides a theoretical basis for generation methods and evaluation schemes.
Foundational work
Provides a foundation model for predicting nucleic acid structure-function relationships.
Foundational work
Paper 3046 surveys recent language model architectures and evaluation methods for molecular multimodal reasoning, providing a theoretical foundation for the nucleic acid LLM integration approach of paper 3138.
Alternative approach
A different generative-model approach to nucleic acid sequence optimization.
Alternative approach
A study presenting an alternative approach to chemical database curation.
Alternative approach
Presents a different AI-based approach to functional nucleic acid design.
Alternative approach
Paper 3228 presents a protein/nucleic acid language model generator combined with reinforcement learning, and contrasts with paper 3138 in how it handles iterative optimization.
Follow-up work
Paper 700 applies an LLM-based automation framework to single-cell/nucleic acid annotation, illustrating an application of the HT-SELEX-based approach of InstructNA (3138).
Follow-up work
Attempts to link life-science workflows with reproducibility, extending the range of practical AI-driven experimental applications of InstructNA.
Application case
A related applied study that uses a DCA-based model to predict specific protein interactions.