Protein Language Models Diverge from Natural Language: Comparative Analysis and Improved Inference

์ €์ž: | ๋‚ ์งœ: 2026-02-24 | URL: https://arxiv.org/abs/2602.20449 📄 PDF


Essence

Figure 2

Figure 2: Many PLMs display more variability in their attention focus than the cor-

๋‹จ๋ฐฑ์งˆ ์–ธ์–ด ๋ชจ๋ธ(PLM)์ด ์ž์—ฐ์–ด ๋ชจ๋ธ(NLM)๊ณผ ๋‚ด๋ถ€ ๋ฉ”์ปค๋‹ˆ์ฆ˜์—์„œ ๊ทผ๋ณธ์ ์œผ๋กœ ๋‹ค๋ฅด๊ฒŒ ์ž‘๋™ํ•จ์„ ๋ถ„์„ํ•˜๊ณ , ๋™์  early-exit ๊ธฐ๋ฒ•์„ ์ ์šฉํ•˜์—ฌ ์„ฑ๋Šฅ๊ณผ ํšจ์œจ์„ ๋™์‹œ์— ๊ฐœ์„ ํ•œ๋‹ค.

Motivation

Achievement

Figure 3

Figure 3: Early-Exit Improves both Performance and Efficiency in Non-Structural

How

Figure 1

Figure 1: The scheme for early-exit, based on Schwartz et al. (Schwartz et al., 2020).

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ๋ณธ ๋…ผ๋ฌธ์€ PLM๊ณผ NLM์˜ ๊ทผ๋ณธ์  ์ฐจ์ด๋ฅผ ์ฒด๊ณ„์ ์œผ๋กœ ๊ทœ๋ช…ํ•˜๊ณ  ์ด๋ฅผ ๋ฐ”ํƒ•์œผ๋กœ ์‹ค์งˆ์ ์ธ ์„ฑ๋Šฅ ๋ฐ ํšจ์œจ์„ฑ ๊ฐœ์„ ์„ ๋‹ฌ์„ฑํ•œ ์ค‘์š”ํ•œ ์—ฐ๊ตฌ์ด๋‹ค. ์ƒ๋ฌผํ•™์  ๋ฐ์ดํ„ฐ์— ํŠนํ™”๋œ ๋ชจ๋ธ ๊ฐœ๋ฐœ ๋ฐฉํ–ฅ์„ ์ œ์‹œํ•˜๋ฉฐ, ๋ฐฉ๋ฒ•๋ก ์˜ ์ฐฝ์˜์„ฑ๊ณผ ์‹คํ—˜์  ํƒ€๋‹น์„ฑ์ด ์šฐ์ˆ˜ํ•˜๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
440์˜ ๊ทธ๋ž˜ํ”„์‹ ๊ฒฝ๋ง ๋ฐ ๊ฐ•ํ™”ํ•™์Šต ๊ธฐ๋ฐ˜ ๊ตฌ์กฐ ์„ค๊ณ„๋Š” 3221์—์„œ ๋…ผ์˜ํ•˜๋Š” ์–ธ์–ด๋ชจ๋ธ ๊ธฐ๋ฐ˜ ํ™”ํ•™/์ƒ๋ช…๊ณผํ•™ ๊ตฌ์กฐ-์–ธ์–ด ์—ฐ๊ฒฐ ๋…ผ์˜์˜ ์ด๋ก ์  ํ† ๋Œ€์™€ ๋งž๋‹ฟ์•„ ์žˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
๋‹จ๋ฐฑ์งˆ ์–ธ์–ด ๋ชจ๋ธ๊ณผ ์ž์—ฐ์–ด ๋ชจ๋ธ์˜ ๋น„๊ต ๋ถ„์„์— ๋Œ€ํ•œ ๋ฐฉ๋ฒ•๋ก ์  ๊ธฐ๋ฐ˜์„ ์ œ๊ณตํ•˜๋Š” ์—ฐ๊ตฌ์ด๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
๋‹จ๋ฐฑ์งˆ ์–ธ์–ด๋ชจ๋ธ์˜ ๊ธฐ์ดˆ ๊ตฌ์กฐ ๋ฐ ์ž์—ฐ์–ด๋ชจ๋ธ๊ณผ์˜ ๋น„๊ต๋ฅผ ๋‹ค๋ฃจ์–ด 3221์˜ ๋‚ดยท์™ธ๋ถ€ ๋ฉ”์ปค๋‹ˆ์ฆ˜ ๋น„๊ต ์—ฐ๊ตฌ์— ์ด๋ก  ๋ฐฐ๊ฒฝ์„ ์ œ๊ณตํ•œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
ProtoMech ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ๋‹จ๋ฐฑ์งˆ ์–ธ์–ด ๋ชจ๋ธ์˜ ๋‚ด๋ถ€ ๋ฉ”์ปค๋‹ˆ์ฆ˜ ์ฐจ์ด๋ฅผ ๋ถ„์„ํ•˜๋Š” method๊ฐ€ ๊ณต์œ ๋œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
Protein Language Models Diverge from Natural Language ๋…ผ๋ฌธ์€ ๋‹จ๋ฐฑ์งˆ language model์˜ ๋…์ž์  ํŠน์„ฑ์„ ๋ถ„์„ํ•˜์—ฌ Neurotox์˜ ์‹œํ€€์Šค ๊ธฐ๋ฐ˜ deep embedding์˜ ์ด๋ก ์  ๊ทผ๊ฑฐ๋ฅผ ๊ฐ•ํ™”ํ•œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
๋‹จ๋ฐฑ์งˆยทRNA ์–ธ์–ด๋ชจ๋ธ ๋‚ด๋ถ€ ๋ฉ”์ปค๋‹ˆ์ฆ˜ ๋ฐ early-exit ๋“ฑ ํšจ์œจํ™” ๊ธฐ๋ฒ•์ด, Cross-Attention ๊ธฐ๋ฐ˜ ํ†ตํ•ฉ๋ชจ๋ธ์˜ ๊ธฐ๋ฐ˜์ด ๋œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋‹จ๋ฐฑ์งˆ ์–ธ์–ด ๋ชจ๋ธ์˜ ํ‘œํ˜„ ํ•™์Šต๊ณผ ํšจ์œจ์„ฑ ๊ฐœ์„ ์— ๋Œ€ํ•œ ์œ ์‚ฌํ•œ ๋ฌธ์ œ๋ฅผ ๋‹ค๋ฃจ๋Š” ์—ฐ๊ตฌ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋‹จ๋ฐฑ์งˆ ์–ธ์–ด ๋ชจ๋ธ์˜ ๋‚ด๋ถ€ ๋ฉ”์ปค๋‹ˆ์ฆ˜ ๋ถ„์„์— ๋Œ€ํ•œ ๋Œ€์•ˆ์  ์ ‘๊ทผ๋ฒ•์„ ์ œ์‹œํ•˜๋Š” ์—ฐ๊ตฌ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
ํ”„๋ฆฌํŠธ๋ ˆ์ธ ํŠธ๋žœ์Šคํฌ๋จธ ํ•ด์„ ๊ฐ€๋Šฅ์„ฑ์— ๋Œ€ํ•œ ๋ฆฌ๋ทฐ๋กœ, PLM-NLM ๊ตฌ์กฐ ์ฐจ์ด ํ•ด์„์— ๋Œ€ํ•œ ๋ณด์™„์  ์‹œ๊ฐ์„ ์ œ์‹œํ•œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋‹จ๋ฐฑ์งˆ ์–ธ์–ด๋ชจ๋ธ ๋‚ด๋ถ€์˜ ๋ฐ˜๋ณต ํŒจํ„ด ํƒ์ง€ ์—ฐ๊ตฌ๊ฐ€, PLM/NLM ๋‚ด๋ถ€ ๊ตฌ์กฐ์˜ ์ฐจ์ด ํ•ด์„๊ณผ ๋Œ€์กฐ๋œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
์ž์—ฐ์–ด์™€ ๋‹จ๋ฐฑ์งˆ์˜ ํ‘œํ˜„ ๊ณต๊ฐ„ ํŠน์„ฑ์„ ๋น„๊ตํ•˜๋ฉฐ ๋ถ„์žยท๋‹จ๋ฐฑ์งˆ ํ†ตํ•ฉ ํ‘œํ˜„์— ๋Œ€ํ•œ ๋‹ค๋ฅธ ๊ด€์ ์„ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋‹จ๋ฐฑ์งˆ ์–ธ์–ด๋ชจ๋ธ์ด ์ž์—ฐ ์–ธ์–ด์™€ ์–ด๋–ป๊ฒŒ ๋‹ค๋ฅธ์ง€ ๋ถ„์„ํ•˜๋ฏ€๋กœ, PLM ๊ธฐ๋ฐ˜ ๊ธฐ๋Šฅ ์˜ˆ์ธก ์—ฐ๊ตฌ์™€ ์‹œ๋„ˆ์ง€ ํšจ๊ณผ๊ฐ€ ์žˆ์Šต๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
์–ธ์–ด ๋ชจ๋ธ์˜ ๋‚ด๋ถ€ ์ž‘๋™ ๋ฉ”์ปค๋‹ˆ์ฆ˜ ๋ถ„์„ ๋ฐ ํšจ์œจํ™”์— ๋Œ€ํ•œ ๋Œ€์•ˆ์  ๊ด€์ ์„ ์ œ์‹œํ•˜๋Š” ์—ฐ๊ตฌ์ด๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
์žฌ๋ฃŒ ๋ฐœ๊ฒฌ ๋ถ„์•ผ foundation model ํ™œ์šฉ ํ˜„์‹ค์„ ๋ถ„์„ํ•ด, ๋‹จ๋ฐฑ์งˆ ์–ธ์–ด๋ชจ๋ธ์˜ ์ž์—ฐ์–ด์™€์˜ ์ฐจ๋ณ„์  ์„ฑ๋Šฅ๊ณผ ์‹ค์ œ ์‘์šฉ์„ ๋…ผ์˜ํ•  ๋•Œ ์ฐธ๊ณ ๋œ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
Protein Language Models Diverge from Natural Language๋Š” ๋‹จ๋ฐฑ์งˆ ์–ธ์–ด๋ชจ๋ธ์˜ ๊ณ ์œ  ๊ตฌ์กฐ์™€ ์ง„ํ™” ์ •๋ณด ์ดํ•ด๋ฅผ ๋ถ„์„ํ•ด, 3109์˜ ๋‹จ๋ฐฑ์งˆ ๋ณ€์ด ์˜ˆ์ธก ์—ฐ๊ตฌ๋ฅผ ์ด๋ก ์ ์œผ๋กœ ์‹ฌํ™”ํ•œ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
๋‹จ๋ฐฑ์งˆ ์–ธ์–ด ๋ชจ๋ธ์˜ ๋‚ด๋ถ€ ๋ฉ”์ปค๋‹ˆ์ฆ˜ ํ•ด์„์„ ๋น„๊ต/๋ถ„์„ํ•œ ๋…ผ๋ฌธ์œผ๋กœ, ProtoMech์˜ ๊ฒฐ๊ณผ์™€ ์ง์ ‘์ ์œผ๋กœ ์—ฐ๊ด€๋œ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
Unsupervised protein language models์˜ ํŒจํ„ด ํ•™์Šต์ด ์‹ค์ œ ์ƒ๋ฌผ ๋ฐ˜์‘๊ณผ ์–ด๋–ป๊ฒŒ ์—ฐ๊ฒฐ๋˜๋Š”์ง€ ๊ฒ€์ฆํ•ด์ฃผ๋Š” ๋…ผ๋ฌธ์ด๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
๋‹จ๋ฐฑ์งˆ ์–ธ์–ด๋ชจ๋ธ์˜ ๋‚ด๋ถ€ ๋ฉ”์ปค๋‹ˆ์ฆ˜ ๋ถ„์„ ์—ฐ๊ตฌ๋กœ, bidirectional ๋น„๊ต์™€ reverse predictivity์˜ ์‹ค์ œ ์ ์šฉ ์‚ฌ๋ก€๊ฐ€ ๋œ๋‹ค.
๋ฐ˜๋ก /๋น„ํŒ
3221 ๋…ผ๋ฌธ์€ ๋‹จ๋ฐฑ์งˆ ์–ธ์–ด ๋ชจ๋ธ๊ณผ ์ž์—ฐ์–ด ์ฒ˜๋ฆฌ ์–ธ์–ด๋ชจ๋ธ ๊ฐ„์˜ ๋ณธ์งˆ์  ์ฐจ์ด๋ฅผ ๋ถ„์„ํ•˜์—ฌ, 2196์˜ ์ ‘๊ทผ ๋ฐฉ์‹์— ๋Œ€ํ•œ ์‹ฌํ™” ํ† ๋ก ์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
๋ฐ˜๋ก /๋น„ํŒ
๋‹จ๋ฐฑ์งˆ ์–ธ์–ด ๋ชจ๋ธ์ด ์ž์—ฐ ์–ธ์–ด ๋ชจ๋ธ๊ณผ ๊ทผ๋ณธ์ ์œผ๋กœ ๋‹ค๋ฅธ ํŠน์„ฑ์„ ๋ณด์ธ๋‹ค๋Š” ์ ์„ ๋ฐํžˆ๋ฉฐ, ์•ฝ๋ฌผ ํ™œ์„ฑ๋‹จ ์ฐจ ์˜ˆ์ธก์˜ ํ•ด์„ ๋ฐฉ๋ฒ•์— ๋น„ํŒ์  ์‹œ๊ฐ์„ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •