Multi-to-uni modal knowledge transfer pre-training for molecular representation learning

์ €์ž: Zhankun Xiong, Ziyan Wang, Feng Huang, Minyao Qiu, Shuyan Fang, Liuqing Yang, Xionghui Zhou, Shichao Liu, Ping Zhang, Wen Zhang | ๋‚ ์งœ: 2026-02-14 | DOI: 10.1038/s41467-026-69302-6 📄 PDF


Essence

Figure 1

Fig. 1 | Overview of the M2UMol framework. a The four types of molecular

๋ถ„์ž ํ‘œํ˜„ ํ•™์Šต(MRL)์—์„œ ์™„์ „ํ•œ ๋ชจ๋‹ฌ๋ฆฌํ‹ฐ๋ฅผ ์š”๊ตฌํ•˜๋Š” ๊ธฐ์กด ๋‹ค์ค‘ ๋ชจ๋‹ฌ ์‚ฌ์ „ ํ•™์Šต์˜ ํ•œ๊ณ„๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด, M2UMol์€ 2D ๋ชจ๋‹ฌ๋ฆฌํ‹ฐ์— ๋‹ค์ค‘ ๋ชจ๋‹ฌ ์ง€์‹์„ ์ „์ดํ•˜๋Š” ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์•ˆํ•œ๋‹ค. ์ด๋ฅผ ํ†ตํ•ด 2D ๊ทธ๋ž˜ํ”„๋งŒ ์ฃผ์–ด์ง„ ์‹ค์ œ ๋‹ค์šด์ŠคํŠธ๋ฆผ ๊ณผ์ œ์—์„œ๋„ ์ •ํ™•ํ•œ ๋ถ„์ž ์†์„ฑ ์˜ˆ์ธก์„ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•œ๋‹ค.

Motivation

Achievement

M2UMol์˜ ์ฃผ์š” ์„ฑ๊ณผ:

How

Figure 3

Fig. 3 | Investigation of the designed multi-to-uni modal knowledge transfer

Originality

Limitation & Further Study

ํ›„์† ์—ฐ๊ตฌ ๋ฐฉํ–ฅ:

Evaluation

Novelty: 4/5 Technical Soundness: 4/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: M2UMol์€ ์‹ค์ œ ์‹ ์•ฝ ๊ฐœ๋ฐœ ํ™˜๊ฒฝ์˜ ๋ถˆ์™„์ „ํ•œ ๋ชจ๋‹ฌ๋ฆฌํ‹ฐ ๋ฌธ์ œ๋ฅผ ์ฐฝ์˜์ ์œผ๋กœ ํ•ด๊ฒฐํ•œ ์‹ค์šฉ์ ์ด๊ณ  ํ˜์‹ ์ ์ธ ์—ฐ๊ตฌ์ด๋‹ค. ๋‹ค์ค‘-๋‹จ์ผ ๋ชจ๋‹ฌ ์ง€์‹ ์ „์ด ํŒจ๋Ÿฌ๋‹ค์ž„๊ณผ ๋ชจ๋‹ฌ ํŠนํ™” ์–ด๋Œ‘ํ„ฐ ์„ค๊ณ„๋ฅผ ํ†ตํ•ด 2D ํ‘œํ˜„์—์„œ ๋‹ค์ค‘ ๋ชจ๋‹ฌ ์ •๋ณด๋ฅผ ํšจ๊ณผ์ ์œผ๋กœ ์ƒ์„ฑํ•˜๋ฉฐ, ์ข…ํ•ฉ์ ์ธ ์‹คํ—˜๊ณผ ์˜คํ”ˆ์†Œ์Šค ํŒจํ‚ค์ง€ ์ œ๊ณต์œผ๋กœ ๋†’์€ ์žฌํ˜„์„ฑ๊ณผ ์ ์šฉ ๊ฐ€๋Šฅ์„ฑ์„ ๋ณด์žฅํ•œ๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
๋ถ„์ž geometry ์ •๋ณด๋ฅผ ํ•™์Šต์— ๋ฐ˜์˜ํ•˜๋Š” ํ† ํฌ๋‚˜์ด์ง• ๋ฐฉ์‹์ด, multi-to-uni modal knowledge transfer ์‚ฌ์ „ํ•™์Šต์˜ ์ด๋ก ์  ๊ธฐ๋ฐ˜์ด ๋œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
๋‹จ๋ฐฑ์งˆยท๋ถ„์ž ๋ถ„์•ผ foundation model์˜ ๋ฐ์ดํ„ฐ ํ™œ์šฉ๊ณผ ๋ฒค์น˜๋งˆํฌ ํ˜„ํ™ฉ์„ ์•Œ ์ˆ˜ ์žˆ์œผ๋ฏ€๋กœ, M2UMol์˜ ์‚ฌ์ „ํ•™์Šตยท์ „์ด ์ „๋žต ์ดํ•ด์— ์ฐธ๊ณ ๊ฐ€ ๋œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
์ƒ์ฒด๋ถ„์ž์™€ ์ž์—ฐ์–ด ์‚ฌ์ด์˜ ๋‹ค์ค‘ ๋ชจ๋‹ฌ ๊ฒฐํ•ฉ์„ ํ†ตํ•œ ๋ถ„์ž ํ‘œํ˜„ ํ•™์Šต์„ ์‹œ๋„ํ•˜์—ฌ, M2UMol์˜ ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌโ†’2D ์ „์ด ์ ‘๊ทผ๊ณผ ๋Œ€๋น„ํ•  ์ˆ˜ ์žˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
Linear-time prediction of proteome-scale ๋‹จ๋ฐฑ์งˆ ๊ตฌ์กฐ ์˜ˆ์ธก ๋…ผ๋ฌธ์€ multimodal representation์ด ์•„๋‹Œ ์‹œํ€€์Šค ๊ธฐ๋ฐ˜ ์˜ˆ์ธก๋ฒ•์„ ์ œ์‹œํ•ด M2UMol์˜ multi-to-uni modal ์ „์ด์— ๋Œ€ํ•œ ๋‹ค๋ฅธ ๊ด€์ ์„ ์ œ๊ณตํ•œ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
ํ™”ํ•™ ๊ณต๊ฐ„์—์„œ ๋Œ€ํ˜•์–ธ์–ด๋ชจ๋ธ์„ ํ™œ์šฉํ•œ ํƒ์ƒ‰ ๋ฐฉ์‹์ด, M2UMol์˜ ์‹ค์ œ ๋ถ„์ž ์†์„ฑ ์˜ˆ์ธก ์ ์šฉ์— ์ง์ ‘ ์—ฐ๊ฒฐ๋œ๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
BioMiner ๋…ผ๋ฌธ์€ multi-modal protein-ligand data extraction์„ ๋‹ค๋ฃจ์–ด M2UMol์˜ modality knowledge transfer ๋ฐฉ์‹์„ ์‹ค์ œ ์ƒ๋ฌผํ•™ ๋ฐ์ดํ„ฐ์— ์ ์šฉํ•˜๋Š” ์˜ˆ์‹œ๊ฐ€ ๋œ๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •