A data-efficient foundation model for porous materials based on expert-guided supervised learning

์ €์ž: Jiawen Zou, Zirui Lv, Weimin Tan, Taoyang Wang, Runfeng Lin, Zhongyao Wang, Yi Yang, Qiaowei Li, Xiaomin Li, Bo Yan, Dongyuan Zhao | ๋‚ ์งœ: 2026-02-11 | DOI: 10.1038/s41467-026-69245-y 📄 PDF


Essence

Figure 1

Fig. 1 | Overall training pipeline of SpbNet. SpbNet consists of four main parts: (1)

๋ณธ ๋…ผ๋ฌธ์€ MOF ๋“ฑ ๋‹ค๊ณต์„ฑ ์†Œ์žฌ ๋Œ€์ƒ ํŒŒ์šด๋ฐ์ด์…˜ ๋ชจ๋ธ SpbNet์„ ์ œ์‹œํ•œ๋‹ค. potential energy surface (PES) ๊ธฐ์ €ํ•จ์ˆ˜๋ฅผ ํ†ตํ•ฉ ๋””์Šคํฌ๋ฆฝํ„ฐ๋กœ ํ™œ์šฉํ•˜๊ณ  dual-stream multi-modal ์•„ํ‚คํ…์ฒ˜๋ฅผ ํ†ตํ•ด ๊ตฌ์กฐ ์ •๋ณด์™€ ์—๋„ˆ์ง€ ์ •๋ณด๋ฅผ ์œตํ•ฉํ•˜์—ฌ, 0.1๋ฐฑ๋งŒ ๊ฐœ MOF ๋ฐ์ดํ„ฐ๋กœ ์‚ฌ์ „ํ•™์Šตํ•˜๋ฉด์„œ๋„ 20๋ฐฐ ํฐ ๋ฐ์ดํ„ฐ์…‹์œผ๋กœ ํ•™์Šตํ•œ ๋ชจ๋ธ์„ ๋Šฅ๊ฐ€ํ•˜๋Š” ๋ฐ์ดํ„ฐ ํšจ์œจ์  ์˜ˆ์ธก์„ ๋‹ฌ์„ฑํ•œ๋‹ค.

Motivation

Achievement

Figure 3

Fig. 3 | Prediction error distributions (box plots) across diverse tasks. a Gas

โ€ข 0.1๋ฐฑ๋งŒ MOF๋กœ ์‚ฌ์ „ํ•™์Šตํ•œ SpbNet์ด 2๋ฐฑ๋งŒ ๊ฐœ ์ด์ƒ์˜ ๋‹ค๊ณต์„ฑ ์†Œ์žฌ, 1.2์–ต ๊ฐœ ๋ถ„์žยท์žฌ๋ฃŒ ์ƒ˜ํ”Œ๋กœ ํ•™์Šตํ•œ ๊ธฐ์กด ๋ชจ๋ธ์„ ์ƒ๋Œ€์˜ค์ฐจ 20% ์ด์ƒ ๊ฐ์†Œ์‹œํ‚ค๋ฉฐ ๋Šฅ๊ฐ€

โ€ข 40๊ฐœ ์ด์ƒ MOF ๋ฒค์น˜๋งˆํฌ(๊ฐ€์Šค ํก์ฐฉ, ๋ถ„๋ฆฌ, ํƒ„์„ฑ๋ฅ  ๋“ฑ)์—์„œ ์ผ๊ด€๋˜๊ฒŒ ์šฐ์ˆ˜ํ•œ ์„ฑ๋Šฅ

โ€ข COF, PPN, zeolite ๋“ฑ ๋ถ„ํฌ ์™ธ ์žฌ๋ฃŒ์— ๋Œ€ํ•ด์„œ๋„ ๊ฐ•ํ•œ ์ผ๋ฐ˜ํ™” ๋Šฅ๋ ฅ ์ž…์ฆ

โ€ข ๋ ˆ์ด๋ธ” ํšจ์œจ์„ฑ ์‹คํ—˜์—์„œ ์ œํ•œ๋œ ๋ฏธ์„ธ์กฐ์ • ๋ฐ์ดํ„ฐ๋กœ๋„ ์šฐ์ˆ˜ํ•œ ์„ฑ๋Šฅ ์œ ์ง€

How

Figure 1

Fig. 1 | Overall training pipeline of SpbNet. SpbNet consists of four main parts: (1)

โ€ข PES ๊ธฐ์ €ํ•จ์ˆ˜์˜ ์„ ํ˜•๊ฒฐํ•ฉ์„ ํ†ตํ•œ 21์ฑ„๋„ 3D ์ด๋ฏธ์ง€ ํ‘œํ˜„์œผ๋กœ ๋ถ„์ž ํŽธํ–ฅ ์ œ๊ฑฐ

โ€ข Dual-stream multi-modal ์•„ํ‚คํ…์ฒ˜(GNN + Vision Transformer + cross-attention)๋กœ ๊ตฌ์กฐ์™€ ์—๋„ˆ์ง€ ์ •๋ณด ๋…๋ฆฝ์  ์ฒ˜๋ฆฌ ๋ฐ ํšจ์œจ์  ์œตํ•ฉ

โ€ข Multi-scale ์‚ฌ์ „ํ•™์Šต ํƒœ์Šคํฌ(์œ„์ƒ, ๊ณต๊ทน ํฌ๊ธฐ, ํ‘œ๋ฉด์ , ์ ‘๊ทผ์„ฑ ๋“ฑ)๋กœ ๋‹ค์–‘ํ•œ ๊ณต๊ฐ„ ๊ทœ๋ชจ์˜ ๊ธฐํ•˜ํ•™์  ํŠน์„ฑ ํ•™์Šต

โ€ข Multi-head fine-tuning ์ „๋žต์œผ๋กœ ๋‹ค์–‘ํ•œ downstream ํƒœ์Šคํฌ์— ์‹ ์† ์ ์‘

Originality

โ€ข PES ๊ธฐ์ €ํ•จ์ˆ˜๋ฅผ ๋ฒ”์šฉ ๋””์Šคํฌ๋ฆฝํ„ฐ๋กœ ๋„์ž…ํ•˜์—ฌ ๊ฒŒ์ŠคํŠธ ๋ถ„์ž๋ณ„ ํŽธํ–ฅ ์ œ๊ฑฐ โ€” ๊ธฐ์กด์˜ ๋ถ„์ž ํŠน์ด์  PES ํ‘œํ˜„์˜ ํ•œ๊ณ„ ๊ทน๋ณต

โ€ข Dual-stream multi-modal ์„ค๊ณ„์—์„œ GNN๊ณผ vision transformer์˜ ์ด์งˆ์  ํ‘œํ˜„์„ cross-attention์œผ๋กœ ์œตํ•ฉ โ€” ๊ฐ ๋ชจ๋‹ฌ๋ฆฌํ‹ฐ์˜ ๋‹ค์–‘์„ฑ ์œ ์ง€

โ€ข ์ „๋ฌธ ๋ฌผ๋ฆฌ ์ง€์‹(๋ถ„์ž ๋ ฅ์žฅ, ๊ธฐํ•˜ํ•™์  ๋ถ„์„)์„ ์ง์ ‘ ๊ฐ๋… ์‹ ํ˜ธ๋กœ ํ†ตํ•ฉํ•˜์—ฌ ์‚ฌ์ „ํ•™์Šต ๋ฐ์ดํ„ฐ ํšจ์œจ์„ฑ ํ–ฅ์ƒ โ€” ๊ธฐ์กด์˜ ๋ถ€๋ถ„์  ์ง€์‹ ํ™œ์šฉ์„ ๋„˜์–ด์„  ์ฒด๊ณ„์  ์ ‘๊ทผ

Limitation & Further Study

โ€ข ์‚ฌ์ „ํ•™์Šต์€ MOF ์ค‘์‹ฌ์ด๋ฏ€๋กœ ๋‹ค๋ฅธ ๋‹ค๊ณต์„ฑ ์†Œ์žฌ์—์„œ์˜ ์„ฑ๋Šฅ ์ƒํ•œ ๋ถˆ๋ช…ํ™•

โ€ข PES ๊ธฐ์ €ํ•จ์ˆ˜ 21๊ฐœ ํ•ญ์˜ ์„ ํƒ ๊ทผ๊ฑฐ์™€ ์ตœ์ ์„ฑ์— ๋Œ€ํ•œ ์ƒ์„ธํ•œ ๋ถ„์„ ๋ถ€์กฑ

โ€ข ๋‹ค์–‘ํ•œ GNN ์•„ํ‚คํ…์ฒ˜์— ๋Œ€ํ•œ ์ฒด๊ณ„์  ๊ฒ€์ฆ ๋ถ€์กฑ

โ€ข ํฐ ๊ทœ๋ชจ downstream ํƒœ์Šคํฌ์—์„œ์˜ ๋ฏธ์„ธ์กฐ์ • ์ˆ˜๋ ด ํŠน์„ฑ๊ณผ ๊ณผ์ ํ•ฉ ์œ„ํ—˜์— ๋Œ€ํ•œ ๋…ผ์˜ ํ•„์š”

Evaluation

Novelty: 4/5 Technical Soundness: 4/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ๋ณธ ๋…ผ๋ฌธ์€ expert knowledge๋ฅผ ์ฒด๊ณ„์ ์œผ๋กœ ํ†ตํ•ฉํ•œ ํ˜์‹ ์  ์‚ฌ์ „ํ•™์Šต ํ”„๋ ˆ์ž„์›Œํฌ๋กœ, ๋ฐ์ดํ„ฐ ๋ถ€์กฑ์ด ์‹ฌ๊ฐํ•œ ์žฌ๋ฃŒ๊ณผํ•™ ๋ถ„์•ผ์—์„œ ํŒŒ์šด๋ฐ์ด์…˜ ๋ชจ๋ธ์˜ ์‹ค์šฉ์  ๊ฐ€๋Šฅ์„ฑ์„ ์ž…์ฆํ•œ๋‹ค. PES ๊ธฐ์ €ํ•จ์ˆ˜์˜ ๋„์ž…๊ณผ dual-stream ์•„ํ‚คํ…์ฒ˜์˜ ์„ค๊ณ„๊ฐ€ ์šฐ์ˆ˜ํ•˜๊ณ , ๊ด‘๋ฒ”์œ„ํ•œ ๋ฒค์น˜๋งˆํฌ์—์„œ ๊ฐ•ํ•œ ๋ฐ์ดํ„ฐ ํšจ์œจ์„ฑ์„ ๋ณด์ธ๋‹ค. ๋‹ค๋งŒ ๊ธฐ์ €ํ•จ์ˆ˜ ์„ ํƒ์˜ ์ตœ์ ์„ฑ ๋ถ„์„ ๋ฐ ๋‹ค์–‘ํ•œ ์†Œ์žฌ๊ตฐ์—์„œ์˜ ์‹œ์Šคํ…œ์  ํ‰๊ฐ€ ๊ฐ•ํ™”๊ฐ€ ์š”๊ตฌ๋œ๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
344๋ฒˆ ๋…ผ๋ฌธ์€ ๋ฐ”์ด์˜ค/์†Œ์žฌ ๋ถ„์•ผ ํŒŒ์šด๋ฐ์ด์…˜ ๋ชจ๋ธ์˜ ๊ตฌ์กฐ์  ํ† ๋Œ€์™€ ๋ฐœ์ „ ๋ฐฉํ–ฅ์„ ์‹ฌ๋„ ๊นŠ๊ฒŒ ๋‹ค๋ค„ SpbNet ๊ธฐ์ดˆ ์ดํ•ด์— ๋„์›€์ด ๋ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
343๋ฒˆ ๋…ผ๋ฌธ์€ ์†Œ์žฌ ๊ณผํ•™์—์„œ ํŒŒ์šด๋ฐ์ด์…˜ ๋ชจ๋ธ ์ „๋ฐ˜์„ ๋‹ค๋ฃจ๋ฏ€๋กœ, SpbNet์˜ ๋งฅ๋ฝ์ด๋‚˜ ํ™•์žฅ์ „๋žต ๋…ผ์˜์— ๊ธฐ์ดˆ๋ฅผ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
325 ๋…ผ๋ฌธ์€ ํ•ฉ๊ธˆ ์„ค๊ณ„ยท๋ฐœ๊ฒฌ์—์„œ physics-aware agentic ์‹œ์Šคํ…œ์„ ๋„์ž…ํ•ด, SpbNet๊ณผ ๋ฌผ์งˆ ์„ค๊ณ„ ๋ฌธ์ œ ํ•ด๊ฒฐ ๋ฐฉ์‹์—์„œ ๋น„๊ต๋ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋‹ค๊ณต์„ฑ ์†Œ์žฌ์— ํŠนํ™”๋œ ๋ฐ์ดํ„ฐ ํšจ์œจ์  ์žฌ๋ฃŒ ๋ถ„์•ผ ํ”„๋ผ์ž„ ๋ชจ๋ธ์˜ ์‚ฌ๋ก€๋กœ, ML ๊ธฐ๋ฐ˜ ๊ณ ์ฐจ์› ์žฌ๋ฃŒ ์‹œ๋ฎฌ๋ ˆ์ด์…˜์˜ ๋˜ ๋‹ค๋ฅธ ๋ฐฉํ–ฅ์„ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
516๋ฒˆ ๋…ผ๋ฌธ์€ ๊ธฐ๊ณ„ํ•™์Šต ๊ธฐ๋ฐ˜ ์›์ž๊ฐ„ ํผํ…์…œ ์˜ˆ์ธก์— ํŠนํ™”๋œ ๋ฆฌ๋ทฐ๋กœ์„œ, SpbNet์˜ ์žฅ๋‹จ์ ์„ ๋ฌผ๋ฆฌ์  ๋งฅ๋ฝ์—์„œ ๋น„๊ต ๋ถ„์„ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋‹ค๋ฅธ ํ™•์žฅ ๊ฐ€๋Šฅํ•œ ํŒŒ์šด๋ฐ์ด์…˜ ๋ชจ๋ธ ๊ธฐ๋ฐ˜ ๋ถ„์ž ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ์ ‘๊ทผ๋ฒ•์„ ์ œ์•ˆํ•˜์—ฌ, ๋‹ค์‹œ์„ค ์‹คํ—˜ ํ™˜๊ฒฝ์—์„œ์˜ ์—ฐํ•ฉํ•™์Šต ์ ์šฉ๊ณผ ๋น„๊ตํ•  ๊ฐ€์น˜๋ฅผ ๊ฐ€์ง‘๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋‹ค๊ณต์„ฑ ์†Œ์žฌ์— ํŠนํ™”๋œ ๊ณ ํšจ์œจ ์†Œ์ˆ˜ ๋ฐ์ดํ„ฐ ํ•™์Šต ํŒŒ์šด๋ฐ์ด์…˜ ๋ชจ๋ธ์„ ํ†ตํ•ด, ๋ฐฐํ„ฐ๋ฆฌ/์ „ํ•ด์งˆ ์šฉ๋งคํ™” ์˜ˆ์ธก๊ณผ ์œ ์‚ฌํ•œ ๋ฌธ์ œ๋ฅผ ํƒ€ ๋ฐ์ดํ„ฐ/๊ตฌ์กฐ ๊ธฐ๋ฐ˜ ์ ‘๊ทผ์œผ๋กœ ํ•ด๊ฒฐํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
2995 ๋…ผ๋ฌธ์€ DFT ๋Œ€์ฒด๋ฅผ ๋ชฉํ‘œ๋กœ ํ•˜๋Š” ML ๊ธฐ๋ฐ˜ interatomic potential์˜ ํŒจ๋Ÿฌ๋‹ค์ž„ ์ „ํ™˜ ๋…ผ์˜๋กœ, 2986์˜ ๋‹ค๊ณต์„ฑ ์†Œ์žฌ ํŠนํ™” ์•„ํ‚คํ…์ฒ˜์™€ ๊ฐ™์€ ๋„์ „์„ ๋‹ค๋ฃน๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋‹ค๊ณต์„ฑ ์†Œ์žฌ(์˜ˆ: ZIF) ํ˜•์„ฑ ๋ฉ”์ปค๋‹ˆ์ฆ˜์„ ๋ฐ์ดํ„ฐ ํšจ์œจ์  ํŒŒ์šด๋ฐ์ด์…˜ ๋ชจ๋ธ ๊ธฐ๋ฐ˜ ์‹œ๋ฎฌ๋ ˆ์ด์…˜์œผ๋กœ ๊ทœ๋ช…ํ•˜๋Š” ๋Œ€์กฐ์  ์—ฐ๊ตฌ์ž…๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๊ธฐ์กด ํŒŒ์šด๋ฐ์ด์…˜ ๋ชจ๋ธ ๊ธฐ๋ฐ˜ ๋‹ค๊ณต์„ฑ ๋ฌผ์งˆ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ๋ฐ์ดํ„ฐ์…‹๊ณผ ๋ณธ ๋…ผ๋ฌธ์—์„œ ์ œ์‹œํ•˜๋Š” ๊ณ ์˜จ ๊ณ ์ฒด์ „ํ•ด์งˆ ๋ฐ์ดํ„ฐ์…‹์˜ ๋น„๊ต๊ฐ€ ๊ฐ€๋Šฅํ•ฉ๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
372๋ฒˆ ๋…ผ๋ฌธ์€ ์ œ๋„ˆ๋Ÿด ํผํฌ์ฆˆ MLIP ๊ฐœ๋ฐœ ์‚ฌ๋ก€๋กœ, SpbNet์ด ์ œ์‹œํ•œ ํŒŒ์šด๋ฐ์ด์…˜ ๋ชจ๋ธ์˜ ์†Œ์žฌ ํ™•์žฅ์„ฑ๊ณผ ์ง์ ‘์ ์œผ๋กœ ์—ฐ๊ณ„๋ฉ๋‹ˆ๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •