Large Language Model Assisted Discovery of Optimal Dopants for Enhanced Thermoelectric Performance in CoSb3 Based Skutterudites

์ €์ž: | ๋‚ ์งœ: 2026-04-07 | URL: https://arxiv.org/abs/2604.06048 📄 PDF


Essence

Figure 1

Figure 1: Pipeline of the two approaches to predict ZT values. Approach 1 is a forward model

LLM์„ ํ™œ์šฉํ•˜์—ฌ 300์—ฌ ํŽธ์˜ ๋…ผ๋ฌธ์—์„œ ์กฐ์„ฑ ๋ฐ์ดํ„ฐ๋ฅผ ์ž๋™ ์ถ”์ถœํ•˜๊ณ , BERT ๊ธฐ๋ฐ˜ ๋จธ์‹ ๋Ÿฌ๋‹ ๋ชจ๋ธ๋กœ CoSb3 ์Šค์ปคํ„ฐ๋ฃจ๋‹ค์ดํŠธ์˜ ์—ด์ „ ์„ฑ๋Šฅ(ZT)์„ ์˜ˆ์ธกํ•œ ํ›„ DFT/MD ๊ณ„์‚ฐ์œผ๋กœ ๊ฒ€์ฆํ•˜๋Š” ๋ฐ์ดํ„ฐ ์ฃผ๋„ ์‹ ์†Œ์žฌ ๋ฐœ๊ฒฌ ํŒŒ์ดํ”„๋ผ์ธ์„ ์ œ์‹œํ–ˆ๋‹ค.

Motivation

Achievement

Figure 3

Figure 3: (a) MSE loss plots of ANN training and validation as well as BERT based model training

How

Figure 2

Figure 2: (a) One-hot encoded style representation of the composition used in the ANN model,

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ๋ณธ ๋…ผ๋ฌธ์€ LLM ๊ธฐ๋ฐ˜ ์ž๋™ ์ •๋ณด ์ถ”์ถœ๊ณผ BERT ์ž„๋ฒ ๋”ฉ์„ ํ™œ์šฉํ•œ ์ˆœ์ˆ˜ ์กฐ์„ฑ ๊ธฐ๋ฐ˜ ์—ด์ „ ์„ฑ๋Šฅ ์˜ˆ์ธก์ด๋ผ๋Š” ์ƒˆ๋กœ์šด ๋ฐฉ๋ฒ•๋ก ์„ ์ œ์‹œํ•˜๋ฉฐ, ๋ฐ์ดํ„ฐ ์ฃผ๋„ ์‹ ์†Œ์žฌ ๋ฐœ๊ฒฌ์— LLM์˜ ์ ์šฉ ๊ฐ€๋Šฅ์„ฑ์„ ์ž…์ฆํ–ˆ๋‹ค. ์ œ1์›๋ฆฌ ๊ฒ€์ฆ์œผ๋กœ ์‹ ๋ขฐ์„ฑ์„ ๊ฐ•ํ™”ํ–ˆ์œผ๋‚˜, ๋ฐ์ดํ„ฐ์…‹ ๊ทœ๋ชจ ํ™•๋Œ€์™€ ๋„๋ฉ”์ธ ํŠนํ™” ๋ชจ๋ธ ๊ฐœ๋ฐœ์„ ํ†ตํ•œ ์ถ”๊ฐ€ ๋ฐœ์ „ ๊ฐ€๋Šฅ์„ฑ์ด ์ถฉ๋ถ„ํ•˜๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
๋จธ์‹ ๋Ÿฌ๋‹ ๊ธฐ๋ฐ˜ ์†Œ์žฌ ์˜ˆ์ธก๊ณผ ๋Œ€๊ทœ๋ชจ ๋ฐ์ดํ„ฐ ์ถ”์ถœ์— ํ•„์š”ํ•œ ์Šค์ผ€์ผ๋ง ์ „๋žต์„ ๋‹ด์€ ๊ธฐ๋ฐ˜์  ๋…ผ๋ฌธ์ž…๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
004๋Š” ๊ณผํ•™ LLM์˜ ๋ฐœ์ „ ๋ฐ ์“ฐ์ž„์ƒˆ๋ฅผ ์ •๋ฆฌํ•œ ์„œ๋ฒ ์ด๋กœ, 3148 ๊ฐ™์€ LLM ๊ธฐ๋ฐ˜ ์ž๋™ ์ •๋ณด ์ถ”์ถœยทํ™œ์šฉ ์—ฐ๊ตฌ์˜ ์ด๋ก  ํ† ๋Œ€๋ฅผ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
LLM ๊ธฐ๋ฐ˜ ์žฌ๋ฃŒ ์ž๋™๋ฐœ๊ฒฌ๊ณผ ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ์ •๋ณด ์ถ”์ถœ์„ ํ†ตํ•ฉํ•œ ์‹œ์Šคํ…œ(MatterChat)์œผ๋กœ, LLM-DFT ์—ฐ๋™๊ณผ ๋ฐ์ดํ„ฐ ํ™•๋ณด ์ธก๋ฉด์˜ ์œ ์‚ฌ์„ฑ์ด ์žˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
522๋Š” LLM๊ณผ ํ”„๋กฌํ”„ํŠธ ์ „๋žต์„ ์ด์šฉํ•ด ์žฌ๋ฃŒ ๊ณผํ•™ ํƒ์ƒ‰์„ ์ˆ˜ํ–‰ํ•˜๋Š” ํ”„๋ ˆ์ž„์›Œํฌ๋กœ, 3148์˜ LLM+์žฌ๋ฃŒ๋ฐœ๊ฒฌ ํŒŒ์ดํ”„๋ผ์ธ๊ณผ ๋น„๊ต๋  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
Material science์—์„œ LLM๊ณผ ๋„๋ฉ”์ธ ์ง€์‹ ์œตํ•ฉ์„ ํ†ตํ•œ ์ตœ์  ๋„ํŽ€ํŠธ ๋ฐ ์†Œ์žฌ ํ›„๋ณด ๋ฐœ๊ฒฌ์˜ ๋˜ ๋‹ค๋ฅธ ์ ‘๊ทผ๋ฒ•์„ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
LLM์„ ํ™œ์šฉํ•œ ๊ณผํ•™ ๋ฌธํ—Œ ๋งˆ์ด๋‹ ๋ฐ ์žฌ๋ฃŒ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ๊ตฌ์ถ•์˜ ๋‹ค๋ฅธ ๋ฐฉ๋ฒ•์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
์—ด์ „ ์žฌ๋ฃŒ ์ตœ์ ํ™”๋ฅผ ์œ„ํ•œ ๋‹ค๋ฅธ ๋จธ์‹ ๋Ÿฌ๋‹ ๊ธฐ๋ฐ˜ ์ ‘๊ทผ๋ฒ•์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋„ํŽ€ํŠธ ํƒ์ƒ‰ ๋ฐ ์žฌ๋ฃŒ ๋ฐœ๊ฒฌ์„ ์œ„ํ•œ ๋Œ€์•ˆ์  AI ํŒŒ์ดํ”„๋ผ์ธ์ด๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
Retrieval-augmented generation(RAG)๊ณผ ์†Œ์žฌ ์„ค๊ณ„ ํ†ตํ•ฉ์„ ๋ณด๋‹ค ๊ตฌ์กฐ์ ์œผ๋กœ ํ™•์žฅํ•œ ์ ‘๊ทผ์ด๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
LLM ๊ธฐ๋ฐ˜ ๋„ํŠธํŒํŠธ ๋ฐœ๊ฒฌ ์ตœ์ ํ™” ๋…ผ๋ฌธ์€ ChemCrow ์ ‘๊ทผ๋ฒ•์˜ ์‹ค์ œ ์‘์šฉ ์‚ฌ๋ก€๋กœ, ์ƒˆ๋กœ์šด ํ™”ํ•™์  ์ธ์‚ฌ์ดํŠธ ์‹คํ˜„ ์‚ฌ๋ก€๋ฅผ ์ฐธ๊ณ ํ•  ์ˆ˜ ์žˆ๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •