MatClaw: An Autonomous Code-First LLM Agent for End-to-End Materials Exploration

์ €์ž: | ๋‚ ์งœ: 2026-04-03 | URL: https://arxiv.org/abs/2604.02688 📄 PDF


Essence

Figure 1

Figure 1 illustrates the overall architecture. MatClaw adopts the code-as-action paradigm [Wang

MatClaw๋Š” ๊ธฐ์กด LLM ์—์ด์ „ํŠธ์˜ ํŒŒ์ดํ”„๋ผ์ธ ๋ฐ”์ธ๋”ฉ๊ณผ ๋„๊ตฌ ํ•จ์ˆ˜ ์˜์กด์„ฑ์„ ๊ทน๋ณตํ•˜๊ธฐ ์œ„ํ•ด, Python ์ฝ”๋“œ๋ฅผ ์ง์ ‘ ์ž‘์„ฑยท์‹คํ–‰ํ•˜์—ฌ ๋„๋ฉ”์ธ ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ(pymatgen, atomate2, DeePMD-kit ๋“ฑ)๋ฅผ ์ž์œ ๋กญ๊ฒŒ ์กฐํ•ฉํ•˜๊ณ  ์›๊ฒฉ HPC ํด๋Ÿฌ์Šคํ„ฐ์—์„œ ๋‹ค์ค‘ ์ฝ”๋“œ ์›Œํฌํ”Œ๋กœ๋ฅผ ์˜ค์ผ€์ŠคํŠธ๋ ˆ์ด์…˜ํ•˜๋Š” code-first LLM ์—์ด์ „ํŠธ์ด๋‹ค.

Motivation

Achievement

Figure 5

Figure 5: Chunking method comparison on pymatgen code QA (300 questions, Gemini 3.0 Flash,

API ์ •ํ™•๋„ ๊ฐœ์„ : RAG๋ฅผ ํ†ตํ•ด ๋‹จ๊ณ„๋‹น ์•ฝ 99% API-call ์ •ํ™•๋„ ๋‹ฌ์„ฑ. 3๊ฐœ์˜ end-to-end ๋ฐ๋ชจ: ferroelectric CuInP2S6์— ๋Œ€ํ•œ machine-learning force field training (active learning), Curie temperature ์˜ˆ์ธก, heuristic parameter-space search ์‹œ์—ฐ. ๊ฐ€์ด๋“œ ์ž์œจ์„ฑ ๋ชจ๋ธ: literature self-learning๊ณผ expert-specified constraints๋ฅผ ํ†ตํ•ด tacit domain knowledge ๋ถ€์žฌ ๊ทน๋ณต. ์˜คํ”ˆ์†Œ์Šค ๊ณต๊ฐœ: ๋ชจ๋“  ์ฝ”๋“œ์™€ ๋ฒค์น˜๋งˆํฌ ๊ณต๊ฐœ.

How

Figure 1

Figure 1 illustrates the overall architecture. MatClaw adopts the code-as-action paradigm [Wang

Originality

Limitation & Further Study

Tacit domain knowledge์˜ ๋ถ€์žฌ: ์ ์ ˆํ•œ simulation timescale, equilibration protocol, sampling strategy ๋“ฑ ์—ฐ๊ตฌ ๊ฒฝํ—˜์„ ํ†ตํ•ด ์ถ•์ ๋˜๋Š” ์ง€์‹ ๊ฒฐํ•. ๊ฐ€์ด๋“œ ํ•„์š”์„ฑ: ์™„์ „ ์ž์œจ์€ ์–ด๋ ต๊ณ  literature self-learning๊ณผ expert-specified constraints๊ฐ€ ํ•„์ˆ˜. ํ‰๊ฐ€ ๋ฒ”์œ„ ์ œํ•œ: CuInP2S6 ๋‹จ์ผ ์žฌ๋ฃŒ์— ๋Œ€ํ•œ 3๊ฐœ ์ผ€์ด์Šค ์‹œ์—ฐ์œผ๋กœ ๋‹ค์–‘ํ•œ materials ๋ฐ ์›Œํฌํ”Œ๋กœ ๋ฒ”์œ„ ํ™•๋Œ€ ํ•„์š”. ์—๋Ÿฌ ํšŒ๋ณต ๋ฉ”์ปค๋‹ˆ์ฆ˜ ์ƒ์„ธ ๋ฏธํก: ์‹คํŒจ ์ฒ˜๋ฆฌ ๋ฐ ์ž๋™ ์žฌ์‹œ๋„ ์ „๋žต์— ๋Œ€ํ•œ ๊ตฌ์ฒด์  ์„ค๋ช… ๋ถ€์กฑ. Hallucination ํ†ต์ œ: LLM์˜ ์ฝ”๋“œ ์ƒ์„ฑ ์˜ค๋ฅ˜๋‚˜ ํ™˜๊ฐ์ด complex workflow์—์„œ ๋ฏธ์น˜๋Š” ์˜ํ–ฅ ํ‰๊ฐ€ ํ•„์š”.

Evaluation

Novelty: 4/5 Technical Soundness: 4/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: MatClaw๋Š” computational materials science์˜ ์ž๋™ํ™”์— ๋งค์šฐ ์‹ค์งˆ์ ์ธ ๊ธฐ์—ฌ๋ฅผ ํ•˜๋Š” code-first LLM ์—์ด์ „ํŠธ์ด๋‹ค. ํŒŒ์ดํ”„๋ผ์ธ ๋ฐ”์ธ๋”ฉ๊ณผ ๋„๊ตฌ ํ•จ์ˆ˜ ์˜์กด์„ฑ์ด๋ผ๋Š” ๊ธฐ์กด ์—์ด์ „ํŠธ์˜ ๊ทผ๋ณธ์  ์ œ์•ฝ์„ ๊ทน๋ณตํ•˜๊ณ , 4๊ณ„์ธต ๋ฉ”๋ชจ๋ฆฌ์™€ RAG๋ฅผ ํ†ตํ•ด ์žฅ๊ธฐ ์›Œํฌํ”Œ๋กœ ์‹คํ–‰์˜ ์ผ๊ด€์„ฑ์„ ์ƒ๋‹นํžˆ ๊ฐœ์„ ํ–ˆ๋‹ค. ๋‹ค๋งŒ tacit domain knowledge ๋ถ€์žฌ๋กœ ์™„์ „ ์ž์œจํ™”๋Š” ์•„์ง ๋ฏธํกํ•˜๋ฉฐ, ํ‰๊ฐ€๊ฐ€ ๋‹จ์ผ ์žฌ๋ฃŒ์— ๊ตญํ•œ๋˜์–ด ์ผ๋ฐ˜ํ™” ๊ฐ€๋Šฅ์„ฑ ๊ฒ€์ฆ์ด ํ•„์š”ํ•˜๋‹ค. ์ „์ฒด์ ์œผ๋กœ ๋ฐฉํ–ฅ์„ฑ๊ณผ ๊ธฐ์ˆ  ํ†ตํ•ฉ์€ ์šฐ์ˆ˜ํ•˜๋‚˜, ๊ด‘๋ฒ”์œ„ํ•œ ์‹ค์ œ ์‘์šฉ ๊ฒ€์ฆ์„ ์œ„ํ•ด ๋ณด์™„์ด ํ•„์š”ํ•œ ๋‹จ๊ณ„์ด๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
230์˜ Code llama ํ”„๋กœ์ ํŠธ๋Š” LLM ๊ธฐ๋ฐ˜ ์ฝ”๋“œ ์ƒ์„ฑยท์‹คํ–‰์˜ ๋Œ€ํ‘œ์  ๋ฐฉ๋ฒ•๋ก ์œผ๋กœ, 3160์˜ ์ฝ”๋“œํผ์ŠคํŠธ LLM ์—์ด์ „ํŠธ๊ฐ€ ์ง€ํ–ฅํ•˜๋Š” ๊ธฐ์ˆ ์  ๋ฐฐ๊ฒฝ์ž…๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
325 ๋…ผ๋ฌธ์€ ์ฝ”๋“œ ์ž‘์„ฑ ๋ฐ ์‹คํ–‰์ด LLM ๊ธฐ๋ฐ˜ ์—์ด์ „ํŠธ์˜ ์„ฑ๋Šฅ์— ๋ฏธ์น˜๋Š” ์˜ํ–ฅ๊ณผ ์„ค๊ณ„ ์ „๋žต์„ ์ƒ์„ธํžˆ ๋ถ„์„ํ•˜๋ฏ€๋กœ MatClaw(3160)์— ๊ธฐ์ˆ ์  ๋ฐฐ๊ฒฝ์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
589๋Š” ๋„๋ฉ”์ธ ํŠนํ™” LLM๊ณผ retrieval ๊ธฐ๋ฐ˜ materials workflow ์ž๋™ํ™”๋ฅผ ์ง€ํ–ฅํ•˜์—ฌ, 3160์˜ code-first agentic ์ ‘๊ทผ๊ณผ ๊ทผ๋ณธ์ ์ธ ์ฐจ๋ณ„์ ์ด ์žˆ์Šต๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
407 ๋…ผ๋ฌธ์€ MatClaw(3160)์™€ ์œ ์‚ฌํ•˜๊ฒŒ ์žฌ๋ฃŒ ๋ถ„์•ผ์—์„œ LLM ๊ธฐ๋ฐ˜ ๊ณผํ•™ ์—์ด์ „ํŠธ ์‹œ์Šคํ…œ์„ ๊ตฌ์ถ•ํ•˜๊ณ  ์žˆ์–ด, ์‹ค์šฉ์  ๊ตฌํ˜„ ์ „๋žต์— ์ถ”๊ฐ€์  ์ธ์‚ฌ์ดํŠธ๋ฅผ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
340์˜ ๋„๋ฉ”์ธ๋ณ„ ์ฝ”๋“œ ๊ธฐ๋ฐ˜ ์ž๋™ํ™” ์—ฐ๊ตฌ๋Š” 3160์ด ์ œ์•ˆํ•˜๋Š” ์‹คํ—˜์  ์˜ค์ผ€์ŠคํŠธ๋ ˆ์ด์…˜์˜ ์‹ค์ œ ์ ์šฉ ์‚ฌ๋ก€๋ฅผ ์ œ๊ณตํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
๋ฐ˜๋ก /๋น„ํŒ
322๋Š” AGI agent ์‹œ์Šคํ…œ ํ‰๊ฐ€์—์„œ ์›Œํฌํ”Œ๋กœ์šฐ ์‹ ๋ขฐ์„ฑ๊ณผ ์ž๋™ํ™” ํ•œ๊ณ„๋ฅผ ๋น„ํŒ์ ์œผ๋กœ ๋‹ค๋ฃจ๋ฉฐ, 3160์˜ code orchestration ๋ฐฉ์‹๊ณผ ํ•œ๊ณ„๋ฅผ ์ ๊ฒ€ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •