LLaMP: Large Language Model Made Powerful for High-fidelity Materials Knowledge Retrieval and Distillation

์ €์ž: Yuan Chiang, Elvis Hsieh, Chia-Hong Chou, Janosh Riebesell | ๋‚ ์งœ: 2024-10-09 | DOI: 10.48550/arXiv.2401.17244 📄 PDF


Essence

Figure 1

Figure 1: Hierarchical ReAct agent planning in LLaMP. Two levels of agents are deployed using a

LLaMP๋Š” ๊ณ„์ธต์  ReAct ์—์ด์ „ํŠธ ๊ธฐ๋ฐ˜ multimodal RAG ํ”„๋ ˆ์ž„์›Œํฌ๋กœ, Materials Project์™€ atomistic simulation ๋„๊ตฌ๋ฅผ ํ™œ์šฉํ•˜์—ฌ LLM์˜ hallucination์„ ์ค„์ด๊ณ  ์žฌ๋ฃŒ๊ณผํ•™ ๋ถ„์•ผ์—์„œ ๋†’์€ ์‹ ๋ขฐ๋„์˜ ์ง€์‹ ๊ฒ€์ƒ‰๊ณผ ๋ณต์žกํ•œ ์ž‘์—…์„ ์ˆ˜ํ–‰ํ•œ๋‹ค.

Motivation

Achievement

Figure 2

Figure 2: LLaMP RAG responses, baseline methods, and LLM intrinsic knowledge on material

How

Figure 1

Figure 1: Hierarchical ReAct agent planning in LLaMP. Two levels of agents are deployed using a

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: LLaMP๋Š” ๊ณ„์ธต์  ReAct ์—์ด์ „ํŠธ์™€ domain-specific data source๋ฅผ ๊ฒฐํ•ฉํ•˜์—ฌ ๊ณผํ•™ ๋ถ„์•ผ์—์„œ LLM์˜ hallucination ๋ฌธ์ œ๋ฅผ ์‹ค์งˆ์ ์œผ๋กœ ์™„ํ™”ํ•˜๋Š” ์‹ค์šฉ์ ์ด๊ณ  ํ˜์‹ ์ ์ธ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์‹œํ•˜๋ฉฐ, materials informatics์—์„œ์˜ ๊ตฌ์ฒด์  ์„ฑ๊ณผ๋กœ scientific AI์˜ ์‹ ๋ขฐ์„ฑ ํ–ฅ์ƒ์— ์ค‘์š”ํ•œ ๊ธฐ์—ฌ๋ฅผ ํ•œ๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
๊ณผํ•™ ๋„๋ฉ”์ธ LLMยท๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ RAG ํ”„๋ ˆ์ž„์›Œํฌ์— ๋Œ€ํ•œ ์ข…ํ•ฉ์  ์„œ๋ฒ ์ด๋กœ, LLaMP ์ ‘๊ทผ์˜ ์ด๋ก ์  ๋งฅ๋ฝ์„ ์ œ๊ณตํ•œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
์ง€์‹์ถ”๋ก  ๊ธฐ๋ฐ˜์˜ LLM์„ ์žฌ๋ฃŒ๊ณผํ•™์— ํŠนํ™”์‹œํ‚ค๋Š” ๋ฐฉ์‹์„ ํ†ตํ•ด LLaMP์˜ ๊ณ ์‹ ๋ขฐ๋„ ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ์ž‘์—… ์„ฑ๋Šฅ์ด ๋’ท๋ฐ›์นจ๋ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
HoneyComb๋Š” LLM ๊ธฐ๋ฐ˜์˜ ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ์—์ด์ „ํŠธ๋กœ ์žฌ๋ฃŒ ๊ณผํ•™ ์ •๋ณด ๊ฒ€์ƒ‰ ๋ฐ ์ถ”๋ก  ์ž๋™ํ™” ์ธก๋ฉด์—์„œ LLaMP์™€ ๋น„๊ต ๋Œ€์ƒ์ด ๋œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๊ณผํ•™์  ์ง€์‹ ์ถ”์ถœ ๋ฐ ํ•ฉ์„ฑ์„ ์œ„ํ•œ LLM ํ™œ์šฉ์„ ๋‹ค๋ฃจ๋Š” ์œ ์‚ฌํ•œ ์—ฐ๊ตฌ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
MatterChat์€ ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ LLM๊ณผ materials project ๋ฐ์ดํ„ฐ๋ฅผ ํ™œ์šฉํ•ด ์žฌ๋ฃŒ๊ณผํ•™ ๋ถ„์•ผ ๊ณ ์œ  ๋ฌธ์ œํ•ด๊ฒฐ์„ ์‹œ๋„ํ•œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋‹จ๋ฐฑ์งˆ ์„œ์—ด ์ตœ์ ํ™”๋ฅผ ์œ„ํ•œ ๋Œ€์•ˆ์  ์ƒ์„ฑ ๋ชจ๋ธ ์ ‘๊ทผ๋ฒ•์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
LLaMP๋Š” LLM์„ ์œ„ํ•œ ๋ฐฐํ„ฐ๋ฆฌ ์†Œ์žฌ, ๊ณ ์ฒด ์ „ํ•ด์งˆ ๋“ฑ์—์„œ ํŠน์„ฑ ์˜ˆ์ธก์„ ์ง€ํ–ฅํ•˜์—ฌ, ML ์˜ˆ์ธก ๋ฐ ๋ฐ์ดํ„ฐ ๊ธฐ๋ฐ˜ ์›Œํฌํ”Œ๋กœ์šฐ์˜ ์œ ์šฉ์„ฑ์„ ํ‰๊ฐ€ ์ธก๋ฉด์—์„œ ๋น„๊ต ๊ฐ€๋Šฅํ•˜๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
LLM์„ ํ™œ์šฉํ•œ ๋ถ„์ž ์ƒ์„ฑ ๋ฐ ์„ค๊ณ„์˜ ๋Œ€์•ˆ์  ์ ‘๊ทผ๋ฒ•์„ ์ œ์‹œํ•œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋ถ„์ž ์–ธ์–ด ๋ชจ๋ธ์„ ํ™œ์šฉํ•œ ํ™”ํ•™์  ์ถ”๋ก  ๋ฐ ๋ฌผ์„ฑ ์˜ˆ์ธก์„ ๋‹ค๋ฃจ๋Š” ์œ ์‚ฌํ•œ ์—ฐ๊ตฌ์ด๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
LLaMP๋Š” ์žฌ๋ฃŒ๊ณผํ•™ยท์‹œ๋ฎฌ๋ ˆ์ด์…˜ ๋„๊ตฌ์™€ RAG ๊ฒฐํ•ฉ์œผ๋กœ, GraphCast๋ฅ˜ ๊ธฐ์ƒ ์˜ˆ์ธก์—์„œ๋„ LLM ๊ธฐ๋ฐ˜ ๊ณ ์‹ ๋ขฐ๋„ ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ์ž‘์—… ์ ์šฉ์˜ ๋ฐœ์ „ ๊ฐ€๋Šฅ์„ฑ์„ ์‹œ์‚ฌํ•ฉ๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
ํŒŒ์šด๋ฐ์ด์…˜ ๋ชจ๋ธ์ด ์žฌ๋ฃŒ๊ณผํ•™ ์ •๋ณด ์ถ”์ถœ ๋ฐ ์ƒ์„ฑ์— ๋ฏธ์น˜๋Š” ์˜ํ–ฅ๋ ฅ๊ณผ, ๋ถ„์•ผ๋ณ„ ์ „์ด ๊ฐ€๋Šฅ์„ฑ์„ ์„œ๋ฒ ์ดํ•ฉ๋‹ˆ๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
๋ณธ ๋…ผ๋ฌธ์—์„œ ์ œ์‹œ๋œ ํŒŒ์šด๋ฐ์ด์…˜ ๋ชจ๋ธ ํ™œ์šฉ ์ด๋ก ์ด, LLaMP๊ฐ€ ์‹ค์งˆ์ ์œผ๋กœ Materials Project ๋“ฑ ์†Œ์žฌ-์‹œ๋ฎฌ๋ ˆ์ด์…˜ ๋„๊ตฌ์—์„œ ๊ตฌํ˜„๋œ ์‚ฌ๋ก€๋ฅผ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •