Autonomous microscopy experiments through large language model agents

์ €์ž: Indrajeet Mandal, Jitendra Soni, Mohd Zaki, Morten M. Smedskjรฆr, Katrin Wondraczek, Lothar Wondraczek, Nitya Nand Gosvami, N. M. Anoop Krishnan | ๋‚ ์งœ: 2024 | DOI: ๋ฏธ์ œ๊ณต 📄 PDF


Essence

๋Œ€๊ทœ๋ชจ ์–ธ์–ด๋ชจ๋ธ(LLM) ๊ธฐ๋ฐ˜ ์ž๋™ํ™” ํ˜„๋ฏธ๊ฒฝ ์‹คํ—˜ ์‹œ์Šคํ…œ(AILA)์„ ๊ตฌ์ถ•ํ•˜๊ณ , ์›์ž๋ ฅ ํ˜„๋ฏธ๊ฒฝ(AFM) ์‹คํ—˜์˜ ์™„์ „ํ•œ ๊ณผํ•™์  ์›Œํฌํ”Œ๋กœ์šฐ๋ฅผ ํ‰๊ฐ€ํ•˜๋Š” ์ข…ํ•ฉ ๋ฒค์น˜๋งˆํฌ(AFMBench)๋ฅผ ๊ฐœ๋ฐœํ–ˆ๋‹ค. ์ตœ์ฒจ๋‹จ AI ๋ชจ๋ธ๋“ค๋„ ๊ธฐ๋ณธ ์ž‘์—…์—์„œ ์–ด๋ ค์›€์„ ๊ฒช์œผ๋ฉฐ, ๋„๋ฉ”์ธ ํŠนํ™” ์งˆ์˜์‘๋‹ต ์„ฑ๋Šฅ์ด ์‹ค์ œ ์—์ด์ „ํŠธ ๋Šฅ๋ ฅ์œผ๋กœ ์ „ํ™˜๋˜์ง€ ์•Š์Œ์„ ๋ฐํ˜”๋‹ค.

Motivation

Achievement

  1. AILA ํ”„๋ ˆ์ž„์›Œํฌ ๊ฐœ๋ฐœ: LLM ๊ธฐ๋ฐ˜ ํ”Œ๋ž˜๋„ˆ๊ฐ€ AFM Handler Agent(AFM-HA)์™€ Data Handler Agent(DHA)๋ฅผ ๋™์ ์œผ๋กœ ์กฐ์œจํ•˜์—ฌ ์‹คํ—˜ ์ œ์–ด์™€ ๋ฐ์ดํ„ฐ ๋ถ„์„์„ ์ž๋™ํ™”. ๋ฌธ์„œ ๊ฒ€์ƒ‰, ์ฝ”๋“œ ์‹คํ–‰, ์ด๋ฏธ์ง€ ๋ถ„์„ ๋“ฑ ํŠนํ™”๋œ ๋„๊ตฌ ํ†ตํ•ฉ
  2. AFMBench ๊ตฌ์ถ•: ๊ธฐ๋ณธ ์ž‘์—…(56%)๊ณผ ๊ณ ๊ธ‰ ์ž‘์—…(44%)์„ ํฌํ•จํ•œ 100๊ฐœ ๊ณผ์ œ๋กœ ๊ตฌ์„ฑ. ๋„๊ตฌ ์กฐ์œจ(69% ๋‹ค์ค‘ ๋„๊ตฌ), ์—์ด์ „ํŠธ ์กฐ์œจ(17% ๋‹ค์ค‘ ์—์ด์ „ํŠธ) ์š”๊ตฌ์‚ฌํ•ญ์„ ๋ฐ˜์˜ํ•˜์—ฌ ํ˜„์‹ค์  ๋ณต์žก๋„ ์žฌํ˜„
  3. ์„ฑ๋Šฅ ํ‰๊ฐ€์˜ ์—ญ์„ค์  ๋ฐœ๊ฒฌ:
    • GPT-4o: ๋ฌธ์„œ ๊ธฐ๋ฐ˜ ์ž‘์—… 88.3% ์„ฑ๊ณต๋ฅ  ๋‹ฌ์„ฑ
    • Claude-3.5-sonnet: ์žฌ๋ฃŒ๊ณผํ•™ ๋„๋ฉ”์ธ QA ๋ฒค์น˜๋งˆํฌ์—์„œ ์šฐ์ˆ˜ํ•˜๋‚˜ ์‹ค์ œ ์—์ด์ „ํŠธ ์ž‘์—…์—์„œ๋Š” ์˜ˆ์ƒ ์™ธ๋กœ ์ €์กฐ
    • ํ•ต์‹ฌ ํ†ต์ฐฐ: ๋„๋ฉ”์ธ ํŠนํ™” QA ๋Šฅ๋ ฅ์ด ์‹ค๋ฌด์  ์—์ด์ „ํŠธ ์—ญ๋Ÿ‰์œผ๋กœ ์ „ํ™˜๋˜์ง€ ์•Š์Œ
  4. ์‹ค์ œ ์‹คํ—˜ ์„ฑ๊ณต: AFM ์บ˜๋ฆฌ๋ธŒ๋ ˆ์ด์…˜, ํ‘์—ฐ ์ธต ๊ฐœ์ˆ˜ ๊ณ„์‚ฐ, ๊ทธ๋ž˜ํ•€ ์Šคํ… ์—ฃ์ง€ ๊ณ ํ•ด์ƒ๋„ ์ด๋ฏธ์ง•, HOPG ๋ถ€ํ•˜-์˜์กด์  ๊ฑฐ์น ๊ธฐ ํŠน์„ฑํ™” ๋“ฑ 5๊ฐœ ์‹ค์ œ ์‹คํ—˜ ์ˆ˜ํ–‰

How

Figure 2

๊ทธ๋ฆผ 2: AFMBench ๊ณผ์ œ ๋ถ„ํฌ ๋ฐ ๋ชจ๋“ˆ ํ™œ์šฉ. (a) ๋„๊ตฌ ๋ฐ ์—์ด์ „ํŠธ ์š”๊ตฌ์‚ฌํ•ญ ๋ถ„ํฌ (b) ์ž‘์—… ๋ณต์žก๋„ ๋ถ„๋ฅ˜ (c) ๋ชจ๋“ˆ๋ณ„ ํ™œ์šฉ ๋นˆ๋„ (d-e) ์ž‘์—… ์œ ํ˜• ๋ฐ ๋ณต์žก๋„ ์˜ˆ์‹œ

Originality

Limitation & Further Study

Evaluation

Novelty: 4.5/5 Technical Soundness: 4/5 Significance: 4.5/5 Clarity: 4/5 Overall: 4.2/5

์ดํ‰: ๋ณธ ๋…ผ๋ฌธ์€ LLM ๊ธฐ๋ฐ˜ ์ž๋™ํ™” ์‹คํ—˜์‹ค์˜ ์‹ ๋ขฐ์„ฑ์„ ์ฒด๊ณ„์ ์œผ๋กœ ๊ฒ€์ฆํ•˜๋Š” ํ˜„์‹ค์ ์ด๊ณ  ์ค‘์š”ํ•œ ์—ฐ๊ตฌ๋กœ, ๋„๋ฉ”์ธ QA ์„ฑ๋Šฅ๊ณผ ์‹ค๋ฌด ๋Šฅ๋ ฅ์˜ ๋ถˆ์ผ์น˜ ํ˜„์ƒ ๊ฐ™์€ ์ค‘์š”ํ•œ ํ†ต์ฐฐ์„ ์ œ์‹œํ•œ๋‹ค. ๋‹ค๋งŒ AFM ํŠนํ™” ํ‰๊ฐ€, ํ”„๋กฌํ”„ํŠธ ๋ถˆ์•ˆ์ •์„ฑ์˜ ๊ทผ๋ณธ ์›์ธ ๋ถ„์„ ๋ฏธํก, ๊ทธ๋ฆฌ๊ณ  ํ˜„์žฌ ๋ชจ๋ธ์˜ ์ €์กฐํ•œ ์„ฑ๋Šฅ์œผ๋กœ ์ธํ•ด ์‹ค์ œ ๋ฐฐํฌ์— ์ด๋ฅด๋Š” ๊ฒฝ๋กœ๋Š” ์•„์ง ๋ช…ํ™•ํ•˜์ง€ ์•Š๋‹ค๋Š” ์ ์ด ํ•œ๊ณ„์ด๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
Autonomous microscopy experiments ๋…ผ๋ฌธ์€ ์ž๋™ํ™”๋œ ๊ณผํ•™ ์‹คํ—˜ ํ™˜๊ฒฝ์—์„œ LLM ํ™œ์šฉ ๊ตฌ์กฐ๋ฅผ ์ œ์‹œํ•˜๋ฉฐ, Lego-prover์˜ ์ฆ๋ช… ์ž๋™ํ™” ๊ธฐ์ˆ  ์ ์šฉ์˜ ๊ฐœ๋…์  ํ† ๋Œ€๊ฐ€ ๋ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
Self-Driving Laboratories for Chemistry and Materials Science ๋…ผ๋ฌธ์€ ์‹คํ—˜์‹ค ์ž๋™ํ™” ๋ฐ AI ํ™œ์šฉ ๋ฐฉํ–ฅ์„ ํญ๋„“๊ฒŒ ์ •๋ฆฌํ•˜์—ฌ, AI ๊ธฐ๋ฐ˜ ์ž๋™ ํ˜„๋ฏธ๊ฒฝ ์‹คํ—˜ ํ”„๋ ˆ์ž„์›Œํฌ์˜ ๊ธฐ๋ฐ˜์„ ์ œ๊ณตํ•œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
Building machines that learn and think with people ๋…ผ๋ฌธ์€ ์ธ๊ฐ„-๊ธฐ๊ณ„ ํ˜‘์—…์ด ์‹คํ—˜ยท๋ฐœ๊ฒฌ ์ž๋™ํ™”์— ๋ฏธ์น˜๋Š” ์˜ํ–ฅ์„ ์ด๋ก ์ ์œผ๋กœ ๋ถ„์„ํ•˜์—ฌ, AI ๊ธฐ๋ฐ˜ ํ˜„๋ฏธ๊ฒฝ ์‹คํ—˜์‹ค ๊ตฌ์ถ•์˜ ๊ธฐ์ˆ ์ ยท์ธ๊ฐ„์  ํ•œ๊ณ„๋ฅผ ๊ณ ์ฐฐํ•  ์ˆ˜ ์žˆ๊ฒŒ ํ•œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
Autonomous microscopy experiments ๋…ผ๋ฌธ์€ ์ƒ๋ช…๊ณผํ•™ ์˜์—ญ์˜ ์‹คํ—˜ ์ž๋™ํ™”์™€ ๋„๋ฉ”์ธ ํŠนํ™” LLM ํ™œ์šฉ ์‚ฌ๋ก€๋กœ, SpatialAgent์˜ ์ „์ฒด ์ƒ๋ฌผํ•™ ์—ฐ๊ตฌ ์ž๋™ํ™”์— ์ด๋ก ์  ํ† ๋Œ€๋ฅผ ์ œ๊ณตํ•œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
LLM ๊ธฐ๋ฐ˜ ๊ณผํ•™ ์ž๋™ํ™” ์—์ด์ „ํŠธ์˜ ์ด๋ก ์  ๊ธฐ๋ฐ˜์„ ์ œ๊ณตํ•˜๋Š” ์—ฐ๊ตฌ์ด๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
๊ณผํ•™ ์‹คํ—˜ ์ž๋™ํ™”๋ฅผ ์œ„ํ•œ ์ž์œจ ์—์ด์ „ํŠธ์˜ ๊ธฐ๋ฐ˜ ๊ฐœ๋…์„ ์ œ๊ณตํ•˜๋Š” ์—ฐ๊ตฌ์ด๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
Autonomous microscopy experiments ๋…ผ๋ฌธ์€ VLM-AI ๊ธฐ๋ฐ˜ ์‹คํ—˜ ์ž๋™ํ™” ๊ฐœ๋…์„ ์ดˆ๊ธฐ๋ถ€ํ„ฐ ์ œ์‹œํ•˜์˜€์œผ๋ฉฐ, EAA ๊ฐœ๋…์˜ ์ถœ๋ฐœ์  ์—ญํ• ์„ ํ•ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
139 ๋…ผ๋ฌธ์€ ์‹คํ—˜์‹ค ์ž๋™ํ™”์— LLM์„ ํ™œ์šฉํ•œ ์ž๋™ ํ˜„๋ฏธ๊ฒฝ ์‹คํ—˜ ์‚ฌ๋ก€๋ฅผ ์†Œ๊ฐœํ•ด, ์‹œ๊ฐ ์ž…๋ ฅ ๊ธฐ๋ฐ˜ ๊ฐ•ํ™”ํ•™์Šต ์—์ด์ „ํŠธ์ธ LaueRL ๋ฐฉ์‹์˜ ์ถœ๋ฐœ์ ์ด ๋ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
Autonomous microscopy experiments through large language models ๋…ผ๋ฌธ์€ LLM์„ ํ™œ์šฉํ•œ ์ž๋™ํ™”๋œ ์ƒ๋ช…๊ณผํ•™ ์‹คํ—˜์˜ ๋˜๋‹ค๋ฅธ ๋ฐฉ์‹์ด๋ฏ€๋กœ ๋น„๊ต ๊ฐ€์น˜๊ฐ€ ์žˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
์‹คํ—˜์‹ค ์ž๋™ํ™”๋ฅผ ์œ„ํ•œ AI ์‹œ์Šคํ…œ์˜ ์œ ์‚ฌํ•œ ์ ‘๊ทผ ๋ฐฉ์‹์„ ์ œ์‹œํ•œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
LLM ๊ธฐ๋ฐ˜ ๋ฉ€ํ‹ฐ์—์ด์ „ํŠธ๋กœ ์‹คํ—˜์‹ค ์‹คํ—˜์„ ์ž๋™ํ™”ํ•˜๋Š” ์œ ์‚ฌํ•œ ์—ฐ๊ตฌ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
AtomAgents ๋…ผ๋ฌธ์€ ๋ฌผ๋ฆฌ ๊ธฐ๋ฐ˜ ์‹œ๋ฎฌ๋ ˆ์ด์…˜๊ณผ LLM ๊ฒฐํ•ฉ ์—์ด์ „ํŠธ๋กœ ์†Œ์žฌ/ํ•ฉ๊ธˆ ์„ค๊ณ„ ์ž๋™ํ™”๋ฅผ, ๋ณธ ๋…ผ๋ฌธ์€ ์ž๋™ ํ˜„๋ฏธ๊ฒฝ ์‹คํ—˜ ์ž๋™ํ™”๋ฅผ ๊ฐ๊ฐ ๊ตฌํ˜„ํ•œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
SciAgents ๋…ผ๋ฌธ์€ ํ˜„๋ฏธ๊ฒฝ ๋“ฑ ๋‹ค์–‘ํ•œ ์—ฐ๊ตฌ ์ž๋™ํ™” ์‹œ๋‚˜๋ฆฌ์˜ค์—์„œ LLM ๊ธฐ๋ฐ˜ ์›Œํฌํ”Œ๋กœ์šฐ ์‹คํ—˜ ๋ฐ ๋ฒค์น˜๋งˆํฌ๋ฅผ ์ œ๊ณตํ•˜์—ฌ, AILA ์‹œ์Šคํ…œ์˜ ์„ฑ๊ณผ์™€ ํ•œ๊ณ„๋ฅผ ๋Œ€์กฐ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๊ณผํ•™ ์‹คํ—˜ ์›Œํฌํ”Œ๋กœ์šฐ ์ž๋™ํ™”๋ฅผ ์œ„ํ•œ LLM ์—์ด์ „ํŠธ์˜ ๊ด€๋ จ ์—ฐ๊ตฌ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
EAA ๋…ผ๋ฌธ์€ ์ž๋™ํ™”๋œ ์†Œ์žฌ ๋ถ„์„ ์‹คํ—˜ ๋ฒค์น˜๋งˆํฌ๋ฅผ ๋‹ค๋ฃจ์–ด ํ˜„๋ฏธ๊ฒฝ ์‹คํ—˜ ์ž๋™ํ™” ๋ฐ ์‹ค์ œ ๋ฒค์น˜๋งˆํ‚น ํ”„๋ ˆ์ž„์›Œํฌ ์—ฐ๊ตฌ์™€ ํ˜„์žฅ์  ์—ฐ๊ฒฐ๊ณ ๋ฆฌ๋ฅผ ํ˜•์„ฑํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
Semi-supervised GAN for smart microscopy ๋…ผ๋ฌธ์€ ํ˜„๋ฏธ๊ฒฝ ์ด๋ฏธ์ง• ๋ฐ ์‹คํ—˜ ์ž๋™ํ™” ๋ถ„์•ผ์—์„œ LLMยท๋”ฅ๋Ÿฌ๋‹์„ ๋‹ค์–‘ํ•œ ํ˜•ํƒœ๋กœ ์‘์šฉํ•œ ํ•ด๊ฒฐ๋ฐฉ์•ˆ์„ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋‹ค๋ฅธ ์ž๋™ ์‹คํ—˜์‹ค ํ™˜๊ฒฝ์—์„œ LLM ํ™œ์šฉ์„ ์‹œ๋„ํ•œ ์—ฐ๊ตฌ๋กœ, ์ž๋™ํ™”๋œ ์‹คํ—˜ ์‹คํ–‰์˜ ์ถ”๊ฐ€ ์‚ฌ๋ก€๋ฅผ ์ œ์‹œํ•œ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
Executable Code Actions Elicit Better LLM Agents ๋…ผ๋ฌธ์—์„œ ๊ณผํ•™ ์‹คํ—˜์— ํ•„์š”ํ•œ ์ฝ”๋“œ ์ƒ์„ฑ ๋ฐ ์‹คํ–‰ ๋Šฅ๋ ฅ์„ ์ƒˆ๋กญ๊ฒŒ ๋ถ„์„ํ•˜์—ฌ, LLM ๊ธฐ๋ฐ˜ ์ž๋™ ํ˜„๋ฏธ๊ฒฝ ์›Œํฌํ”Œ๋กœ์šฐ์˜ ์‹ค์ œ ์ž‘๋™๊ฐ€๋Šฅ์„ฑ๊ณผ ์—ฐ๊ฒฐ๋œ๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •