LLM-SR: Scientific Equation Discovery via Programming with Large Language Models

์ €์ž: Parshin Shojaee, Kazem Meidani, Shashank Gupta, Amir Barati Farimani, Chandan K Reddy | ๋‚ ์งœ: 2024 | DOI: 10.48550/ARXIV.2404.18400 📄 PDF


Essence

Figure 1

Figure 1: The LLM-SR framework, consisting of three main steps: (a) Hypothesis Generation, where

LLM-SR์€ ๋Œ€๊ทœ๋ชจ ์–ธ์–ด๋ชจ๋ธ์˜ ๊ณผํ•™ ์ง€์‹๊ณผ ์ฝ”๋“œ ์ƒ์„ฑ ๋Šฅ๋ ฅ์„ ํ™œ์šฉํ•˜์—ฌ ๋ฐ์ดํ„ฐ๋กœ๋ถ€ํ„ฐ ๊ณผํ•™ ๋ฐฉ์ •์‹์„ ๋ฐœ๊ฒฌํ•˜๋Š” ํ”„๋ ˆ์ž„์›Œํฌ๋กœ, ๋ฐฉ์ •์‹์„ ํ”„๋กœ๊ทธ๋žจ์œผ๋กœ ํ‘œํ˜„ํ•˜๊ณ  ์ง„ํ™” ํƒ์ƒ‰๊ณผ ๊ฒฐํ•ฉํ•œ๋‹ค.

Motivation

Achievement

Figure 3

Figure 3: Best score trajectories of LLM-SR with GPT-3.5 and

How

Figure 1

Figure 1: The LLM-SR framework, consisting of three main steps: (a) Hypothesis Generation, where

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 4/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: LLM-SR์€ ๊ณผํ•™์  ๋ฐฉ์ •์‹ ๋ฐœ๊ฒฌ์ด๋ผ๋Š” ์ค‘์š”ํ•œ ๋ฌธ์ œ์— ๋Œ€ํ•ด LLM์˜ ๊ฐ•์ ์„ ์ฐฝ์˜์ ์œผ๋กœ ํ™œ์šฉํ•˜๋Š” ํ˜์‹ ์ ์ธ ์ ‘๊ทผ๋ฒ•์„ ์ œ์‹œํ•˜๋ฉฐ, ์‹คํ—˜์ ์œผ๋กœ ์šฐ์ˆ˜ํ•œ ์„ฑ๋Šฅ์„ ์ž…์ฆํ–ˆ๋‹ค. LLM ์•”๊ธฐ ์œ„ํ—˜์„ ๊ณ ๋ คํ•œ ์‹ ์ค‘ํ•œ ๋ฒค์น˜๋งˆํฌ ์„ค๊ณ„์™€ ํฌ๊ด„์ ์ธ ํ‰๊ฐ€๊ฐ€ ์ด ์ž‘์—…์˜ ์‹ ๋ขฐ์„ฑ์„ ๋†’์ธ๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
์ˆ˜์น˜๋ฐฉ์ •์‹์˜ ๋Œ€์นญ์„ฑ ๊ธฐ๋ฐ˜ ๋ฐœ๊ฒฌ ๋ฐฉ๋ฒ•๋ก ์„ ์ œ์‹œํ•˜๋ฉฐ, LLM-SR๊ณผ ๊ฐ™์€ ํ”„๋กœ๊ทธ๋ž˜๋ฐ ๊ธฐ๋ฐ˜ ์ž๋™ ๋ฐฉ์ •์‹ ๋ฐœ๊ฒฌ์— ํ•ต์‹ฌ์  ์ด๋ก ์„ ์ œ์‹œํ•œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
LLM-SR ๋…ผ๋ฌธ์€ LLM์ด ํ”„๋กœ๊ทธ๋ž˜๋ฐ์„ ํ†ตํ•œ ๊ณผํ•™ ๋ฐฉ์ •์‹ ๋ฐœ๊ฒฌ์— ์–ด๋–ป๊ฒŒ ์ ‘๊ทผํ•˜๋Š”์ง€์— ๋Œ€ํ•œ ์ด๋ก ์  ๊ทผ๊ฑฐ๋ฅผ ์ œ๊ณตํ•œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
25๋ฒˆ ๋…ผ๋ฌธ์€ ์†Œ์žฌ ๊ณผํ•™์—์„œ์˜ LLMยทFM ์‘์šฉ ๋ฆฌ๋ทฐ๋กœ, LLM-SR ๊ฐ™์ด ๊ณผํ•™ ๋…ผ๋ฆฌ ๋ฐœ๊ฒฌ์— ํ™œ์šฉ๋  ์ธ๊ณต์ง€๋Šฅ ๋ฐฉ๋ฒ•๋ก ์˜ ์ด๋ก ์  ๊ธฐ๋ฐ˜์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
289๋ฒˆ ๋…ผ๋ฌธ์€ ๊ฒฝํ—˜ ๊ธฐ๋ฐ˜/๋ฐ์ดํ„ฐ ๊ธฐ๋ฐ˜ ์ด์ค‘ ์ถ”๋ก  ์ ‘๋ชฉ์„ ํ†ตํ•œ ๊ณผํ•™ ๋ฐฉ์ •์‹ ๋ฐœ๊ฒฌ์„ ์‹œ๋„ํ•˜๋ฏ€๋กœ, 2209์˜ LLM ํ”„๋กœ๊ทธ๋žจ-๊ธฐ๋ฐ˜ ๋ฐฉ์ •์‹ ํƒ์ƒ‰๊ณผ์˜ ๋ฐฉ๋ฒ•๋ก ์  ์ฐจ์ด๋ฅผ ๋น„๊ตํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
LLM-Feynman ๋…ผ๋ฌธ์€ LLM์„ ํ™œ์šฉํ•ด ๊ณผํ•™ ๋ฒ•์น™ ๋ฐœ๊ฒฌ์„ ์ž๋™ํ™”ํ•˜์—ฌ, LLM-SR์ด ์ถ”๊ตฌํ•˜๋Š” ๋ฐฉ์ •์‹ ๋ฐœ๊ฒฌ์˜ ๋˜๋‹ค๋ฅธ ์‹คํ˜„ ์‚ฌ๋ก€์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๊ธฐ๊ณ„ํ•™์Šต๊ณผ ๋ฌผ๋ฆฌ์  ๋Œ€์นญ์„ฑ์„ ๊ฒฐํ•ฉํ•˜์—ฌ ๊ณผํ•™์  ๋ฒ•์น™์„ ๋ฐ์ดํ„ฐ๋กœ๋ถ€ํ„ฐ ํ•™์Šตํ•˜๋Š” ์œ ์‚ฌํ•œ ๋ฐฉ๋ฒ•๋ก ์  ์ ‘๊ทผ์„ ๊ณต์œ ํ•œ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
623๋ฒˆ ๋…ผ๋ฌธ์€ ์›๋ฆฌ ๊ธฐ๋ฐ˜ ๋‹ค์ค‘ ์—์ด์ „ํŠธ ๊ณผํ•™ ๋ฐœ๊ฒฌ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์•ˆํ•˜์—ฌ, 2209์˜ LLM-SR ํ”„๋ ˆ์ž„์›Œํฌ์™€ ํ•จ๊ป˜ ๊ณผํ•™ ๋ฐฉ์ •์‹ ๋ฐœ๊ฒฌ ์‹œ์Šคํ…œ์˜ ์‹ค์งˆ์  ํ™•์žฅ์„ฑ์„ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
LLM-SRBench๋Š” LLM-SR ํ”„๋ ˆ์ž„์›Œํฌ์˜ ์—์ด์ „ํŠธ ๊ธฐ๋ฐ˜ ๊ณผํ•™ ๋ฐฉ์ •์‹ ๋ฐœ๊ฒฌ ์„ฑ๋Šฅ์„ ์ฒด๊ณ„์ ์œผ๋กœ ์ธก์ •ํ•˜๋ฉฐ ์‹ค์ œ ๋ฒค์น˜๋งˆํฌ๋กœ ์ด์–ด์ง„๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
2209๋ฒˆ ๋…ผ๋ฌธ์€ ๊ณผํ•™ ๋ฐฉ์ •์‹ ์ž๋™ ๋ฐœ๊ฒฌ์ด๋ผ๋Š” ์‘์šฉ ์ธก๋ฉด์—์„œ 142๋ฒˆ์˜ ์ž๋™ ์†”๋ฒ„ ์‹œ์Šคํ…œ์ด ์‹ค์ œ ๊ณผํ•™ ๋ฐ์ดํ„ฐ ํ•ด์„ ๋ฐ ๋ฐฉ์ •์‹ ๋ฐœ๊ฒฌ์— ์ง์ ‘ ์ ์šฉ๋  ์ˆ˜ ์žˆ๋Š” ํ™•์žฅ ๊ฐ€๋Šฅ์„ฑ์„ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
2209๋ฒˆ ๋…ผ๋ฌธ์€ ๋Œ€๊ทœ๋ชจ ์–ธ์–ด๋ชจ๋ธ์„ ํ™œ์šฉํ•œ ๊ณผํ•™ ๋ฐฉ์ •์‹ ๋ฐœ๊ฒฌ์—์„œ ํ”„๋กœ๊ทธ๋žจํ™”๋œ ํ…Œ์ด๋ธ” ๊ธฐ๋ฐ˜ ํ‘œ๊ธฐยท์—ฐ์‚ฐ์„ ์‚ฌ์šฉํ•ด 841์˜ ํ…Œ์ด๋ธ” ๊ตฌ์กฐ ์ถ”๋ก ๊ณผ ์—ฐ๊ด€์ด ๊นŠ์Šต๋‹ˆ๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •