DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning

Authors: Laith Alzubaidi, Jinglan Zhang, Amjad J. Humaidi, Ayad Q. Al-Dujaili, Ye Duan, Omran Al-Shamma, José Santamaría, Mohammed A. Fadhel, Muthana Al-Amidie, Laith Farhan | Date: 2024 | URL: https://arxiv.org/abs/2402.17453


Essence

Figure 1. (a) Overview of DS-Agent with CBR based LLMs. (b)

DS-Agent is a framework that combines LLM agents with case-based reasoning (CBR) to carry out automated data science tasks. In the development stage it builds the best ML model it can through iterative improvement; in the deployment stage it reuses past successful cases, which suits low-resource settings.
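The two stages described above can be sketched as a small case-based reasoning loop. This is a minimal illustration under stated assumptions, not the paper's implementation: `Case`, `CaseBank`, `develop`, `deploy`, `revise`, and `evaluate` are hypothetical names, and a word-overlap (Jaccard) score stands in for whatever retriever the real system uses.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class Case:
    """A past solution: task description, the plan that was tried, and its score."""
    task: str
    plan: str
    score: float


def similarity(a: str, b: str) -> float:
    """Jaccard overlap of lowercase word sets (assumed stand-in for a real retriever)."""
    wa, wb = set(a.lower().split()), set(b.lower().split())
    if not wa or not wb:
        return 0.0
    return len(wa & wb) / len(wa | wb)


@dataclass
class CaseBank:
    cases: List[Case] = field(default_factory=list)

    def retrieve(self, task: str) -> Optional[Case]:
        """Return the stored case most similar to the task; ties go to the higher score."""
        if not self.cases:
            return None
        return max(self.cases, key=lambda c: (similarity(task, c.task), c.score))

    def retain(self, case: Case) -> None:
        """Store a solved case so later tasks can reuse it."""
        self.cases.append(case)


def develop(task: str, bank: CaseBank,
            revise: Callable[[str, float], str],
            evaluate: Callable[[str], float],
            steps: int = 3) -> Case:
    """Development stage: retrieve, reuse, then revise and evaluate repeatedly; retain the best."""
    seed = bank.retrieve(task)
    plan = seed.plan if seed else "baseline"
    best = Case(task, plan, evaluate(plan))
    for _ in range(steps):
        candidate = revise(best.plan, best.score)  # hypothetical: an LLM call in practice
        score = evaluate(candidate)                # hypothetical: train and validate a model
        if score > best.score:
            best = Case(task, candidate, score)
    bank.retain(best)
    return best


def deploy(task: str, bank: CaseBank) -> str:
    """Deployment stage: reuse the closest retained plan directly, with no iteration."""
    case = bank.retrieve(task)
    return case.plan if case else "baseline"
```

In this sketch the expensive feedback loop runs only in `develop`; `deploy` performs a single retrieval, which is the sense in which reusing past cases fits a low-resource setting.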

Motivation

Achievement

Figure 4. Success rate of four different agents in the development

How

Figure 2. Comparison between (a) RAG based LLMs and (b) CBR

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 | Technical Soundness: 3/5 | Significance: 4/5 | Clarity: 4/5 | Overall: 4/5

Overall assessment: DS-Agent combines CBR and LLMs creatively, achieving both a substantial performance gain and cost efficiency in data science automation. Its use of Kaggle knowledge and its two-stage pipeline design are practical, and with clear experimental results and an open-source release it is a strong contribution that can spur follow-up research.

Related papers worth reading

Foundational work
The Data Interpreter paper covers the basic structure for implementing an LLM-based automated data science framework and serves as a foundation for DS-Agent.
Foundational work
The DS-Agent paper covers the core components needed to implement an automated data science pipeline in a multi-LLM-agent system.
Foundational work
Provides a theoretical basis for iterative improvement and feedback mechanisms in LLM-based agents.
Alternative approach
DSBench evaluates the limits and growth potential of data science agents and is used to validate and compare the performance of the DS-Agent approach.
Alternative approach
Paper No. 121 proposes a multi-agent framework for automated data science tasks, which helps clarify where its method resembles DS-Agent and where it differs.
Alternative approach
The DS-Agent paper also addresses LLM-based data science automation; its design philosophy, performance, and scope can be compared with those of Data Interpreter.
Alternative approach
Paper No. 293 is an LLM-driven automated data science framework; the problem that No. 540 treats mainly as retrieval of prior methods can be compared here from the perspective of automating the full research lifecycle.
Follow-up work
As an automated data analysis agent, provides context for implementing and evaluating LLM-based analysis performance in practice.
Follow-up work
Provides related methodology that extends DS-Agent's iterative improvement and deployment-stage optimization.
Application
Paper No. 290 covers an LLM-based automated drug discovery agent and offers a real biomedical application of data science agent technology.