Table-llm-specialist: Language model specialists for tables using iterative generator-validator finetuning

์ €์ž: Ziwei Ji, Tiezheng Yu, Yan Xu, Nayeon Lee, Etsuko Ishii, Pascale Fung | ๋‚ ์งœ: 2024 | URL: https://arxiv.org/abs/2410.12164 📄 PDF


Essence

ํ…Œ์ด๋ธ” ์ž‘์—…(๋ฐ์ดํ„ฐ ์ •์ œ, NL-to-SQL ๋“ฑ)์— ํŠนํ™”๋œ ์–ธ์–ด๋ชจ๋ธ์„ ๋งŒ๋“ค๊ธฐ ์œ„ํ•ด ์ƒ์„ฑ-๊ฒ€์ฆ ์ด์ค‘ ์ž‘์—…์˜ ๋ฐ˜๋ณต์  ๋ฏธ์„ธ์กฐ์ • ํŒจ๋Ÿฌ๋‹ค์ž„์ธ Table-Specialist๋ฅผ ์ œ์•ˆํ•œ๋‹ค. ์ˆ˜๋™ ๋ ˆ์ด๋ธ” ์—†์ด ์ž๋™ ์ƒ์„ฑ๋œ ํ›ˆ๋ จ ๋ฐ์ดํ„ฐ๋กœ ๊ฐ•๋ ฅํ•œ ์„ฑ๋Šฅ๊ณผ ์ผ๋ฐ˜ํ™”๋ฅผ ๋™์‹œ์— ๋‹ฌ์„ฑํ•œ๋‹ค.

Motivation

Achievement

Figure 3

Figure 3: โ€œTable-Specialist fine-tuningโ€: Quality vs. latency

How

Figure 5

Figure 5: Architecture of Table-Specialist using โ€œGenerator-Validatorโ€ fine-tuning for a given task type ๐‘‡(Error detecti

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 4/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ํ…Œ์ด๋ธ” ์ž‘์—…์˜ ์ด์ค‘์„ฑ์„ ์ฐฝ์˜์ ์œผ๋กœ ํ™œ์šฉํ•˜์—ฌ ์ˆ˜๋™ ๋ ˆ์ด๋ธ” ์—†์ด๋„ ๋†’์€ ์„ฑ๋Šฅ๊ณผ ์ผ๋ฐ˜ํ™”๋ฅผ ๋™์‹œ์— ๋‹ฌ์„ฑํ•œ ํ˜์‹ ์  ์—ฐ๊ตฌ์ด๋‹ค. Microsoft Excel ํ†ตํ•ฉ ๋“ฑ ์‹ค๋ฌด ์ ์šฉ ๊ฐ€๋Šฅ์„ฑ์ด ๋†’์œผ๋ฉฐ, ํŠนํ™”-์ผ๋ฐ˜ํ™” trade-off ๋ฌธ์ œ ํ•ด๊ฒฐ์— ์ƒˆ๋กœ์šด ๋ฐฉํ–ฅ์„ ์ œ์‹œํ•œ๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
few-shot ํ™˜๊ฒฝ์—์„œ RAG์™€ LLM ๊ฒฐํ•ฉ ๋ชจ๋ธ์˜ ์„ฑ๋Šฅ ์—ฐ๊ณ„๋ฅผ ๋ถ„์„ํ•ด, Table-Specialist์˜ ๋ฐ์ดํ„ฐ ์ƒ์„ฑยทํ™•์žฅ ๋ฐฉ๋ฒ•์— ๊ทผ๊ฑฐ๋ฅผ ์ œ๊ณตํ•œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
์–ธ์–ด๋ชจ๋ธ์˜ ํ…Œ์ด๋ธ” ์ดํ•ด๋ฅผ ์œ„ํ•œ ๋ฐฉ๋ฒ•๋ก ์  ๊ธฐ๋ฐ˜์„ ์ œ๊ณตํ•œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
1092๋Š” ๊ณผํ•™ ํ…Œ์ด๋ธ” ์ดํ•ด๋ฅผ ์œ„ํ•œ ๋Œ€๊ทœ๋ชจ LLM ์ „๋ฌธํ™”์˜ ๊ธฐ์ดˆ ๊ฐœ๋…์„ ์ œ๊ณตํ•˜์—ฌ 783์—์„œ ์ฐจํŠธ ์ „๋ฌธ๊ฐ€ ๋ชจ๋ธ ๊ฐœ๋ฐœ์˜ ๊ธฐ๋ฐ˜์ด ๋ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
787์€ ํ…Œ์ด๋ธ” ์ดํ•ด๋ ฅ ํ–ฅ์ƒ์„ ์œ„ํ•œ LLM ํ™œ์šฉ ๋ฐฉ์•ˆ์„ ์ œ์•ˆํ•ด, 1092์˜ ํ…Œ์ด๋ธ” ํŠนํ™” ์ „๋ฌธ๊ฐ€ ๋ชจ๋ธ๊ณผ ๊ธด๋ฐ€ํ•˜๊ฒŒ ์—ฐ๊ฒฐ๋œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
ํ…Œ์ด๋ธ” ์ž‘์—… ํŠนํ™” ์–ธ์–ด๋ชจ๋ธ ํ•™์Šต์˜ ๋Œ€์•ˆ์  ๋ฐฉ๋ฒ•๋ก ์„ ์ œ์‹œํ•œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
ํ…Œ์ด๋ธ” ๊ด€๋ จ LLM ํŠนํ™” ์ž‘์—…์—์„œ ์œ ์‚ฌํ•œ ๋ฌธ์ œ๋ฅผ ๋‹ค๋ฅธ ๋ฐฉ์‹์œผ๋กœ ์ ‘๊ทผํ•œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
NL-to-SQL ๋“ฑ ํ…Œ์ด๋ธ” ์ž‘์—…์—์„œ ๋Œ€์•ˆ์  ๋ฏธ์„ธ์กฐ์ • ์ ‘๊ทผ๋ฒ•์„ ์ œ์‹œํ•œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
์ž๋™ ๋ฐ์ดํ„ฐ ์ƒ์„ฑ ๊ธฐ๋ฐ˜ ๋ชจ๋ธ ํ›ˆ๋ จ์—์„œ ์œ ์‚ฌํ•œ ์ ‘๊ทผ๋ฒ•์„ ์ทจํ•œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
Table-llm-specialist ๋…ผ๋ฌธ์€ ๋‹ค์–‘ํ•œ ํ…Œ์ด๋ธ” ์œ ํ˜•๋ณ„ ํŠนํ™” LLM ๋ชจ๋ธ๋ง ๋ฐฉ๋ฒ•๋ก ์œผ๋กœ Table-of-Tree ๊ฐœ๋…๊ณผ ์—ฐ๊ฒฐ๋ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
Table-llm-specialist ๋…ผ๋ฌธ์€ ๋‹ค์–‘ํ•œ ํ…Œ์ด๋ธ” ์œ ํ˜•์— ํŠนํ™”๋œ LLM ์‚ฌ์šฉ๋ฒ•์„ ์ œ์‹œํ•˜์—ฌ TableMaster์™€ ๋น„๊ต ๊ฐ€์น˜๊ฐ€ ์žˆ์Šต๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
119๋Š” ์–ธ์–ด๋ชจ๋ธ์„ ํ†ตํ•œ ํฌ๋กœ์Šค๋ง๊ตฌ์–ผ ์ •๋ ฌ ๋ฐ ์ž‘์—… ์ž๋™ํ™”๋ฅผ ๋…ผ์˜ํ•˜๋ฉฐ, 1092์˜ ๋ฐ˜๋ณต ๋ฏธ์„ธ์กฐ์ • ์ ‘๊ทผ์— ์‹ค์šฉ์  ํ™•์žฅ ๋ฐฉ์•ˆ์ด ๋œ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
ํ‘œ-ํŠนํ™” ์–ธ์–ด๋ชจ๋ธ(LM specialist) ํ†ตํ•ด ํ‘œ์ถ”๋ก (robustness) ํ•œ๊ณ„๋ฅผ ๋ณด์™„ํ•˜๊ณ  ๋ณธ ๋ฒค์น˜๋งˆํฌ ํ™œ์šฉ์„ ๊ณ ๋„ํ™”ํ•ฉ๋‹ˆ๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
๊ณผํ•™ ๋ฐ์ดํ„ฐ๋ฅผ ๋‹ค์–‘ํ•œ ํ…Œ์ด๋ธ” ํ˜•ํƒœ๋กœ ์ฒ˜๋ฆฌ/์ดํ•ดํ•˜๋Š” ๋ฒค์น˜๋งˆํฌ๋ฅผ ํ™œ์šฉํ•˜์—ฌ Table-LLM Specialist์˜ ์‹ค์ œ ์ ์šฉ ์‚ฌ๋ก€์™€ ํ•œ๊ณ„๋ฅผ ๊ฒ€ํ† ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •