A survey on table-and-text hybridqa: Concepts, methods, challenges and future directions

์ €์ž: Dingzirui Wang, Longxu Dou, Wanxiang Che | ๋‚ ์งœ: 2022 | DOI: N/A 📄 PDF


Essence

ํ…Œ์ด๋ธ”๊ณผ ํ…์ŠคํŠธ ํ˜ผํ•ฉ ์งˆ์˜์‘๋‹ต(Table-and-Text Hybrid Question Answering, HybridQA)์€ ์ด์งˆ์  ๋ฐ์ดํ„ฐ๋ฅผ ๊ฒฐํ•ฉํ•˜์—ฌ ๋‹ต๋ณ€์„ ์ƒ์„ฑํ•˜๋Š” ๋„์ „์ ์ธ NLP ๊ณผ์ œ์ด๋ฉฐ, ๋ณธ ๋…ผ๋ฌธ์€ ํ˜„์žฌ๊นŒ์ง€์˜ ๋ฒค์น˜๋งˆํฌ, ๋ฐฉ๋ฒ•๋ก , ํ•ต์‹ฌ ๊ณผ์ œ, ํ–ฅํ›„ ๋ฐฉํ–ฅ์„ ์ฒด๊ณ„์ ์œผ๋กœ ์ •๋ฆฌํ•œ ์ตœ์ดˆ์˜ ํฌ๊ด„์  ์„ค๋ฌธ์ด๋‹ค.

Motivation

Achievement

Figure 2

HybridQA์˜ ์˜ˆ์‹œ (ํ…Œ์ด๋ธ”๊ณผ ํ…์ŠคํŠธ ํ†ตํ•ฉ ์ถ”๋ก )

  1. ์ฒซ ๋ฒˆ์งธ ํฌ๊ด„์  ์„ค๋ฌธ: HybridQA ๊ด€๋ จ ๋ฒค์น˜๋งˆํฌ(HybridQA, OTT-QA, FinQA, TAT-QA, TAT-HQA, MultiHiertt, GeoTSQA), ๋ฐฉ๋ฒ•๋ก , ๊ณผ์ œ๋ฅผ ํ†ตํ•ฉ์ ์œผ๋กœ ์ •๋ฆฌ
  2. ์ฒด๊ณ„์  ๋น„๊ต ๋ถ„์„: ๊ธฐ์กด ์‹œ์Šคํ…œ์˜ ์žฅ๋‹จ์ ์„ ๋ช…ํ™•ํžˆ ํ•˜๋Š” ํ•ฉ๋ฆฌ์  ๋น„๊ต ํ”„๋ ˆ์ž„์›Œํฌ ์ œ์‹œ (Table 1์—์„œ 7๊ฐœ ๋ฒค์น˜๋งˆํฌ๋ฅผ 6๊ฐ€์ง€ ์ฐจ์›์œผ๋กœ ๋น„๊ต)
  3. ๊ณผ์ œ ์‹ฌ์ธต ๋ถ„์„:
    • ๊ฒ€์ƒ‰ ํšจ์œจ์„ฑ๊ณผ ํšจ๊ณผ์„ฑ(Retrieval Effectiveness and Efficiency)
    • ํ…Œ์ด๋ธ” ์…€ ์œ„์น˜ ์ธ์‹(Cell Location of Tabular Evidence)
    • ์ด์งˆ์  ์ฆ๊ฑฐ ๊ด€๊ณ„ ๋ชจ๋ธ๋ง(Relation Modeling of Heterogeneous Evidence)
    • ๋‹ค์ค‘ ํ™‰ ์ถ”๋ก (Multi-Hop Reasoning)

How

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 4/5 Significance: 4.5/5 Clarity: 4.5/5 Overall: 4.25/5

์ดํ‰: HybridQA ๋ถ„์•ผ์˜ ์ฒซ ํฌ๊ด„์  ์„ค๋ฌธ์œผ๋กœ์„œ ๋ฒค์น˜๋งˆํฌยท๋ฐฉ๋ฒ•๋ก ยท๊ณผ์ œ๋ฅผ ์ฒด๊ณ„์ ์œผ๋กœ ์ •๋ฆฌํ•œ ์˜๋ฏธ ์žˆ๋Š” ๊ธฐ์—ฌ์ด๋‚˜, ์ดˆ๊ธฐ LLM ์‹œ๋Œ€์˜ ๊ธ‰์†ํ•œ ๋ฐฉ๋ฒ•๋ก  ๋ฐœ์ „์„ ์ถฉ๋ถ„ํžˆ ๋ฐ˜์˜ํ•˜์ง€ ๋ชปํ•œ ์ ๊ณผ ์‚ฐ์—… ์ ์šฉ ๊ด€์ ์˜ ๋ถ„์„์ด ๋ฏธํกํ•œ ๊ฒƒ์ด ์•„์‰ฌ์šด ์ ์ด๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
ํ…Œ์ด๋ธ”๊ณผ ํ…์ŠคํŠธ ํ˜ผํ•ฉ ์งˆ์˜์‘๋‹ต์˜ ๊ธฐ์ดˆ๊ฐ€ ๋˜๋Š” ๋ฐ์ดํ„ฐ์…‹์ด๋‚˜ ๋ฐฉ๋ฒ•๋ก ์„ ์ œ๊ณตํ•œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
์ด ๋…ผ๋ฌธ์—์„œ ์‚ฌ์šฉ๋œ ํŒŒ์ดํ”„๋ผ์ธ์˜ ๋ฒค์น˜๋งˆํฌ์™€ ํšจ์œจ ํ‰๊ฐ€๊ฐ€ LLM ๊ธฐ๋ฐ˜ ์‹ ์†Œ์žฌ ๋ฐœ๊ฒฌ์˜ ์„ฑ๋Šฅ ํ‰๊ฐ€์— ์ง์ ‘์ ์œผ๋กœ ๊ธฐ์ดˆ๊ฐ€ ๋ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
์ด์งˆ์  ๋ฐ์ดํ„ฐ ๊ฒฐํ•ฉ ์งˆ์˜์‘๋‹ต์— ๋‹ค๋ฅธ ์ ‘๊ทผ ๋ฐฉ์‹์„ ์ ์šฉํ•œ ์—ฐ๊ตฌ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
RAG๋ฅผ ํ™œ์šฉํ•œ ํ•˜์ด๋ธŒ๋ฆฌ๋“œ QA ๋ฐ ํ‘œ+ํ…์ŠคํŠธ ๊ธฐ๋ฐ˜ ์งˆ์˜์‘๋‹ต ํ•ด๊ฒฐ์— ์ดˆ์ ์„ ๋งž์ถ˜ ์ตœ์‹  ๋ฒค์น˜๋งˆํฌ ๋ถ„์„ ๋…ผ๋ฌธ์ž…๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
ChartInstruct ๋…ผ๋ฌธ์€ ์ฐจํŠธ ๊ธฐ๋ฐ˜ ๋‹ค์ค‘๋ชจ๋‹ฌ Q&A์™€ reasoning์„ ํƒ๊ตฌํ•˜์—ฌ, ํ•˜์ด๋ธŒ๋ฆฌ๋“œ QA์˜ ๋‹ค๋ฅธ ์ ‘๊ทผ๋ฒ•์„ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
787์€ ํ‘œ์™€ ํ…์ŠคํŠธ์˜ ํ•˜์ด๋ธŒ๋ฆฌ๋“œ QA์—์„œ LLM์„ ํ™œ์šฉํ•œ ํ…Œ์ด๋ธ” ์ดํ•ด ๋ฐ ์ถ”๋ก  ๊ธฐ๋ฒ•์„ ์ค‘์ ์ ์œผ๋กœ ๋‹ค๋ฃน๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๊ตฌ์กฐํ™”๋œ ๋ฐ์ดํ„ฐ์™€ ๋น„๊ตฌ์กฐํ™” ํ…์ŠคํŠธ๋ฅผ ๊ฒฐํ•ฉํ•˜๋Š” ๋‹ค๋ฅธ ๋ฐฉ๋ฒ•๋ก ์„ ์ œ์‹œํ•œ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
ํ…Œ์ด๋ธ”-ํ…์ŠคํŠธ ํ•˜์ด๋ธŒ๋ฆฌ๋“œ QA์˜ ํŠน์ • ์ธก๋ฉด์„ ํ™•์žฅํ•œ ์—ฐ๊ตฌ์ด๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
ํ˜ผํ•ฉ ์งˆ์˜์‘๋‹ต ๋ฐฉ๋ฒ•๋ก ์„ ํŠน์ • ๋„๋ฉ”์ธ์— ์ ์šฉํ•œ ์—ฐ๊ตฌ์ด๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
149๋Š” ํ‘œ ํ˜•์‹ QA์™€ ๊ด€๋ จ๋œ ํ•˜์ด๋ธŒ๋ฆฌ๋“œ ๋ฐ์ดํ„ฐ์—์„œ ํ˜‘์—…์  ํƒ์ƒ‰ ๊ธฐ๋ฐ˜ ๊ฐ€์„ค ํ‰๊ฐ€๋ฅผ ์‹œ๋„, HybridQA์˜ ์‹ค์ œ ์ ์šฉ ์‚ฌ๋ก€๋ฅผ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •