A Survey on RAG Meeting LLMs: Towards Retrieval-Augmented Large Language Models

์ €์ž: Wenqi Fan, Yujuan Ding, Liang-bo Ning, Shijie Wang, Hengyun Li | ๋‚ ์งœ: 2024 | DOI: 10.1145/3637528.3671470 📄 PDF


Essence

Figure 1

Figure 1: Retrieval-Augmented Generation (RAG) meets

๋ณธ ๋…ผ๋ฌธ์€ Retrieval-Augmented Generation (RAG)๊ณผ Large Language Models (LLMs)์˜ ํ†ตํ•ฉ์ธ RA-LLMs์— ๋Œ€ํ•œ ์ข…ํ•ฉ์ ์ธ ์„ค๋ฌธ์กฐ์‚ฌ๋กœ, ์•„ํ‚คํ…์ฒ˜, ํ›ˆ๋ จ ์ „๋žต, ์‘์šฉ ๋ถ„์•ผ์˜ ์„ธ ๊ฐ€์ง€ ๊ธฐ์ˆ ์  ๊ด€์ ์—์„œ ๊ธฐ์กด ์—ฐ๊ตฌ๋ฅผ ์ฒด๊ณ„์ ์œผ๋กœ ๋ฆฌ๋ทฐํ•œ๋‹ค.

Motivation

Achievement

Figure 2

Figure 2: Representing RAG and RA-LLMs methods organized by their main design focus, proposed time and impact (shown by

How

Figure 3

Figure 3: Illustration of the basic Retrieval-Augmented Large Language Models (RA-LLMs) framework for a specific QA task

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 4/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ๋ณธ ๋…ผ๋ฌธ์€ RAG์™€ LLMs์˜ ํ†ตํ•ฉ์ด๋ผ๋Š” ์‹œ๋Œ€์  ์š”๊ตฌ์— ๋ถ€์‘ํ•˜์—ฌ, ๊ธฐ์ˆ ์  ๊ด€์ ์—์„œ ๊ฐ€์žฅ ์ฒด๊ณ„์ ์ด๊ณ  ํฌ๊ด„์ ์ธ ์„ค๋ฌธ์กฐ์‚ฌ๋ฅผ ์ œ๊ณตํ•œ๋‹ค. Hallucination ๋ฌธ์ œ ํ•ด๊ฒฐ, ์ตœ์‹  ์ •๋ณด ํ™œ์šฉ, ๋„๋ฉ”์ธ ํŠนํ™” ์‘์šฉ ๋“ฑ์˜ ์‹ค์ œ ๊ฐ€์น˜์™€ ํ•จ๊ป˜ ์•„ํ‚คํ…์ฒ˜-ํ›ˆ๋ จ-์‘์šฉ์ด๋ผ๋Š” ๋ช…ํ™•ํ•œ ๋ถ„๋ฅ˜ ์ฒด๊ณ„๋ฅผ ์ œ์‹œํ•จ์œผ๋กœ์จ RA-LLMs ์—ฐ๊ตฌ ๋ถ„์•ผ์˜ ์ค‘์š”ํ•œ ๊ธฐ์ค€์ ์ด ๋  ๊ฒƒ์œผ๋กœ ๊ธฐ๋Œ€๋œ๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
659 REALM์€ ์ดˆ๊ธฐ RAG ๊ธฐ๋ฐ˜ ์–ธ์–ด๋ชจ๋ธ ์ œ์•ˆ ๋…ผ๋ฌธ์œผ๋กœ 034์˜ RAG ๋ฐœ์ „์‚ฌ ์„œ์ˆ ์— ๊ทผ๋ณธ์  ํ† ๋Œ€๋ฅผ ์ œ๊ณตํ•œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
RAG์˜ ๊ธฐ์ดˆ๊ฐ€ ๋˜๋Š” ๊ฒ€์ƒ‰ ๋ฐ ์ƒ์„ฑ ๋ฐฉ๋ฒ•๋ก ์„ ์ œ๊ณตํ•˜๋Š” ์—ฐ๊ตฌ์ด๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
675 ์—ญ์‹œ Retrieval-Augmented Generation(RAG) ์‹œ์Šคํ…œ ์„œ๋ฒ ์ด๋กœ, 034์˜ ๋ถ„์„๊ณผ ๊ทผ๊ฑฐ ๋ฐ ๊ด€๋ จ ์˜์—ญ ํ™•์žฅ์— ๋„์›€์„ ์ค€๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
RAG์™€ LLM ํ†ตํ•ฉ ๋ฐ ํ‰๊ฐ€ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ข…ํ•ฉ์ ์œผ๋กœ ๋‹ค๋ฃจ๋ฏ€๋กœ, 366 ๋…ผ๋ฌธ์˜ ๋ฐฉ๋ฒ•๋ก ์  ๊ธฐ๋ฐ˜์„ ์ดํ•ดํ•˜๋Š” ๋ฐ ๋„์›€๋จ.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
RAG ์‹œ์Šคํ…œ์˜ ๊ธฐ๋ณธ ๊ฐœ๋…๊ณผ ๋ฐฉ๋ฒ•๋ก ์„ ์ œ๊ณตํ•˜๋Š” Agentic RAG ์„œ๋ฒ ์ด์˜ ํ•ต์‹ฌ ๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ์ด๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
034๋Š” ๋Œ€ํ˜• ์–ธ์–ด๋ชจ๋ธ ๊ธฐ๋ฐ˜ ๊ฒ€์ƒ‰ยท์ƒ์„ฑ ์‹œ์Šคํ…œ(RAG)์˜ ํ˜„ํ™ฉ์„ ๋‹ค๋ฃจ์–ด, 3277์ฒ˜๋Ÿผ ๋ฐ”์ด๋Ÿฌ์Šค ์œ ์ „์ฒดยท๋‹จ๋ฐฑ์งˆ ๋ณ€์ด ํƒ์ƒ‰ ์ž๋™ํ™”์— ๊ธฐ๋ฐ˜๊ธฐ์ˆ ์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
LLM๊ณผ ์™ธ๋ถ€ ์ง€์‹ ํ†ตํ•ฉ์— ๊ด€ํ•œ ์œ ์‚ฌํ•œ ์ฃผ์ œ๋ฅผ ๋‹ค๋ฅธ ๊ด€์ ์—์„œ ๋‹ค๋ฃจ๋Š” ์—ฐ๊ตฌ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
์ƒ์„ฑํ˜• ๋Œ€๊ทœ๋ชจ ์–ธ์–ด๋ชจ๋ธ์˜ ๋ฐœ์ „ ๊ณผ์ •์„ ๋‹ค๋ฃจ๋Š” ๊ด€๋ จ ์„œ๋ฒ ์ด ์—ฐ๊ตฌ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
034 ๋…ผ๋ฌธ์€ Retrieval-Augmented Generation(RAG) ๊ธฐ๋ฐ˜์˜ ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ์ƒ์„ฑ ๋ฐฉ์‹์„ ๋‹ค๋ฃจ๋ฉฐ, 807์˜ ๋น„๋””์˜ค ์ค‘์‹ฌ ์„ค๋ช… ๋Œ€์‹  ๊ฒ€์ƒ‰๊ธฐ๋ฐ˜ ์ง€์‹ ๋„์ถœ ๋ชจ๋ธ์ด๋ผ๋Š” ๋Œ€์กฐ์  ์‹œ๊ฐ์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
์žฅ๋ฌธ ์ฒ˜๋ฆฌ๋ฅผ ์œ„ํ•œ ๊ฒ€์ƒ‰ ์ฆ๊ฐ• ๋ฐฉ์‹์œผ๋กœ ์ปจํ…์ŠคํŠธ ํ™•์žฅ ๋ฌธ์ œ๋ฅผ ๋‹ค๋ฅธ ๊ฐ๋„์—์„œ ํ•ด๊ฒฐํ•˜๋Š” ์—ฐ๊ตฌ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๊ฒ€์ƒ‰ ์ฆ๊ฐ• ์ƒ์„ฑ์˜ ๋‹ค๋ฅธ ์•„ํ‚คํ…์ฒ˜๋‚˜ ํ›ˆ๋ จ ์ „๋žต์„ ์ œ์•ˆํ•˜๋Š” ์—ฐ๊ตฌ์ด๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
RAG ์‹œ์Šคํ…œ์„ ํŠน์ • ์‘์šฉ ๋ถ„์•ผ์— ํ™•์žฅ ์ ์šฉํ•œ ์—ฐ๊ตฌ์ด๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
RAG ์‹œ์Šคํ…œ์˜ ํŠน์ • ๊ตฌ์„ฑ ์š”์†Œ๋‚˜ ๊ฐœ์„  ๋ฐฉ๋ฒ•์„ ์‹ฌํ™”ํ•˜์—ฌ ๋‹ค๋ฃจ๋Š” ์—ฐ๊ตฌ์ด๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
348์€ ์—์ด์ „ํ‹ฑ RAG ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์•ˆํ•˜๋ฉฐ, 034์˜ RAG-LLM ํ†ตํ•ฉ ๋…ผ์˜๋ฅผ agent ๊ด€์ ์—์„œ ์‹ฌํ™”์‹œ์ผœ์ค€๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
RAG ๊ธฐ๋ฐ˜ ๊ณผํ•™๋…ผ๋ฌธ Future Work ์ƒ์„ฑ ๋“ฑ ์‹ค์ œ RAG-LLM ํ†ตํ•ฉ์˜ ์‚ฌ๋ก€์—ฐ๊ตฌ๋กœ, 034์˜ ์„œ๋ฒ ์ด ๋‚ด์šฉ์„ ๊ตฌ์ฒด์  ์‘์šฉ์— ์—ฐ๊ฒฐํ•  ์ˆ˜ ์žˆ์Œ.
์‘์šฉ ์‚ฌ๋ก€
RAG ๊ธฐ๋ฐ˜ ๊ณผํ•™ QA ์‹œ์Šคํ…œ์œผ๋กœ, RA-LLMs ๊ธฐ์ˆ ์˜ ์‹ค์งˆ์  ๋ฌธํ—Œ ์‘์šฉ ์‚ฌ๋ก€๋ฅผ ๋ณด์—ฌ์ค€๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •