FRAG: A Flexible Modular Framework for Retrieval-Augmented Generation based on Knowledge Graphs

Authors: Zengyi Gao, Yukun Cao, Hairu Wang, Ao Ke, Yuan Feng | Date: 2025 | DOI: 10.48550/arXiv.2501.09957


Essence

Figure 1: Modular and Coupled KG-RAG Frameworks

Structural differences between the Modular and Coupled KG-RAG frameworks

This paper proposes the FRAG framework to resolve the trade-off between flexibility and retrieval quality in knowledge graph (KG) based retrieval-augmented generation (RAG) systems. FRAG automatically estimates the complexity of a query and applies a retrieval strategy tailored to simple or complex reasoning tasks. This improves retrieval quality while keeping the flexibility of a modular design, without fine-tuning the LLM.

Motivation

Achievement

Figure 2: Analysis of Semantic and Structural Information in Reasoning Paths

Analysis of the semantic and structural information in reasoning paths

  1. Flexibility and quality together: keeps the isolation, flexibility, and extensibility of a modular design without LLM fine-tuning, while a dynamic retrieval strategy improves retrieval quality
  2. Improved efficiency: needs no additional LLM calls or fine-tuning, which minimizes resource consumption and maximizes computational efficiency
  3. State-of-the-art performance: outperforms existing methods on benchmark datasets

How

Figure 3: Framework of FRAG

The three main modules of FRAG

1. Reasoning-Aware module (query complexity classification)

2. Flexible-Retrieval module (tailored retrieval pipeline)

3. Generation module
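The three modules listed above can be pictured as a small routing pipeline. The following is a minimal sketch under stated assumptions, not the authors' implementation: the toy triples, the cue-word classifier standing in for the Reasoning-Aware module, the hop limits (1 and 3), and every function name are hypothetical and chosen only for illustration. It shows the idea of classifying a query first, then choosing a retrieval depth, then handing the retrieved paths to an unmodified LLM as prompt text.

```python
from collections import deque

# Toy knowledge graph as (head, relation, tail) triples; purely illustrative.
TRIPLES = [
    ("Alice", "works_at", "Acme"),
    ("Acme", "located_in", "Berlin"),
    ("Berlin", "capital_of", "Germany"),
]


def build_adjacency(triples):
    """Index triples by head entity for fast neighbor lookup."""
    adj = {}
    for head, rel, tail in triples:
        adj.setdefault(head, []).append((rel, tail))
    return adj


def classify_query(query):
    """Hypothetical stand-in for the Reasoning-Aware module.

    A learned classifier would normally go here; this placeholder only
    counts a few cue words and uses the query text alone.
    """
    cues = ("where", "which", "whose", "that")
    hits = sum(query.lower().count(cue) for cue in cues)
    return "complex" if hits >= 2 else "simple"


def retrieve_paths(adj, start, max_hops):
    """Enumerate reasoning paths of length 1..max_hops starting at `start`."""
    paths = []
    queue = deque([(start, [])])
    while queue:
        node, path = queue.popleft()
        if len(path) == max_hops:
            continue  # hop budget exhausted for this path
        for rel, tail in adj.get(node, []):
            new_path = path + [(node, rel, tail)]
            paths.append(new_path)
            queue.append((tail, new_path))
    return paths


def flexible_retrieve(adj, query, topic_entity):
    """Pick a retrieval depth from the predicted complexity (assumed limits)."""
    complexity = classify_query(query)
    max_hops = 1 if complexity == "simple" else 3
    return complexity, retrieve_paths(adj, topic_entity, max_hops)


def build_prompt(query, paths):
    """Generation step: format paths as context for an off-the-shelf LLM."""
    lines = [" -> ".join(f"{h} [{r}] {t}" for h, r, t in p) for p in paths]
    return "Reasoning paths:\n" + "\n".join(lines) + f"\nQuestion: {query}"
```

Because the classifier, the retriever, and the prompt builder only exchange plain values, any one of them can be swapped out without touching the others, which is the kind of isolation the modular design is credited with.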

Originality

Limitation & Further Study

Limitations:

Future work:

Evaluation

Overall assessment: FRAG is a practical approach that addresses the performance limits of modular KG-RAG through query-based hop prediction and tailored retrieval pipelines. Improving retrieval quality without LLM fine-tuning is a meaningful contribution, but the granularity of the hop classification and the framework's adaptability across domains need deeper analysis.

Related papers worth reading

Foundational work
The paper "Retrieval-Augmented Generation for Large Language Models" systematically summarizes the core theory and technical trends of RAG systems, providing the design background for the FRAG framework.
Foundational work
The Factkg paper lays the theoretical groundwork for graph-based reasoning and fact verification in the context of RAG and KG-based verification.
Foundational work
Paper 348 covers a design framework for AI-based knowledge exploration systems, providing structural background for paper 517's LLM-based experiments on anthropology education and game generation.
Foundational work
The FRAG paper presents a retrieval-augmented generation framework, providing the basic structure essential to chart-to-code generation approaches.
Foundational work
Paper 348 surveys cases of automated knowledge discovery built on retrieval-augmented generation frameworks, shedding light on the general applicability of the spectral-map-based interpretation of RNA drug binding proposed in paper 3273.
Different approach
The Multimodal deepresearcher paper explores an augmented-generation-based academic system that fuses text and charts, allowing a performance comparison with FRAG's modular support for multiple data types.
Different approach
Comparing and analyzing various RAG-based leaderboard automation frameworks enables in-depth benchmarking and extension of the methodology.
Different approach
Paper 348 focuses on the design and evaluation of agentic RAG frameworks, so the structure applied in practice to political research in paper 063 can be recast as a different form of retrieval-augmented agent.
Different approach
FRAG is a framework that optimizes Knowledge Graph based RAG systems for queries across diverse scientific domains, and can be read in contrast with HypoChainer's workflow.
Follow-up work
"FRAG: A Flexible Modular Framework for Retrieval-Augmented Generation" presents a new framework that balances flexibility and quality in practical RAG systems, and so connects to application and extension cases in the RAG field.
Follow-up work
As a case of applying a framework that combines knowledge graphs and RAG to real knowledge exploration and generation, it extends the diversity of modular design in practice.
Follow-up work
Paper 348 proposes an agentic RAG framework and deepens paper 034's discussion of RAG-LLM integration from an agent perspective.
Follow-up work
The paper "Personalized graph-based retrieval for large language models" attempts personalization and quality improvement within graph-based RAG approaches, complementing and extending FRAG's modularization idea.
Follow-up work
Paper 348 focuses on the real-system implementation of Agentic RAG, extending paper 067's theoretical survey toward software architecture and engineering perspectives.
๋ฐ˜๋ก /๋น„ํŒ
Grounding fallacies misrepresenting scientific publications ๋…ผ๋ฌธ์€ RAG ๋ฐ KG ๊ธฐ๋ฐ˜ ์‹œ์Šคํ…œ์ด ๋ฐœ์ƒ์‹œํ‚ค๋Š” ์˜ค์ธ์˜ ์‹ค์ œ ์‚ฌ๋ก€์™€ ํ•œ๊ณ„, ํ’ˆ์งˆ ์ €ํ•˜ ๋ฌธ์ œ๋ฅผ ๋น„ํŒ์ ์œผ๋กœ ๋ถ„์„ํ•œ๋‹ค.