Hallucination mitigation using agentic ai natural language-based frameworks

์ €์ž: Diego Gosmar, Deborah A. Dahl | ๋‚ ์งœ: 2025 | DOI: N/A 📄 PDF


Essence

Figure 2

Figure 2: THS results over 310 prompts with 3 agents

๋ณธ ๋…ผ๋ฌธ์€ LLM์˜ hallucination ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด multi-agent orchestration ์ ‘๊ทผ๋ฒ•์„ ์ œ์‹œํ•œ๋‹ค. OVON framework ๊ธฐ๋ฐ˜ NLP ์ธํ„ฐํŽ˜์ด์Šค๋ฅผ ํ†ตํ•ด ์—ฌ๋Ÿฌ specialized agent๋“ค์ด ํ˜‘๋ ฅํ•˜์—ฌ hallucination์„ ๋‹จ๊ณ„์ ์œผ๋กœ ํƒ์ง€ํ•˜๊ณ  ์™„ํ™”ํ•˜๋Š” ์‹œ์Šคํ…œ์„ ๊ตฌํ˜„ํ•˜๊ณ , ์ƒˆ๋กœ์šด KPI๋“ค์„ ๋„์ž…ํ•˜์—ฌ hallucination mitigation ํšจ๊ณผ๋ฅผ ์ •๋Ÿ‰ํ™”ํ•œ๋‹ค.

Motivation

Achievement

Figure 2

Figure 2: THS results over 310 prompts with 3 agents

Multi-agent ํŒŒ์ดํ”„๋ผ์ธ์˜ hallucination ์™„ํ™” ํšจ๊ณผ: 310๊ฐœ prompts์— ๋Œ€ํ•ด 3๊ฐœ agent๋กœ ๊ตฌ์„ฑ๋œ ํŒŒ์ดํ”„๋ผ์ธ์ด progressive hallucination score reduction ๋‹ฌ์„ฑ. Novel KPI ์ฒด๊ณ„ ๊ฐœ๋ฐœ: Factual Claim Density, Factual Grounding References, Fictional Disclaimer Frequency, Explicit Contextualization Score ๋“ฑ 4๊ฐœ ์ƒˆ๋กœ์šด ์ง€ํ‘œ๋กœ hallucination ์ˆ˜์ค€์„ ์ •๋Ÿ‰ํ™”. OVON ๊ธฐ๋ฐ˜ inter-agent communication ํšจ๊ณผ ์ž…์ฆ: Structured JSON message ๊ธฐ๋ฐ˜ agent ์ƒํ˜ธ์ž‘์šฉ์ด context ๋ณด์กด ๋ฐ transparency ํ–ฅ์ƒ์„ ํ†ตํ•ด system ์‹ ๋ขฐ์„ฑ ์ฆ์ง„. AI explainability ๊ฐœ์„ : Speculative content์˜ ๋ช…ํ™•ํ•œ ๊ตฌ๋ถ„๊ณผ explicit disclaimers ์ถ”๊ฐ€๋กœ AI ์ƒ์„ฑ ์‘๋‹ต์˜ ํ•ด์„ ๊ฐ€๋Šฅ์„ฑ ํ–ฅ์ƒ.

How

Figure 2

Figure 2: THS results over 310 prompts with 3 agents

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 3/5 Overall: 3/5

์ดํ‰: ๋ณธ ๋…ผ๋ฌธ์€ multi-agent orchestration์„ ํ†ตํ•œ hallucination mitigation์˜ ์‹ค์งˆ์  ๊ฐ€๋Šฅ์„ฑ์„ ๋ณด์—ฌ์ฃผ๋Š” ์˜๋ฏธ ์žˆ๋Š” empirical study์ด๋ฉฐ, OVON ๊ธฐ๋ฐ˜ structured communication๊ณผ novel KPI ์ฒด๊ณ„๋Š” AI reliability ํ–ฅ์ƒ์— ๊ธฐ์—ฌํ•  ์ˆ˜ ์žˆ๋‹ค. ๋‹ค๋งŒ ์ œํ•œ๋œ LLM ๋ฒ”์œ„, prompt ๋Œ€ํ‘œ์„ฑ์˜ ๋ถˆ๋ช…ํ™•์„ฑ, KPI ํƒ€๋‹น์„ฑ ๊ฒ€์ฆ ๋ถ€์กฑ, ๊ทธ๋ฆฌ๊ณ  underlying LLM์˜ black-box ํ•œ๊ณ„์— ๋Œ€ํ•œ ํ•ด๊ฒฐ์ฑ… ๋ถ€์žฌ๋กœ ์ธํ•ด ๋ฐฉ๋ฒ•๋ก ์˜ ์—„๋ฐ€์„ฑ๊ณผ ๊ฒฐ๊ณผ์˜ ์ผ๋ฐ˜ํ™” ๊ฐ€๋Šฅ์„ฑ์ด ์ œํ•œ๋œ๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
295 ๋…ผ๋ฌธ์—์„œ ์ œ์‹œ๋œ ๋ฉ€ํ‹ฐ์—์ด์ „ํŠธ ๊ธฐ๋ฐ˜ ๋™์  ์˜ค์ผ€์ŠคํŠธ๋ ˆ์ด์…˜์€, 396์˜ agentic AI๋ฅผ ์ด์šฉํ•œ ํ™˜๊ฐ ์™„ํ™” ํ”„๋ ˆ์ž„์›Œํฌ์˜ ๋ฉ”์‹ ์ € ๊ด€๋ฆฌ ์ „๋žต์— ์ง์ ‘์  ์ด๋ก ์  ๊ทผ๊ฑฐ๋ฅผ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
736์˜ SciTrust ๋…ผ๋ฌธ์€ LLM ํ™˜๊ฐ ๊ฒ€์ฆ ๋ฐ ์™„ํ™” ํ‰๊ฐ€ ์ง€ํ‘œ์™€ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ •๋ฆฝํ•ด, 396์ด ์ œ์•ˆํ•œ ์—์ด์ „ํŠธ ์กฐ์œจ ๋ฐฉ์‹์˜ ์‹ ๋ขฐ์„ฑ ํ‰๊ฐ€์˜ ํ•ต์‹ฌ ๊ธฐ๋ฐ˜์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
Agentic AI์˜ ๋ฐœ์ „๊ณผ ํ•œ๊ณ„์— ๋Œ€ํ•œ ๊ด‘๋ฒ”์œ„ํ•œ ์„œ๋ฒ ์ด๋กœ, ํ™˜๊ฐ ๋ฌธ์ œ ์™„ํ™” ๋ฐ ์‹ ๋ขฐ์„ฑ ํ”„๋ ˆ์ž„์›Œํฌ ๋…ผ์˜์˜ ๋ฐฐ๊ฒฝ์„ ์ œ๊ณตํ•œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
747 ๋…ผ๋ฌธ์€ LLM์˜ step-by-step ํ™˜๊ฐ ์ž๊ธฐ์ ๊ฒ€ ๋ฐฉ์‹์„ ํ™œ์šฉํ•˜์—ฌ, 396์˜ ๋‹ค์ค‘ ์—์ด์ „ํŠธ ์กฐ์ •์— ์˜์กดํ•˜์ง€ ์•Š๋Š” ๋‹ค๋ฅธ ํ™˜๊ฐ ์™„ํ™” ๋ฐ ์ง„๋‹จ ์ „๋žต์„ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
610๋ฒˆ ๋…ผ๋ฌธ์€ VLM์˜ hallucination ์ˆ˜์ • ๊ธฐ๋ฒ•์„ ๋‹ค๋ฃจ๋ฉฐ, 396๋ฒˆ ๋…ผ๋ฌธ์—์„œ ๋‹ค์–‘ํ•œ hallucination ์™„ํ™” ์ „๋žต ๋น„๊ต์— ์ ํ•ฉํ•œ ๋Œ€์•ˆ ์ ‘๊ทผ๋ฒ•์„ ์ œ์‹œํ•œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋‹ค๊ตญ์–ด LLM์˜ ํ™˜๊ฐ ํƒ์ง€ ๋ฐ ํ‰๊ฐ€๋ฅผ ์œ„ํ•œ ๋‹ค๋ฅธ ๋ฒค์น˜๋งˆํฌ๋ฅผ ์ œ์‹œํ•˜๋Š” ์—ฐ๊ตฌ์ด๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
396 ๋…ผ๋ฌธ์€ ์—์ด์ „ํŠธ ๊ธฐ๋ฐ˜ ํ™˜๊ฐ ๊ฐ์†Œ ๋ฐฉ๋ฒ•์„ ์ œ์•ˆํ•˜์—ฌ 021 ๋…ผ๋ฌธ์—์„œ ์–ธ๊ธ‰๋œ ๋ฌธ์ œ์— ๋Œ€ํ•œ ์‹ค์งˆ์ ์ธ ์†”๋ฃจ์…˜์„ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
396 ๋…ผ๋ฌธ์€ ๋‹ค์ค‘ ์—์ด์ „ํŠธ ๋ฐฉ๋ฒ•์œผ๋กœ LLM ํ™˜๊ฐ(hallucination) ์™„ํ™” ๋ฐฉ์•ˆ์„ ์ œ์‹œํ•˜์—ฌ, 736์˜ ์‹ ๋ขฐ์„ฑ ํ‰๊ฐ€์—์„œ ์ œ์‹œํ•œ ํ™˜๊ฐ ์ฒ˜๋ฆฌ์˜ ์‹ค์งˆ์  ๋Œ€์‘ ๋ฐฉ๋ฒ•์„ ๋ณด์™„ํ•ด์ค๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
์—์ด์ „ํŠธ ๊ธฐ๋ฐ˜ ์ž์—ฐ์–ด ์ฒ˜๋ฆฌ๋กœ ํ—›์†Œ๋ฆฌ(hallucination)๋ฅผ ์ค„์ด๋ ค๋Š” ์ ‘๊ทผ์€ CiteCheck์˜ ์ธ์šฉ ๋งฅ๋ฝ ์˜ค๋ฅ˜ ํƒ์ง€ ํ”„๋ ˆ์ž„์›Œํฌ์™€ ์œ ์‚ฌํ•œ ๊ฐœ์„  ๋ฐฉํ–ฅ์„ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
396๋ฒˆ ๋…ผ๋ฌธ์˜ ํ™˜๊ฐ ์™„ํ™” ๋ฐ ์‹ ๋ขฐ์„ฑ ํ–ฅ์ƒ ํ”„๋ ˆ์ž„์›Œํฌ๋Š” 493๋ฒˆ LitLLM์—์„œ ๊ณผํ•™ ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ ์ƒ์„ฑ ์‹œ ํ™˜๊ฐ์„ ์ค„์ด๊ธฐ ์œ„ํ•œ ๊ฒ€์ƒ‰-์ฆ๊ฐ• ์ƒ์„ฑ pipeline์— ํ™œ์šฉ๋ฉ๋‹ˆ๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
851๋ฒˆ ๋…ผ๋ฌธ์€ ์‹คํ—˜์‹ค ์›Œํฌํ”Œ๋กœ์šฐ ์ตœ์ ํ™”์— ์—์ด์ „ํ‹ฑ AI๋ฅผ ํ™œ์šฉํ•˜๋ฉฐ, 396๋ฒˆ์˜ multi-agent orchestrated communication ๊ธฐ๋ฒ•์ด ์‹ค์ œ ์‘์šฉ ์‚ฌ๋ก€๋กœ ๊ตฌํ˜„๋œ๋‹ค.
๋ฐ˜๋ก /๋น„ํŒ
SFT ๊ธฐ๋ฐ˜ LLM์ด ๋™์ผํ•œ ํ™˜๊ฐ ์–‘์ƒ์„ ๋ณด์ด๋Š” ํ˜„์ƒ์„ ๋‚ด์„ธ์›Œ, ์—์ด์ „ํŠธ ์กฐ์œจ๋งŒ์œผ๋กœ๋Š” ํ•œ๊ณ„๊ฐ€ ์žˆ์Œ์„ ์‹œ์‚ฌํ•œ๋‹ค.
๋ฐ˜๋ก /๋น„ํŒ
Hallucination mitigation ๋…ผ๋ฌธ์€ ํ™˜๊ฐ ์ค„์ด๊ธฐ๋ฅผ ๋ชฉํ‘œ๋กœ ํ•˜์—ฌ, ์˜๋„์  ํ™˜๊ฐ ํ™œ์šฉ ๊ฐ€๋Šฅ์„ฑ๊ณผ ํ•œ๊ณ„๋ฅผ ๋Œ€์กฐ์ ์œผ๋กœ ๋ณด์—ฌ์ค€๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •