HypoChainer: A Collaborative System Combining LLMs and Knowledge Graphs for Hypothesis-Driven Scientific Discovery

Authors: Haoran Jiang, Shaohan Shi, Yunjie Yao, Chang Jiang, Quan Li | Date: 2025-07-23 | URL: https://arxiv.org/abs/2507.17209


Essence

Figure 1

Fig. 1: In the presented case study, the biologist's analytical workflow unfolds as follows: 1 Upload Drug Repurposing d

This paper proposes HypoChainer, a visualization framework that integrates collaboration among LLMs, knowledge graphs, and human experts. It supports scientific discovery through a three-stage workflow: RAG-based exploration, hypothesis chain construction, and validation prioritization.
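The three stages named above can be illustrated with a toy sketch. This is not the authors' implementation: the triples, the function names (`retrieve`, `build_chains`, `prioritize`), and the shortest-chain-first ranking are all assumptions made for illustration, and the LLM call that would consume the retrieved context is left out entirely.

```python
from collections import defaultdict

# Toy knowledge graph as (head, relation, tail) triples.
# Every entity and relation here is made up for illustration.
TRIPLES = [
    ("drug_A", "inhibits", "gene_X"),
    ("gene_X", "regulates", "pathway_P"),
    ("pathway_P", "implicated_in", "disease_D"),
    ("drug_B", "binds", "gene_Y"),
    ("gene_Y", "implicated_in", "disease_D"),
]


def retrieve(triples, entity):
    """Stage 1 (retrieval half of RAG): collect triples that mention the entity.

    In a real system these triples would be passed to an LLM as context;
    that step is omitted here.
    """
    return [t for t in triples if entity in (t[0], t[2])]


def build_chains(triples, source, target, max_hops=3):
    """Stage 2: enumerate directed paths source -> target as candidate chains."""
    outgoing = defaultdict(list)
    for head, rel, tail in triples:
        outgoing[head].append((rel, tail))

    chains = []

    def walk(node, path, visited):
        if node == target and path:
            chains.append(list(path))
            return
        if len(path) == max_hops:
            return
        for rel, nxt in outgoing[node]:
            if nxt not in visited:
                path.append((node, rel, nxt))
                walk(nxt, path, visited | {nxt})
                path.pop()

    walk(source, [], {source})
    return chains


def prioritize(chains):
    """Stage 3: rank candidates, shortest chain first (a placeholder heuristic)."""
    return sorted(chains, key=len)


if __name__ == "__main__":
    candidates = []
    for drug in ("drug_A", "drug_B"):
        candidates.extend(build_chains(TRIPLES, drug, "disease_D"))
    for chain in prioritize(candidates):
        print(" -> ".join([chain[0][0]] + [step[2] for step in chain]))
```

In this sketch a human expert would sit between stages, pruning or extending chains before they are ranked; the ranking rule is only a stand-in for whatever prioritization criteria the system actually applies.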

Motivation

Achievement

How

Figure 2

Fig. 2: Comparison of Traditional Practice A and HypoChainer Pipeline

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 | Technical Soundness: 4/5 | Significance: 4/5 | Clarity: 4/5 | Overall: 4/5

Overall assessment: This study presents a practical framework for reshaping the scientific discovery process. It mitigates LLM hallucination with a knowledge graph and structures human-AI collaboration through visual analytics. However, quantitative evaluation is lacking and the case studies cover a limited domain, so its generalizability still needs to be verified.

Related papers worth reading

Foundational work
Hypothesis Generation with Large Language Models provides the foundation for the LLM-based hypothesis generation that HypoChainer uses to support scientific discovery.
Alternative approach
HypoChainer supports scientific discovery by combining charts, knowledge graphs, LLMs, and expert collaboration, which represents an alternative development path to high-performance chart understanding models such as ChartLlama.
Alternative approach
FRAG is a framework that optimizes knowledge-graph-based RAG systems for queries across diverse scientific domains; it can be read in contrast with HypoChainer's workflow.
Alternative approach
HypoChainer is a scientific hypothesis generation system built on collaboration between LLMs and knowledge graphs, and its idea-combination methodology is an alternative to Chimera's approach.
Follow-up work
Knowledge graph construction that combines LLMs and KGs, as in Graphusion, offers a technical extension to HypoChainer's RAG-based exploration and collaborative hypothesis validation stages.
Follow-up work
HypoChainer is a collaborative scientific idea generation system based on combining LLMs and knowledge graphs, and it takes the approach of KG-CoI a step further.
Follow-up work
HypoChainer is a combined LLM/KG framework whose structure deepens AgentRxiv's goal of cumulative discovery and hypothesis evolution across multiple AI agents.
Application
Interesting Scientific Idea Generation also combines LLMs and knowledge graphs to support scientific idea generation, so it connects directly to HypoChainer's collaborative exploration and hypothesis-chain method.