MIRAI: Prediction and Generation of High-Impact Academic Research

์ €์ž: Alex Li, Joseph Jacobson | ๋‚ ์งœ: 2026-06-03 | URL: https://arxiv.org/abs/2606.05443 📄 PDF


Essence

Figure 2

Figure 2: Impact prediction model architecture. The title and abstract are encoded by a frozen text

MIRAI๋Š” ๋…ผ๋ฌธ์˜ ์ œ๋ชฉ, ์ดˆ๋ก, ์ถœํŒ ๋‚ ์งœ๋งŒ์„ ์‚ฌ์šฉํ•˜์—ฌ deep learning framework๋กœ 5๋…„ ํ›„ ๋…ผ๋ฌธ ์˜ํ–ฅ๋ ฅ์„ ์˜ˆ์ธกํ•˜๋Š” ํ”„๋ ˆ์ž„์›Œํฌ์ด๋‹ค. arXiv ํ•™์ˆ  ๊ทธ๋ž˜ํ”„์—์„œ PageRank์™€ citation counts๋ฅผ ์˜ˆ์ธกํ•˜๋ฉฐ, 2021๋…„ ์ถœํŒ ๋…ผ๋ฌธ์— ๋Œ€ํ•ด PageRank ์˜ˆ์ธก์—์„œ Spearman's ฯ 0.4686, citation ์˜ˆ์ธก์—์„œ 0.6192๋ฅผ ๋‹ฌ์„ฑํ•œ๋‹ค.

Motivation

Achievement

Figure 3

Figure 3: Performance as measuerd by Spearmanโ€™s ฯ for both impact targets across different test

Dataset: ์•ฝ 300๋งŒ ๊ฐœ arXiv ๋…ผ๋ฌธ์˜ ์ €์ž, citation, network-based impact label(citation count, PageRank) ํฌํ•จ ๋ฐ์ดํ„ฐ์…‹ ๊ตฌ์ถ•. Impact prediction: Publication time ์ •๋ณด๋งŒ์œผ๋กœ 5๋…„ citation ์˜ˆ์ธก Spearman's ฯ 0.62, PageRank ์˜ˆ์ธก 0.47 ๋‹ฌ์„ฑ. Research generation: Impact prediction framework๋ฅผ ํ™œ์šฉํ•œ research ideation pipeline ์ œ์•ˆ์œผ๋กœ LLM judge๊ฐ€ 4:3 ๋น„์œจ๋กœ baseline ๋Œ€๋น„ ๋” ๋†’์€ ์˜ํ–ฅ๋ ฅ ํŒ์ •. Public release: 5๋…„ citation prediction model์„ https://predict-paper-impact.vercel.app์— ๊ณต๊ฐœ.

How

Figure 5

Figure 5: Research ideation pipeline. Highlighted (blue) stages depend on the pipeline variant, which

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 4/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ๋ณธ ๋…ผ๋ฌธ์€ ๊ณผํ•™ ๋ฌธํ—Œ์˜ ๊ธ‰์†ํ•œ ์ฆ๊ฐ€๋ผ๋Š” ์‹œ๊ธ‰ํ•œ ๋ฌธ์ œ์— ๋Œ€์‘ํ•˜์—ฌ publication time์—์„œ๋งŒ content ๊ธฐ๋ฐ˜์œผ๋กœ ๋…ผ๋ฌธ ์˜ํ–ฅ๋ ฅ์„ ์˜ˆ์ธกํ•˜๋Š” MIRAI framework๋ฅผ ์ œ์•ˆํ•œ๋‹ค. Deep text embedding์„ ํ™œ์šฉํ•œ scalableํ•˜๊ณ  ๊ณต์ •ํ•œ ์ ‘๊ทผ๋ฒ•๊ณผ large-scale dataset, ๊ทธ๋ฆฌ๊ณ  research generation์œผ๋กœ์˜ ํ™•์žฅ์€ ์˜๋ฏธ ์žˆ๋Š” ๊ธฐ์—ฌ์ด๋‹ค. ๋‹ค๋งŒ domain ์ผ๋ฐ˜ํ™” ์ œํ•œ, ํ‰๊ฐ€ ๋ฐฉ๋ฒ•๋ก (LLM judge๋งŒ ์‚ฌ์šฉ), PageRank ์˜ˆ์ธก ์„ฑ๋Šฅ, research idea ์ƒ์„ฑ์˜ ์‹ค์ œ ์˜ํ–ฅ๋ ฅ ๊ฒ€์ฆ ๋“ฑ์—์„œ ๊ฐœ์„ ์˜ ์—ฌ์ง€๊ฐ€ ์žˆ๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๋‹ค๋ฅธ ์ ‘๊ทผ
Liveideabench๋Š” ๋…ผ๋ฌธ ์•„์ด๋””์–ด์™€ ์˜ํ–ฅ๋ ฅ ์˜ˆ์ธก์˜ ๋Œ€์•ˆ์  ๋ฒค์น˜๋งˆํฌ์ด์ž ํ‰๊ฐ€ ํ”„๋ ˆ์ž„์›Œํฌ๋กœ, MIRAI์™€ ์ ‘๊ทผ๊ณผ ํ‰๊ฐ€์—์„œ ์ฐจ๋ณ„์„ฑ์„ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
์ž๋™ํ™”๋œ ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ ์ž๋™ํ™” ๋ฐ ์˜ํ–ฅ๋ ฅ ํ‰๊ฐ€์— SurveyX๊ฐ€ ๋‹ค์–‘ํ•œ ์ž๋™ํ™” ์š”์•ฝ ๋ฐ ํ‰๊ฐ€ ์‚ฌ๋ก€๋ฅผ ์ œ๊ณตํ•˜์—ฌ ๋ณด์™„์ ์ธ ์‹œ๊ฐ์„ ์ค๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
3388 ๋…ผ๋ฌธ์€ ์ž„ํŒฉํŠธ๊ฐ€ ๋†’์€ ์—ฐ๊ตฌ ์•„์ด๋””์–ด ์˜ˆ์ธก ๋ฐ ์ƒ์„ฑ, ํ‰๊ฐ€์— LLM๊ณผ RAG ๊ธฐ๋ฐ˜ ๋ฐฉ๋ฒ•์„ ์จ์„œ ์œ ์‚ฌํ•œ LLM-RAG ๊ณผํ•™ ์‘์šฉ์˜ ๋Œ€์•ˆ์„ ๋ณด์—ฌ์คŒ.
๋‹ค๋ฅธ ์ ‘๊ทผ
3388์€ LLM ๊ธฐ๋ฐ˜ ๋ฏธ๋ž˜ ์—ฐ๊ตฌ ์ฃผ์ œ ์˜ˆ์ธก ๋ฐ ๋…ผ๋ฌธ์˜ ์ž„ํŒฉํŠธ ํ‰๊ฐ€ ๋ฐฉ๋ฒ•์„ ์†Œ๊ฐœํ•˜์—ฌ, 3212๊ฐ€ ์ œ์•ˆํ•˜๋Š” ์ƒˆ๋กœ์šด ์—ฐ๊ตฌ ๋ฐฉํ–ฅ ์˜ˆ์ธก ์‹œ์Šคํ…œ๊ณผ ๋ฐฉ๋ฒ•๋ก  ๋ฉด์—์„œ ๋น„๊ต๋ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
MIRAI๋Š” ๋…ผ๋ฌธ ํ…์ŠคํŠธ ๊ธฐ๋ฐ˜์œผ๋กœ ๋ฏธ๋ž˜ ์˜ํ–ฅ๋ ฅ์„ ์˜ˆ์ธกํ•˜๋Š” ๋ฐฉ๋ฒ•๋ก ์„ ์ œ์‹œํ•˜์—ฌ LLM Metrics์˜ parametric memory ๊ธฐ๋ฐ˜ ์˜ํ–ฅ๋ ฅ ์ธก์ •์— ๋Œ€ํ•œ ๋Œ€์•ˆ์  ๊ด€์ ์„ ์ œ์‹œํ•œ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
Predicting the future of ai with ai ๋…ผ๋ฌธ์€ AI๋กœ ๋…ผ๋ฌธ ์ž„ํŒฉํŠธ/๋งํฌ ์˜ˆ์ธก์„ ์‹œ๋„ํ•œ๋‹ค๋Š” ์ ์—์„œ MIRAI์˜ ์ ‘๊ทผ๋ฐฉ๋ฒ•๊ณผ ์—ฐ๊ตฌ ๋ฐฉํ–ฅ์„ ํ™•์žฅํ•ฉ๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
PaperQA ๋…ผ๋ฌธ์€ RAG ๊ธฐ๋ฐ˜ ๋…ผ๋ฌธ ์งˆ์˜์‘๋‹ต ์‹œ์Šคํ…œ์„ ํ†ตํ•ด ๋…ผ๋ฌธ ๋‚ด์šฉ์˜ ์˜๋ฏธ์  ์ดํ•ด ๋ฐ ์˜ํ–ฅ๋ ฅ ํŒ๋‹จ์œผ๋กœ MIRAI์˜ ์ž๋™ ์˜ํ–ฅ๋ ฅ ์˜ˆ์ธก ๊ธฐ๋Šฅ์„ ์‹ค์ œ ์‘์šฉ์œผ๋กœ ํ™•์žฅํ•œ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
3388์€ AI์— ์˜ํ•œ ์˜ํ–ฅ๋ ฅ ๋†’์€ ์—ฐ๊ตฌ ์„ฑ๊ณผ ์˜ˆ์ธก/์ถ”์ฒœ ๋ชจ๋ธ๋กœ ํ˜์‹ ์  ๋ฐœ๊ฒฌ์˜ ์ •๋Ÿ‰์  disruptive index์™€ ์—ฐ๊ณ„๋ฉ๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
MIRAI ๋…ผ๋ฌธ์€ ๊ณ ์ž„ํŒฉํŠธ ๋…ผ๋ฌธ/์—ฐ๊ตฌ ์„ฑ๊ณผ ์˜ˆ์ธก์˜ ์ƒ์„ฑ ๋ฐ ํƒ€๋‹น์„ฑ ๋ถ„์„ ๊ธฐ์ˆ ์„ ๋ฐœ์ „์‹œ์ผœ, ์•„์ด๋””์–ด ์‹คํ—˜ ์„ฑ๊ณต ์˜ˆ์ธก ๋ฌธ์ œ์— ๋‹ค์–‘ํ•œ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ๊ณตํ•œ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
MIRAI์˜ ๋…ผ๋ฌธ ์˜ํ–ฅ๋ ฅ ์˜ˆ์ธก ํ”„๋ ˆ์ž„์›Œํฌ๋Š” LLM-Metrics์˜ LLM ๊ธฐ๋ฐ˜ ์ž„ํŒฉํŠธ ์ธก์ • ์ง€ํ‘œ์˜ ์ •๋Ÿ‰์  ์˜ˆ์ธก ๋ถ€๋ถ„์„ ์‹ค์ œ๋กœ ๊ตฌํ˜„ํ•ด์ค๋‹ˆ๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •