Quantifying large language model usage in scientific papers

Authors: Weixin Liang, Yaohui Zhang, Zhengxuan Wu, Haley Lepp, Wenlong Ji, Xuandong Zhao, Hancheng Cao, Sheng Liu, Siyu He, Zhi Huang, Diyi Yang, Christopher Potts, Christopher D. Manning, James Zou | Date: 2025-08-04 | DOI: 10.1038/s41562-025-02273-8


Essence

Figure 1

Fig. 1 | Estimated fraction of LLM-modified sentences across research paper

This study analyzes roughly 1.12 million scholarly papers published between January 2020 and September 2024 and quantifies the rate of large language model (LLM) usage with a population-level framework based on shifts in word frequency.
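To make the idea of a population-level estimate from word-frequency shifts concrete, here is a minimal Python sketch. It assumes that each sentence in a corpus is drawn from a two-component mixture, (1 - alpha) * P_human + alpha * P_llm, and that alpha can be recovered by maximum likelihood over the whole corpus without classifying any single sentence. The review does not spell out the estimator, so this is an illustration under those assumptions, not the authors' implementation. The function names, the toy vocabulary, and every probability below are hypothetical.

```python
import math
import random

# Hypothetical per-sentence word occurrence probabilities (illustrative values only).
P_HUMAN = {"delve": 0.01, "intricate": 0.02, "show": 0.30}
P_LLM = {"delve": 0.20, "intricate": 0.25, "show": 0.10}


def sentence_loglik(tokens, occ_prob):
    """Log-likelihood of one sentence, treating each vocabulary word as an
    independent present/absent event with the given occurrence probability."""
    present = set(tokens)
    return sum(
        math.log(p if word in present else 1.0 - p)
        for word, p in occ_prob.items()
    )


def estimate_alpha(sentences, p_human, p_llm, steps=200):
    """Grid-search maximum-likelihood estimate of the mixture weight alpha
    in (1 - alpha) * P_human + alpha * P_llm."""
    pairs = [
        (sentence_loglik(s, p_human), sentence_loglik(s, p_llm))
        for s in sentences
    ]
    best_alpha, best_ll = 0.0, -math.inf
    for k in range(steps + 1):
        alpha = k / steps
        total = 0.0
        for log_h, log_q in pairs:
            # Log of the mixture likelihood, shifted by the max for stability.
            m = max(log_h, log_q)
            mix = (1 - alpha) * math.exp(log_h - m) + alpha * math.exp(log_q - m)
            total += m + math.log(mix)
        if total > best_ll:
            best_alpha, best_ll = alpha, total
    return best_alpha


def simulate_sentences(n, true_alpha, seed=0):
    """Draw synthetic sentences: each one comes from the LLM word model with
    probability true_alpha, otherwise from the human word model."""
    rng = random.Random(seed)
    sentences = []
    for _ in range(n):
        occ = P_LLM if rng.random() < true_alpha else P_HUMAN
        sentences.append([w for w, p in occ.items() if rng.random() < p])
    return sentences


if __name__ == "__main__":
    data = simulate_sentences(8000, true_alpha=0.3, seed=0)
    print(f"estimated alpha: {estimate_alpha(data, P_HUMAN, P_LLM):.3f}")
```

The point of the design is that the estimate is a property of the corpus, not of any one document: a single sentence containing "delve" says little, but a corpus-wide excess of such words pins down alpha fairly tightly. In practice the two word distributions would have to be fitted from reference human-written and LLM-generated text, which this sketch replaces with made-up numbers.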

Motivation

Achievement

How

Figure 2

Fig. 2 | Fine-grained validation of estimation accuracy under temporal

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 | Technical Soundness: 3/5 | Significance: 4/5 | Clarity: 4/5 | Overall: 4/5

Overall assessment: This is an important study that quantifies, for the first time, the actual scale of LLM use in scholarly publishing, and its population-level statistical methodology overcomes the limits of detecting LLM use in individual documents. By identifying how adoption differs across fields and how it correlates with author characteristics, it can help explain structural changes in scholarly communication. An in-depth analysis of how LLM use affects the actual quality and reliability of papers remains future work.

Related papers worth reading

Foundational work
Provides the methodological foundation for analyzing LLM usage patterns in scholarly papers.
Foundational work
Serves as a background study that systematically analyzes how LLMs are currently used in scientific research.
Alternative approach
A related study that presents a different approach to detecting topic structure in the literature.
Alternative approach
Studies a similar methodology that applies transfer-learning-based pretrained models to biomedical NLP tasks.
Alternative approach
Measures the impact of AI tools on scientific research using a different data-analysis method.
Alternative approach
Presents real cases of using LLMs in scientific research, complementing the analysis of LLM usage.
Alternative approach
Presents a different sequence-generation methodology for antibody design.
Alternative approach
Quantifies LLM use in academia with a different methodology.
Follow-up work
Applies the LLM-usage quantification methodology to a specific academic context.
Follow-up work
Provides a related methodology for detecting the spread of AI-generated text in scientific papers.
Follow-up work
Directly extends the methodology for quantifying LLM use in scholarly papers, or addresses the same problem.
Counterargument/Critique
Quantifies the use of AI in generating scientific papers, providing real-world usage context for automated hypothesis-generation systems.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •