Assessing the impact of Open Research Information Infrastructures using NLP driven full-text Scientometrics: A case study of the LXCat open-access platform

์ €์ž: Kalp Pandya, Khushi Shah, Nirmal Shah, N. Shah, Bhaskar Chaudhury | ๋‚ ์งœ: 2026 | DOI: 10.48550/arXiv.2602.07664 📄 PDF


Essence

Figure 1

Figure 1: Overview of the data processing pipeline used to assess the scholarly impact

LXCat ์˜คํ”ˆ์•ก์„ธ์Šค ํ”Œ๋žซํผ์˜ ์ €์˜จ ํ”Œ๋ผ์ฆˆ๋งˆ ์—ฐ๊ตฌ ์ปค๋ฎค๋‹ˆํ‹ฐ์— ๋Œ€ํ•œ ์˜ํ–ฅ์„ NLP ๊ธฐ๋ฐ˜ ์ „๋ฌธ ํ…์ŠคํŠธ scientometrics๋กœ ์ฒด๊ณ„์ ์œผ๋กœ ์ •๋Ÿ‰ํ™”ํ•œ ์—ฐ๊ตฌ์ด๋‹ค. ์ธ์šฉ ์ˆ˜๋ฅผ ๋„˜์–ด ๋ฐ์ดํ„ฐ ์‚ฌ์šฉ ํŒจํ„ด, ํ™”ํ•™ ๋ฌผ์งˆ, ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ํ™œ์šฉ๋„, ์ฃผ์ œ ์ง„ํ™” ๋“ฑ์„ ์ถ”์ถœํ•˜๋Š” ๋„๋ฉ”์ธ ์ค‘๋ฆฝ์ ์ด๊ณ  ์ด์ „ ๊ฐ€๋Šฅํ•œ ํ‰๊ฐ€ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์‹œํ•œ๋‹ค.

Motivation

Achievement

Figure 2

Figure 2:

How

Figure 3

Figure 3: Overview of the database mention extraction pipeline. Starting from plain

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 4/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ๋ณธ ์—ฐ๊ตฌ๋Š” ์ธ์šฉ ๊ธฐ๋ฐ˜ scientometrics์˜ ํ•œ๊ณ„๋ฅผ NLP ๊ธฐ๋ฐ˜ full-text ๋ถ„์„์œผ๋กœ ๊ทน๋ณตํ•œ ์„ ๋„์  ์‚ฌ๋ก€๋กœ, ORI ์ธํ”„๋ผ์˜ ์‹ค์งˆ์  ์˜ํ–ฅ์„ ์ฒด๊ณ„์ ์œผ๋กœ ์ •๋Ÿ‰ํ™”ํ•˜๋Š” ๋„๋ฉ”์ธ ์ค‘๋ฆฝ์  ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์‹œํ•œ๋‹ค. ์˜คํ”ˆ์†Œ์Šค ๊ณต๊ฐœ์™€ ๋†’์€ ์ด์ „ ๊ฐ€๋Šฅ์„ฑ์œผ๋กœ ํ–ฅํ›„ ์˜คํ”ˆ ์‚ฌ์ด์–ธ์Šค ์ •์ฑ… ์ˆ˜๋ฆฝ ๋ฐ ์ธํ”„๋ผ ํ‰๊ฐ€์— ์‹ค์งˆ์  ๊ธฐ์—ฌํ•  ๊ฒƒ์œผ๋กœ ๊ธฐ๋Œ€๋œ๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๋‹ค๋ฅธ ์ ‘๊ทผ
์˜คํ”ˆ์•ก์„ธ์Šค ํ”Œ๋žซํผ์˜ ์—ฐ๊ตฌ ์ปค๋ฎค๋‹ˆํ‹ฐ์— ๋Œ€ํ•œ ์˜ํ–ฅ์„ ๋‹ค๋ฅธ ๋ถ„์„ ๋ฐฉ๋ฒ•์œผ๋กœ ํ‰๊ฐ€ํ•œ ์—ฐ๊ตฌ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
ํŠน์ • ์—ฐ๊ตฌ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค๋‚˜ ํ”Œ๋žซํผ์ด ์—ฐ๊ตฌ ์ปค๋ฎค๋‹ˆํ‹ฐ์— ๋ฏธ์น˜๋Š” ์˜ํ–ฅ์„ ์ฒด๊ณ„์ ์œผ๋กœ ๋ถ„์„ํ•œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๊ณผํ•™ ๋ฐ์ดํ„ฐ ๊ณต์œ  ์ธํ”„๋ผ์˜ ์˜ํ–ฅ์„ ๋‹ค๋ฅธ ๊ด€์ ์—์„œ ๋ถ„์„ํ•œ ์—ฐ๊ตฌ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
์˜คํ”ˆ ๋ฆฌ์„œ์น˜ ์ธํ”„๋ผ์˜ ์˜ํ–ฅ์„ ๋‹ค๋ฅธ ๋ฐฉ๋ฒ•๋ก ์œผ๋กœ ์ •๋Ÿ‰ํ™”ํ•œ ์—ฐ๊ตฌ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋„๋ฉ”์ธ ํŠนํ™” ์˜คํ”ˆ ๋ฐ์ดํ„ฐ ์ธํ”„๋ผ์˜ ํ™œ์šฉ ํŒจํ„ด๊ณผ ์˜ํ–ฅ์„ scientometrics๋กœ ํ‰๊ฐ€ํ•œ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
์ด‰๋งค ๋ฐœ๊ฒฌ์„ ์œ„ํ•œ ๊ธฐ๊ณ„ํ•™์Šต ๋ชจ๋ธ์„ OC20 ๋ฐ์ดํ„ฐ์…‹ ์œ„์—์„œ ํ™•์žฅ ๋ฐœ์ „์‹œํ‚จ๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •