When should we trust the annotation? Selective prediction for molecular structure retrieval from mass spectra

์ €์ž: | ๋‚ ์งœ: 2026-03-11 | URL: https://arxiv.org/abs/2603.10950 📄 PDF


Essence

Figure 1

Fig. 1: Overview of the selective prediction framework for molecular structure retrieval from tandem

์งˆ๋Ÿ‰๋ถ„์„๊ธฐ ์ŠคํŽ™ํŠธ๋Ÿผ์—์„œ ๋ถ„์ž ๊ตฌ์กฐ๋ฅผ ๊ฒ€์ƒ‰ํ•  ๋•Œ ์–ด๋–ค ์˜ˆ์ธก์„ ์‹ ๋ขฐํ•  ์ˆ˜ ์žˆ๋Š”์ง€ ํŒ๋‹จํ•˜๊ธฐ ์œ„ํ•ด ์„ ํƒ์  ์˜ˆ์ธก(selective prediction) ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์•ˆํ•˜๊ณ , ๋ถˆํ™•์‹ค์„ฑ ์ •๋Ÿ‰ํ™” ์ „๋žต์„ ์ฒด๊ณ„์ ์œผ๋กœ ํ‰๊ฐ€ํ•œ๋‹ค.

Motivation

Achievement

Figure 3

Fig. 3: Risk-coverage analysis for different scoring functions ฮบ based on different uncertainty estimates as

How

Figure 1

Fig. 1: Overview of the selective prediction framework for molecular structure retrieval from tandem

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ์งˆ๋Ÿ‰๋ถ„์„ ๊ธฐ๋ฐ˜ ๋ถ„์ž ์‹๋ณ„์˜ ์‹ ๋ขฐ์„ฑ ๋ฌธ์ œ๋ฅผ ์„ ํƒ์  ์˜ˆ์ธก์œผ๋กœ ์ฒ˜์Œ ์ฒด๊ณ„์ ์œผ๋กœ ๋‹ค๋ฃจ๋ฉฐ, ์‹ค์ฆ์  ๋ถ„์„์„ ํ†ตํ•ด ๊ณ„์‚ฐ ํšจ์œจ์„ฑ๊ณผ ํ†ต๊ณ„์  ๋ณด์ฆ์˜ ์‹ค์šฉ์  ๊ท ํ˜•์„ ์ œ์‹œํ•œ ์šฐ์ˆ˜ํ•œ ์—ฐ๊ตฌ์ด๋‹ค. ์ž„์ƒ/ํ™˜๊ฒฝ ์‘์šฉ์˜ ์•ˆ์ „์„ฑ ์š”๊ตฌ๋ฅผ ์ถฉ์กฑํ•˜๋Š” uncertainty-aware ํ”„๋ ˆ์ž„์›Œํฌ์˜ ๋ชจ๋ฒ”์ด ๋œ๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
์ƒ์ฒด ๋ถ„์ž ์ƒํ˜ธ์ž‘์šฉ ๋ฐ ๊ตฌ์กฐ ์˜ˆ์ธก ์ •ํ™•์„ฑ ๋ฌธ์ œ๋ฅผ ๋‹ค๋ฃจ๋Š” ๋…ผ๋ฌธ์œผ๋กœ, ๊ตฌ์กฐ ์˜ˆ์ธก์˜ ์‹ ๋ขฐ์„ฑ ์ฒ™๋„ ๋ฐ ์„ ํƒ์  ์˜ˆ์ธก ๊ฐœ๋…์— ์ด๋ก ์ ์œผ๋กœ ๊ธฐ์ดˆ๋ฅผ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
665๋ฒˆ ๋…ผ๋ฌธ์€ LLM ๊ธฐ๋ฐ˜ ์ž๋™ ๋ฆฌ๋ทฐ ๋ฐ ์‹ ๋ขฐ์„ฑ ํ‰๊ฐ€์ง€ํ‘œ ๊ฐœ๋ฐœ์„ ๋‹ค๋ฃจ๋ฉฐ, 3283๋ฒˆ์˜ ์„ ํƒ์  ์˜ˆ์ธก ํ”„๋ ˆ์ž„์›Œํฌ์˜ ํƒ€๋‹น์„ฑ ํ‰๊ฐ€ ๋ฐ ์‹ ๋ขฐ์„ฑ ๋…ผ์˜์— ๊ธฐ์ดˆ๋ฅผ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๊ณผํ•™ ๋จธ์‹ ๋Ÿฌ๋‹์—์„œ์˜ ๋ถˆํ™•์‹ค์„ฑ ์ •๋Ÿ‰ํ™” ๋ฐฉ๋ฒ•์— ๊ด€ํ•œ ์ข…ํ•ฉ ๋ฆฌ๋ทฐ๋กœ, ๋ถ„์ž ๊ตฌ์กฐ ์‹ ๋ขฐ์„ฑ ํŒ๋ณ„์˜ ํƒ€ ์ ‘๊ทผ ๋ฐ ํ•œ๊ณ„๋„ ํ•จ๊ป˜ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
์งˆ๋Ÿ‰๋ถ„์„๊ธฐ ์‘์šฉ์— ๋Œ€ํ•œ ์„ ํƒ์  ์˜ˆ์ธก๊ณผ ๋‹ฌ๋ฆฌ, ๊ตฌ์กฐ ๊ธฐ๋ฐ˜ ํ† ํฌ๋‚˜์ด์ฆˆ๋ฅผ ํ†ตํ•œ ์˜ˆ์ธก ๋ฐ ์„ค๊ณ„์— ์ดˆ์ ์„ ๋งž์ถ˜ ์ ‘๊ทผ์ž…๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
Annotation์˜ ์‹ ๋ขฐ๋„์™€ LLM ํ™œ์šฉ๋œ ์˜ˆ์ธก ๋ฌธ์ œ๋ฅผ ์‹ค์ฆ์ ์œผ๋กœ ๋ถ„์„ํ•˜์—ฌ 206๋ฒˆ์˜ ์ž๋™ ์ฃผ์„ ์—ฐ๊ตฌ์— ๊นŠ์ด๋ฅผ ๋”ํ•ฉ๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
SciTrust๋Š” LLM ๊ธฐ๋ฐ˜ ๊ณผํ•™ ์˜ˆ์ธก์˜ ์‹ ๋ขฐ์„ฑ ํ‰๊ฐ€ ์ฒด๊ณ„๋ฅผ ์ œ๊ณตํ•˜์—ฌ, ์„ ํƒ์  ๋ถ„์ž ๊ตฌ์กฐ ์˜ˆ์ธก์˜ ์‹ ๋ขฐ์„ฑ ํŒ๋‹จ ์—ฐ๊ตฌ์™€ ์ง์ ‘์ ์œผ๋กœ ์—ฐ๊ด€๋œ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
โ€˜Trust, But Verifyโ€™ ๋…ผ๋ฌธ์€ ์ž๊ธฐ๊ฒ€์ฆ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ํ†ตํ•ด ์˜ˆ์ธก ์‹ ๋ขฐ์„ฑ ํŒ๋‹จ, ์„ ํƒ์  ์˜ˆ์ธก์˜ ๊ทผ๋ณธ์  ๋ฐฉํ–ฅ์„ฑ๊ณผ ๋งž๋‹ฟ์•„ ์žˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
๋‹จ๋ฐฑ์งˆ-๋ฆฌ๊ฐ„๋“œ ์ƒํ˜ธ์ž‘์šฉ ์˜ˆ์ธก์˜ ์‹ฌ์ธต์ /์ง๊ต์  ํ‰๊ฐ€๋กœ ๋ณธ ๋…ผ๋ฌธ์˜ ๋ถˆํ™•์‹ค์„ฑ ์ „๋žต์˜ ์‹ค์ œ ์ ํ•ฉ๋„๋ฅผ ํ‰๊ฐ€ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
๋ฐ˜๋ก /๋น„ํŒ
397๋ฒˆ ๋…ผ๋ฌธ์˜ ํ™˜๊ฐ(hallucination) ๋ถ„์„์€ 3283๋ฒˆ์—์„œ ๋‹ค๋ฃจ๋Š” ๋ถ„์ž๊ตฌ์กฐ ์˜ˆ์ธก ์‹ ๋ขฐ๋„ ํŒ๋‹จ์— ์žˆ์–ด ์ž ์žฌ์  ์˜ค๋ฅ˜ ๋ฐ ์ผ๋ฐ˜ํ™” ๋ฌธ์ œ์— ๋Œ€ํ•œ ๋น„ํŒ์  ๊ด€์ ์„ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •