Uncertainty quantification in scientific machine learning: Methods, metrics, and comparisons

์ €์ž: Apostolos F. Psaros, Xuhui Meng, Zongren Zou, Ling Guo, George Em Karniadakis | ๋‚ ์งœ: 03/2023 | DOI: 10.1016/j.jcp.2022.111902 📄 PDF


Essence

์‹ ๊ฒฝ๋ง(Neural Networks, NN) ๊ธฐ๋ฐ˜์˜ ๊ณผํ•™ ๊ธฐ๊ณ„ํ•™์Šต(Scientific Machine Learning, SciML)์—์„œ ์˜ˆ์ธก ๋ถˆํ™•์‹ค์„ฑ์„ ์ฒด๊ณ„์ ์œผ๋กœ ์ •๋Ÿ‰ํ™”ํ•˜๋Š” ํฌ๊ด„์  ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์‹œํ•˜๊ณ , ๋‹ค์–‘ํ•œ UQ ๋ฐฉ๋ฒ•๋“ค์„ ํ•จ์ˆ˜ ๊ทผ์‚ฌ, ํŽธ๋ฏธ๋ถ„๋ฐฉ์ •์‹ ํ’€์ด, ์—ฐ์‚ฐ์ž ํ•™์Šต ๋ฌธ์ œ์—์„œ ๋น„๊ต ํ‰๊ฐ€ํ•œ๋‹ค. ํŠนํžˆ ๋ฌผ๋ฆฌ์ •๋ณด์‹ ๊ฒฝ๋ง(Physics-Informed Neural Network, PINN)๊ณผ ์‹ฌ์ธต์—ฐ์‚ฐ์ž๋ง(DeepONet)์„ ์ค‘์‹ฌ์œผ๋กœ ๋ถˆํ™•์‹ค์„ฑ ๋ชจ๋ธ๋ง, ์ •๋Ÿ‰ํ™” ๋ฐฉ๋ฒ•, ํ‰๊ฐ€ ์ง€ํ‘œ๋ฅผ ํ†ตํ•ฉ์ ์œผ๋กœ ๋‹ค๋ฃฌ๋‹ค.

Motivation

Achievement

Figure 2: ๋ฐ์ดํ„ฐ ๋…ธ์ด์ฆˆ, ์ œํ•œ๋œ ๋ฐ์ดํ„ฐ, ๋ชจํ˜• ์˜ค๋ช…์‹œ ๋“ฑ์œผ๋กœ๋ถ€ํ„ฐ์˜ ์ „์ฒด ๋ถˆํ™•์‹ค์„ฑ ๋ถ„ํ•ด

๊ทธ๋ฆผ 2: ๋ฐ์ดํ„ฐ(๋…ธ์ด์ฆˆ, ๊ฐญ), ๋ฌผ๋ฆฌ ๋ชจํ˜•, ์‹ ๊ฒฝ๋ง์œผ๋กœ๋ถ€ํ„ฐ์˜ ๋ถˆํ™•์‹ค์„ฑ ๊ธฐ์—ฌ๋„ ์ •์„ฑ์  ๋ถ„์„

  1. ํฌ๊ด„์  UQ ํ”„๋ ˆ์ž„์›Œํฌ ์ œ์‹œ: ํ•จ์ˆ˜ ๊ทผ์‚ฌ, PINN ๊ธฐ๋ฐ˜ PDE ํ’€์ด, DeepONet ๊ธฐ๋ฐ˜ ์—ฐ์‚ฐ์ž ํ•™์Šต์—์„œ์˜ ํ†ต์ผ๋œ ๋ถˆํ™•์‹ค์„ฑ ๋ชจ๋ธ๋ง ๋ฐ ์ •๋Ÿ‰ํ™” ๋ฐฉ๋ฒ•๋ก  ๊ฐœ๋ฐœ. ์ด ๋ถˆํ™•์‹ค์„ฑ์„ ์ธ์‹๋ก ์  ๋ถˆํ™•์‹ค์„ฑ(epistemic uncertainty)๊ณผ ์šฐ์—ฐ์  ๋ถˆํ™•์‹ค์„ฑ(aleatoric uncertainty)์œผ๋กœ ๋ถ„ํ•ด.
  2. ๋‹ค์–‘ํ•œ UQ ๋ฐฉ๋ฒ•๋ก  ํ†ตํ•ฉ: Bayesian ๋ฐฉ๋ฒ•(Variational Inference, MCMC, Laplace approximation), ์•™์ƒ๋ธ”(ensemble) ๋ฐฉ๋ฒ•, ํ•จ์ˆ˜ ์‚ฌ์ „๋ถ„ํฌ(Functional Priors, FP), ํ™•๋ฅ ๋ฏธ๋ถ„๋ฐฉ์ •์‹(Stochastic PDE, SPDE) ํ•ด์„ ๋ฐฉ๋ฒ•์„ ์ฒด๊ณ„์ ์œผ๋กœ ๋น„๊ต ํ‰๊ฐ€. ๊ฐ ๋ฐฉ๋ฒ•์˜ ๊ณ„์‚ฐ ๋น„์šฉ(computational cost)๊ณผ ์„ฑ๋Šฅ์„ ์ •๋Ÿ‰ํ™”.
  3. ํ‰๊ฐ€ ์ง€ํ‘œ ๋ฐ ๋ณด์ • ๋ฐฉ๋ฒ• ๊ฐœ๋ฐœ: ์˜ˆ์ธก ๋ถˆํ™•์‹ค์„ฑ์˜ ํ’ˆ์งˆ์„ ํ‰๊ฐ€ํ•˜๊ธฐ ์œ„ํ•œ ์ง€ํ‘œ(coverage probability, negative log-likelihood, prediction interval coverage probability, calibration) ๋ฐ ์‚ฌํ›„ ๋ณด์ •(post-hoc calibration) ๊ธฐ๋ฒ• ์ œ์‹œ. ๋ถ„ํฌ ์‹œํ”„ํŠธ ๊ฐ์ง€ ๋Šฅ๋ ฅ ํ‰๊ฐ€.
  4. ๊ด‘๋ฒ”์œ„ํ•œ ์‹ค์ฆ ๋น„๊ต ์—ฐ๊ตฌ: ๋ถˆ์—ฐ์† ํ•จ์ˆ˜ ๊ทผ์‚ฌ, ํ˜ผํ•ฉ ๊ฒฐ์ •๋ก ์  PDE(๋น„์„ ํ˜• ์‹œ๊ฐ„-์˜์กด ํ™•์‚ฐ-๋ฐ˜์‘ ๋ฐฉ์ •์‹), ํ˜ผํ•ฉ ํ™•๋ฅ ์  ํƒ€์› ๋ฐฉ์ •์‹, 2D ๋‹ค๊ณต์งˆ ๋งค์งˆ ์œ ๋™ ์—ฐ์‚ฐ์ž ํ•™์Šต ๋“ฑ 5๊ฐœ ํ”„๋กœํ† ํƒ€์ž… ๋ฌธ์ œ์—์„œ ๋ชจ๋“  ๋ฐฉ๋ฒ• ๊ฒ€์ฆ.
  5. ์˜คํ”ˆ์†Œ์Šค ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ ๊ฐœ๋ฐœ: NeuralUQ ํŒŒ์ด์ฌ ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ(github.com/Crunch-UQ4MI/neuraluq) ์ œ๊ณต์œผ๋กœ ์žฌํ˜„์„ฑ๊ณผ ์ ‘๊ทผ์„ฑ ํ™•๋ณด. ๊ต์œก์šฉ ํŠœํ† ๋ฆฌ์–ผ ๋ฐ ์ถ”๊ฐ€ ๊ณ„์‚ฐ ์‹คํ—˜ ํฌํ•จ.

How

Figure 4: ์ž…๋ ฅ ํ•จ์ˆ˜ ฮป(x; ฮพ)์™€ ๋Œ€์‘๋˜๋Š” ์ถœ๋ ฅ u_ฮธ(x; ฮพ)๋ฅผ ๊ฐ€์ง„ DeepONet ๊ตฌ์กฐ

๊ทธ๋ฆผ 4: DeepONet ์•„ํ‚คํ…์ฒ˜๋กœ ๋ถ„๊ธฐ ๋„คํŠธ์›Œํฌ(branch net)๊ฐ€ ํ•จ์ˆ˜ ์ž…๋ ฅ์„ ์ธ์ฝ”๋”ฉํ•˜๊ณ  ํŠธ๋ ํฌ ๋„คํŠธ์›Œํฌ(trunk net)๊ฐ€ ๊ณต๊ฐ„ ์œ„์น˜๋ฅผ ์ฒ˜๋ฆฌ

Originality

Limitation & Further Study

Evaluation

Novelty: 4.5/5 Technical Soundness: 4/5 Significance: 4.5/5 Clarity: 4/5 Overall: 4.25/5

์ดํ‰: ๋ณธ ๋…ผ๋ฌธ์€ ๊ณผ

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
๊ณผํ•™ ML์˜ ๋ถˆํ™•์‹ค์„ฑ ์ •๋Ÿ‰ํ™” ์ฃผ์š” ์ด์Šˆ๋ฅผ ๋‹ค๋ฃจ๋ฉฐ, LLM์˜ ๋ฒ ์ด์ง€์•ˆ ์ตœ์ ํ™” ์ ์šฉ์—์„œ ๋ถˆํ™•์‹ค์„ฑ ํ‰๊ฐ€์— ๋Œ€ํ•œ ์ด๋ก ์  ํ† ๋Œ€๋ฅผ ์ œ๊ณตํ•œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
์–ธ์ œ ๊ธฐ๊ณ„ํ•™์Šต ๊ธฐ๋ฐ˜ ๋ชจ๋ธ์˜ ๋ถˆํ™•์‹ค์„ฑ์„ ์‹ ๋ขฐํ•  ์ˆ˜ ์žˆ๋Š”์ง€์— ๋Œ€ํ•œ ์ด๋ก ์  ๊ธฐ์ค€์„ ์ œ์‹œํ•ด ๊ณผํ•™์  UQ ๋ฐฉ๋ฒ•๋ก ์˜ ๊ทผ๊ฐ„์ด ๋ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
๊ณผํ•™ ๋จธ์‹ ๋Ÿฌ๋‹์—์„œ ๋ถˆํ™•์‹ค์„ฑ ์ •๋Ÿ‰ํ™”(uncertainty quantification)์˜ ๋‹ค์–‘ํ•œ ๋ฐฉ๋ฒ•๋ก ์„ ํฌ๊ด„์ ์œผ๋กœ ๋‹ค๋ฃจ๋ฏ€๋กœ, 380 ๋…ผ๋ฌธ์˜ ์ œ์–ด ๊ด€์  ๊ธฐ๋ฐ˜ ๋ฐฉ๋ฒ•์„ ์ดํ•ดํ•˜๋Š” ๋ฐ ์ด๋ก ์  ๊ธฐ๋ฐ˜์„ ์ œ๊ณตํ•จ.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
103์€ ์‹ ๊ฒฝ ์—ฐ์‚ฐ์ž ๊ณ„์—ด ์•„ํ‚คํ…์ฒ˜์˜ ๋น„๊ต ๋ฆฌ๋ทฐ๋กœ, 850์—์„œ ๋‹ค๋ฃจ๋Š” PDE ํ’€์ด UQ ํ”„๋ ˆ์ž„์›Œํฌ์™€ ์—ฐ์‚ฐ์ž ํ•™์Šต ๋งฅ๋ฝ์„ ๋ณด๋‹ค ํญ๋„“๊ฒŒ ์ดํ•ดํ•˜๋Š” ๋ฐ ๊ธฐ์ดˆ๋ฅผ ์ œ๊ณตํ•œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
๊ณผํ•™ ๋จธ์‹ ๋Ÿฌ๋‹์—์„œ ๋ถˆํ™•์‹ค์„ฑ ์ •๋Ÿ‰ํ™” ์ด๋ก ๊ณผ ๋‹ค์–‘ํ•œ ๋ฐฉ๋ฒ•๋ก ์„ ์ฒด๊ณ„์ ์œผ๋กœ ์ •๋ฆฌํ•˜๊ณ  ์žˆ์–ด, evidential deep learning์˜ ๊ฐœ๋…์  ๊ธฐ๋ฐ˜์ด ๋ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
๊ณผํ•™์  ๊ธฐ๊ณ„ํ•™์Šต์—์„œ ๋ถˆํ™•์‹ค์„ฑ ์ •๋Ÿ‰ํ™”์— ๋Œ€ํ•œ ๊ด‘๋ฒ”์œ„ํ•œ ๋…ผ์˜๊ฐ€ DenSNet์˜ ์˜ˆ์ธก ์‹ ๋ขฐ๋„ ๋ถ„์„์— ์ง์ ‘ ์—ฐ๊ด€๋ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋”ฅ๋Ÿฌ๋‹์˜ ๋ถˆํ™•์‹ค์„ฑ ์ •๋Ÿ‰ํ™” ๋ฐฉ๋ฒ•์„ ๋‹ค์–‘ํ•œ ๊ด€์ ์—์„œ ์ฒด๊ณ„์ ์œผ๋กœ ๋ถ„๋ฅ˜ ๋ฐ ๊ฒ€ํ† ํ•˜์—ฌ, ์‹ ๊ฒฝ ์—ฐ์‚ฐ์ž ๊ธฐ๋ฐ˜ UQ ๋…ผ๋ฌธ์˜ ์‹œ์•ผ๋ฅผ ํ™•์žฅ์‹œ์ผœ์ค๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
ํ•™์ˆ ์  ํ‰๊ฐ€์šฉ LLM์˜ ์„ฑ๋Šฅ ํ‰๊ฐ€์™€ ์‹ ๋ขฐ์„ฑ ๋ฌธ์ œ๋„ ๋‹ค๋ฃจ๋ฉฐ, ์‹ ๊ฒฝ๋ง ๊ธฐ๋ฐ˜ ๊ณผํ•™์  ์˜ˆ์ธก์˜ ๋ถˆํ™•์‹ค์„ฑ ์‹ ๋ขฐ์„ฑ๊ณผ ์ง์ ‘ ์—ฐ๊ฒฐ๋œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๊ณผํ•™ ๋จธ์‹ ๋Ÿฌ๋‹์—์„œ์˜ ๋ถˆํ™•์‹ค์„ฑ ์ •๋Ÿ‰ํ™” ๋ฐฉ๋ฒ•์— ๊ด€ํ•œ ์ข…ํ•ฉ ๋ฆฌ๋ทฐ๋กœ, ๋ถ„์ž ๊ตฌ์กฐ ์‹ ๋ขฐ์„ฑ ํŒ๋ณ„์˜ ํƒ€ ์ ‘๊ทผ ๋ฐ ํ•œ๊ณ„๋„ ํ•จ๊ป˜ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
์‹ ๊ฒฝ ์—ฐ์‚ฐ์ž ๊ธฐ๋ฐ˜ ๊ณผํ•™ ๋จธ์‹ ๋Ÿฌ๋‹์—์„œ ๋ถˆํ™•์‹ค์„ฑ ์ •๋Ÿ‰ํ™” ๋ฒค์น˜๋งˆํฌ๋ฅผ ์ˆ˜ํ–‰ํ•ด UQ ์ข…ํ•ฉ ์„ค๋ฌธ์˜ ์‹ค๋ฌด์  ์ ์šฉ ์‚ฌ๋ก€๋ฅผ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
850์˜ UQ ํ”„๋ ˆ์ž„์›Œํฌ๋Š” 103์—์„œ ์ •๋ฆฌํ•œ ์—ฌ๋Ÿฌ ์‹ ๊ฒฝ์—ฐ์‚ฐ์ž ์•„ํ‚คํ…์ฒ˜(PINN, FNO ๋“ฑ)์˜ ๋ถˆํ™•์‹ค์„ฑ ์ •๋Ÿ‰ํ™” ๋ฐ ํ•ด์„์— ์ง์ ‘์ ์œผ๋กœ ์—ฐ๊ฒฐ๋œ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
427์˜ cd-PINN์€ 850์˜ ๋ฌผ๋ฆฌ์ •๋ณด์‹ ๊ฒฝ๋ง ๊ธฐ๋ฐ˜ PDE ํ’€์ด์™€ ๋ถˆํ™•์‹ค์„ฑ ์ •๋Ÿ‰ํ™”์˜ ๊ธฐ๋Šฅ์„ ์—ฐ์† ์˜์กด์„ฑ๊นŒ์ง€ ํ™•์žฅํ•œ๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
Uncertainty quantification in scientific machine learning ๋…ผ๋ฌธ์€ PINN ๋ฐ ๊ณผํ•™๊ธฐ๊ณ„ํ•™์Šต์—์„œ ๋ถˆํ™•์‹ค์„ฑ ์ •๋Ÿ‰ํ™” ๋ฐ ํ•ด์„ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ๊ณตํ•˜์—ฌ, ํŠน์ด์  ๊ฒ€์ฆ ๋ฐ ์‹ ๋ขฐ ๋ถ„์„ ๋ฐฉ๋ฒ• ๊ฐ•ํ™”๋ฅผ ๋„์™€์ค๋‹ˆ๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
๋น„์„ ํ˜•์ /ํ™•๋ฅ ์  ๋ฌผ๋ฆฌ๊ณ„์—์„œ ์‹ ๋ขฐ์„ฑยท๋ถˆํ™•์‹ค์„ฑ ์ถ”์ •์„ ๋‹ค๋ฃธ์œผ๋กœ์จ 576์˜ SNR ํ˜„์ƒ๊ณผ ์ˆ˜๋ฆฌ์  ๋ชจ๋ธ๋ง์— ์ง์ ‘์  ์‹œ์‚ฌ์ ์„ ์ค€๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •