Derivative-Free Guidance in Continuous and Discrete Diffusion Models with Soft Value-Based Decoding

์ €์ž: Xiner Li, Yulai Zhao, Chenyu Wang, Gabriele Scalia, Gรถkรงen Eraslan | ๋‚ ์งœ: 2024 | DOI: 10.48550/arXiv.2408.08252 📄 PDF


Essence

Figure 1

Figure 1: Summary of SVDD. v denotes value

๋ณธ ๋…ผ๋ฌธ์€ pre-trained diffusion models๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ๋ณด์ƒ ํ•จ์ˆ˜๋ฅผ ์ตœ์ ํ™”ํ•˜๋ฉด์„œ ์ž์—ฐ์Šค๋Ÿฌ์šด ๋ฐ์ดํ„ฐ ๋ถ„ํฌ๋ฅผ ๋ณด์กดํ•˜๋Š” ์ƒˆ๋กœ์šด ๋ฐฉ๋ฒ•์ธ Soft Value-based Decoding (SVDD)๋ฅผ ์ œ์•ˆํ•œ๋‹ค. SVDD๋Š” ๋ฏธ๋ถ„ ๋ถˆ๊ฐ€๋Šฅํ•œ ๋ณด์ƒ ํ•จ์ˆ˜๋ฅผ ์ง์ ‘ ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ๊ณ  ์—ฐ์† ๋ฐ ์ด์‚ฐ diffusion models ๋ชจ๋‘์— ์ ์šฉ ๊ฐ€๋Šฅํ•œ ์ถ”๋ก  ์‹œ๊ฐ„ ๋ฐฉ๋ฒ•์ด๋‹ค.

Motivation

Achievement

Figure 1

Figure 1: Summary of SVDD. v denotes value

์ด๋ฏธ์ง€ ์ƒ์„ฑ: ๋ฏธ๋ถ„ ๋ถˆ๊ฐ€๋Šฅํ•œ ๋ณด์ƒ ํ•จ์ˆ˜๋กœ ์ด๋ฏธ์ง€ ์กฐ๊ฑด๋ถ€ ์ƒ์„ฑ ์„ฑ๊ณต / ๋ถ„์ž ์ƒ์„ฑ: Docking score, QED, SA ์ ์ˆ˜ ์ตœ์ ํ™”์—์„œ ๋†’์€ ์„ฑ๋Šฅ ๋‹ฌ์„ฑ / DNA/RNA ์ƒ์„ฑ: ํ™œ์„ฑ๋„ ์ˆ˜์ค€ ์ตœ์ ํ™”์—์„œ ํšจ๊ณผ ์ž…์ฆ / ๋ฐฉ๋ฒ•๋ก ์  ๊ธฐ์—ฌ: Fine-tuning ๋ถˆํ•„์š”, ๋น„์ฐจ๋ณ„ํ™” ๋ณด์ƒ ์ง์ ‘ ์‚ฌ์šฉ ๊ฐ€๋Šฅ, ์ด์‚ฐ diffusion models์— ํ†ต์ผ๋œ ๋ฐฉ์‹์œผ๋กœ ์ ์šฉ ๊ฐ€๋Šฅ / ๊ณ„์‚ฐ ํšจ์œจ์„ฑ: Pre-trained ๋ชจ๋ธ ํ™œ์šฉ์œผ๋กœ ํ•™์Šต ๋น„์šฉ ์ ˆ๊ฐ

How

Figure 1

Figure 1: Summary of SVDD. v denotes value

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 4/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ๋ณธ ๋…ผ๋ฌธ์€ ๋ฏธ๋ถ„ ๋ถˆ๊ฐ€๋Šฅํ•œ ๋ณด์ƒ ํ•จ์ˆ˜ ์ตœ์ ํ™”์™€ ์ด์‚ฐ diffusion models ์ง€์›์ด๋ผ๋Š” ์ค‘์š”ํ•œ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๋Š” ์‹ค์šฉ์ ์ด๊ณ  ์›์น™์ ์ธ ๋ฐฉ๋ฒ•์„ ์ œ์‹œํ•œ๋‹ค. ์ด๋ก ์  ๊ทผ๊ฑฐ๊ฐ€ ์ถฉ๋ถ„ํ•˜๊ณ  ๋‹ค์–‘ํ•œ ์‘์šฉ ๋„๋ฉ”์ธ์—์„œ ์‹ค์ฆ์  ์„ฑ๊ณผ๋ฅผ ๋ณด์—ฌ์ฃผ๋Š” ์ˆ˜์ค€ ๋†’์€ ์—ฐ๊ตฌ์ด๋‹ค. ์ถ”๋ก  ๋น„์šฉ ์ฆ๊ฐ€๋ผ๋Š” ์ œ์•ฝ์ด ์žˆ์œผ๋‚˜ ์ƒ๋ฌผํ•™ ๋ฐ ํ™”ํ•™ ๋ถ„์•ผ์˜ ์‹ค์ œ ์‘์šฉ์„ฑ์ด ๋†’์•„ ๊ฒŒ์žฌ ๊ฐ€์น˜๊ฐ€ ์žˆ๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
296์€ ํ™•์‚ฐ ๋ชจ๋ธ์˜ ์ด๋ก ์  ๊ธฐ๋ฐ˜์ด๋‚˜ ๋””์ฝ”๋”ฉ ๋ฐฉ๋ฒ•๋ก ์„ ์ œ๊ณตํ•˜์—ฌ SVDD ๊ฐœ๋ฐœ์˜ ๋ฐฉ๋ฒ•๋ก ์  ํ† ๋Œ€๊ฐ€ ๋œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
RNA ์„ค๊ณ„ ์ตœ์ ํ™”์˜ ๋ฐฉ๋ฒ•๋ก ์  ๊ธฐ๋ฐ˜์ด ๋˜๋Š” ์ด์‚ฐ ์ตœ์ ํ™” ๊ด€๋ จ ์—ฐ๊ตฌ์ด๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
269๋ฒˆ ๋…ผ๋ฌธ์€ ํ™•์‚ฐ ๋ชจ๋ธ์˜ ๋ฏธ๋ถ„ ๋ฐ ๋น„์—ฐ์† ๋„๋ฉ”์ธ์—์„œ์˜ guidance ๊ธฐ๋ฒ•์„ ์†Œ๊ฐœํ•ด, CoCoGraph์˜ ์•Œ๊ณ ๋ฆฌ์ฆ˜์  ์ฒ ํ•™๊ณผ๋„ ์—ฐ๊ฒฐ๋ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
446์€ ํ™•์‚ฐ ๋ชจ๋ธ์—์„œ์˜ ๊ฐ€์ด๋˜์Šค ๋˜๋Š” ๋ณด์ƒ ์ตœ์ ํ™”๋ฅผ ์œ„ํ•œ ๋‹ค๋ฅธ ์ ‘๊ทผ๋ฒ•์„ ์ œ์‹œํ•˜์—ฌ SVDD์™€ ๋Œ€์•ˆ์ ์œผ๋กœ ๋น„๊ต๋œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
Derivative-Free Guidance in Continuous and Discrete Diffusion Models(269)์€ ๋ชจ๋ธ ๋ฏธ์„ธ์กฐ์ • ์—†์ด ๋ณด์ƒ ๊ธฐ๋ฐ˜ ์œ ๋„ ๊ธฐ๋ฒ•์„ ๊ฐœ๋ฐœํ•˜์—ฌ, 428์˜ ํ…Œ์ŠคํŠธํƒ€์ž„ ์ •๋ ฌ ์•„์ด๋””์–ด์™€ ์ง์ ‘ ๋น„๊ต๋œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
Derivative-Free Guidance๊ฐ€ ํ…Œ์ŠคํŠธ ํƒ€์ž„ ์„ฑ๋Šฅ ์ตœ์ ํ™” ๋ฐฉ๋ฒ•์˜ ๋‹ค๋ฅธ ๊ตฌํ˜„ ๋ฐฉ์‹์ด๋ฏ€๋กœ ๋‘ ์ ‘๊ทผ์„ ๋น„๊ตํ•ด๋ณผ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๊ณ ๋ถ„์ž ๋˜๋Š” ์ „์ž ์žฌ๋ฃŒ์˜ ์ฒ˜๋ฆฌ ๊ณต๊ฐ„์„ AI ๊ธฐ๋ฐ˜์œผ๋กœ ํƒ์ƒ‰ํ•˜๋Š” ์œ ์‚ฌํ•œ ๋ฐฉ๋ฒ•๋ก ์„ ์‚ฌ์šฉํ•œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
Adaptive Constraint Guidance ๊ธฐ๋ฐ˜ ํŒŒ์ธํŠœ๋‹ ์—†๋Š” ํ™•์‚ฐ๋ชจ๋ธ ์ƒ์„ฑ๋ฒ• ๋…ผ๋ฌธ์œผ๋กœ, ํŒŒ์ƒ์  ๋ณด์ƒ ์œ ๋„ ๋ฐ reward-guidance์˜ ๋˜๋‹ค๋ฅธ ์‹คํ˜„ ๋ฐฉ์•ˆ์„ ๋น„๊ตํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
์ƒ๋ฌผํ•™์  ํŒŒ์šด๋ฐ์ด์…˜ ๋ชจ๋ธ์˜ ์ค‘๊ฐ„ ์ธต ํ‘œํ˜„์„ ๋‹ค๋ฅธ ๊ด€์ ์—์„œ ๋ถ„์„ํ•œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
์ด์‚ฐ-์—ฐ์† ํ˜ผํ•ฉ ๊ณต๊ฐ„์—์„œ์˜ ๋””ํ“จ์ „ ๋ฐ ๊ฐ€์ด๋“œ ๋ฐฉ์‹์— ๊ด€ํ•œ ์ตœ์‹  ํ”„๋ ˆ์ž„์›Œํฌ์˜ ๋Œ€์•ˆ์  ์ ‘๊ทผ์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
682 ๋…ผ๋ฌธ์€ ๋ฏธ์„ธ์กฐ์ • ์—†๋Š” ์ƒ˜ํ”Œ ์ƒ์„ฑ์—์„œ ๋ณด์ƒ์„ ํ™œ์šฉํ•˜๋Š” ํ™•์‚ฐ ๋ชจ๋ธ ์ตœ์ ํ™” ๋ฐฉ๋ฒ•์„ ์ถ”๊ฐ€์ ์ธ ์ •์ œ ๊ธฐ๋ฒ•์œผ๋กœ ํƒ๊ตฌํ•œ๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
269์˜ reward ๊ธฐ๋ฐ˜ ์ƒ์„ฑ ์›๋ฆฌ๋Š” 433์˜ RL ๊ธฐ๋ฐ˜ ์‹คํ—˜์  ์˜์‚ฌ๊ฒฐ์ • ๋ฐ ์‹คํ—˜ ์‹œ๋ฎฌ๋ ˆ์ด์…˜์— ์‹ค์ œ๋กœ ์ ์šฉ๋  ์ˆ˜ ์žˆ๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •