How Creative Are Large Language Models in Generating Molecules?

์ €์ž: | ๋‚ ์งœ: 2026-04-20 | URL: https://arxiv.org/abs/2604.18031 📄 PDF


Essence

Figure 1

Figure 1. Overview of how LLM creativity in molecule generation

Large Language Model์˜ ๋ถ„์ž ์ƒ์„ฑ ๋Šฅ๋ ฅ์„ convergent creativity(์ œ์•ฝ ์กฐ๊ฑด ๋งŒ์กฑ)์™€ divergent creativity(ํ™”ํ•™๊ณต๊ฐ„ ํƒ์ƒ‰)์˜ ๋‘ ์ฐจ์›์œผ๋กœ ์ •๋Ÿ‰ ํ‰๊ฐ€ํ•˜์—ฌ, LLM์ด ์ œ์•ฝ ์กฐ๊ฑด ์ถ”๊ฐ€ ์‹œ ์˜คํžˆ๋ ค ๋งŒ์กฑ๋„๊ฐ€ ํ–ฅ์ƒ๋˜๋Š” ๋…ํŠนํ•œ ์ฐฝ์˜์„ฑ ํŒจํ„ด์„ ๋ณด์ž„์„ ๊ทœ๋ช…ํ•œ๋‹ค.

Motivation

Achievement

Figure 2

Figure 2. Creativity profiles vary systematically by task type. Physicochemical and ADMET tasks show high convergent cre

How

Figure 1

Figure 1. Overview of how LLM creativity in molecule generation

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ์ฐฝ์˜์„ฑ์ด๋ผ๋Š” ์ƒˆ๋กœ์šด ๋ Œ์ฆˆ๋กœ LLM ๊ธฐ๋ฐ˜ ๋ถ„์ž ์ƒ์„ฑ์„ ์žฌํ•ด์„ํ•˜๊ณ , ๋‘ ์ฐจ์›์˜ ๊ท ํ˜• ์žกํžŒ ํ‰๊ฐ€ ํ”„๋ ˆ์ž„์›Œํฌ์™€ ์—ญ์„ค์  ์ œ์•ฝ ํšจ๊ณผ ๋ฐœ๊ฒฌ์„ ํ†ตํ•ด LLM์˜ ์•ฝ๋ฌผ ๋ฐœ๊ฒฌ ์‘์šฉ์— ๋Œ€ํ•œ ์ฒด๊ณ„์  ์ดํ•ด๋ฅผ ์ œ์‹œํ•˜๋Š” ์šฐ์ˆ˜ํ•œ ์—ฐ๊ตฌ๋กœ, ํ™”ํ•™๊ณผ AI์˜ ๊ต์ฐจ ๋ถ„์•ผ ์‹ค๋ฌด์ž์—๊ฒŒ ๋†’์€ ์ฐธ๊ณ  ๊ฐ€์น˜๊ฐ€ ์žˆ๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
509 ๋…ผ๋ฌธ์€ LLM์˜ ์กฐํ•ฉ์  ์ฐฝ์˜์„ฑ ์ธก์ • ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ๊ณตํ•˜์—ฌ ๋ถ„์ž ์ƒ์„ฑ ํ‰๊ฐ€(3131)์˜ ๋ฐฉ๋ฒ•๋ก ์  ๋ฐฐ๊ฒฝ์ด ๋ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
๋ถ„์ž ์ƒ์„ฑ ๋ชจ๋ธ์˜ ํ‰๊ฐ€ ์ง€ํ‘œ ๋ฐ ํ™”ํ•™๊ณต๊ฐ„ ํƒ์ƒ‰์˜ ๊ธฐ๋ฐ˜ ๋ฐฉ๋ฒ•๋ก ์„ ์ œ๊ณตํ•œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
๋ถ„์ž ์„ค๊ณ„๋ฅผ ์œ„ํ•œ ์–ธ์–ด ๋ชจ๋ธ ๊ธฐ๋ฐ˜ ํ”„๋ ˆ์ž„์›Œํฌ์˜ ๋ฐฉ๋ฒ•๋ก ์  ๊ธฐ๋ฐ˜์ด ๋˜๋Š” ์—ฐ๊ตฌ์ด๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
๋ถ„์ž ์ƒ์„ฑ ๋ฐ ํ‘œํ˜„์„ ์œ„ํ•œ ์–ธ์–ด ๋ชจ๋ธ ๊ธฐ๋ฐ˜ ์ ‘๊ทผ๋ฒ•์˜ ๊ธฐ๋ฐ˜์ด ๋˜๋Š” ์—ฐ๊ตฌ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
LLM ๊ธฐ๋ฐ˜ ๋Œ€๊ทœ๋ชจ ํ™”ํ•™๊ณต๊ฐ„ ํƒ์ƒ‰๊ณผ ์ง„ํ™”์  ๋ถ„์ž ์ƒ์„ฑ์˜ ๋Œ€์•ˆ์  ๋ฐฉ๋ฒ•์ž…๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
SE(3)-๋ถˆ๋ณ€ ๋ถ„์ž ํ‘œํ˜„ ํ•™์Šต์˜ ๋‹ค๋ฅธ ๋ฐฉ๋ฒ•๋ก ์„ ์ œ์‹œํ•˜๋Š” ์œ ์‚ฌํ•œ ์—ฐ๊ตฌ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
์ œ์•ฝ ์กฐ๊ฑด ๊ธฐ๋ฐ˜ ๋ถ„์ž ์ƒ์„ฑ์„ ์œ„ํ•œ ๋‹ค๋ฅธ ๋”ฅ๋Ÿฌ๋‹ ์ ‘๊ทผ๋ฒ•์„ ์ œ์‹œํ•œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
LLM์„ ํ™œ์šฉํ•œ ๋ถ„์ž ์ƒ์„ฑ ๋ฐ ์„ค๊ณ„์˜ ๋Œ€์•ˆ์  ์ ‘๊ทผ๋ฒ•์„ ์ œ์‹œํ•œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
LLM ๊ธฐ๋ฐ˜ ๊ณผํ•™์  ์ฐฝ์˜์„ฑ ํ‰๊ฐ€๋ฅผ ์œ„ํ•œ ๋Œ€์•ˆ์  ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์‹œํ•œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
ํ™”ํ•™์  ์ฐฝ์˜์„ฑ ๋ฐ ๋‹ค์–‘์„ฑ์„ ์œ„ํ•œ ๋‹ค๋ฅธ ๋ถ„์ž ์ƒ์„ฑ ๋ฐฉ๋ฒ•๋ก ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋‹จ๋ฐฑ์งˆ ์–ธ์–ด๋ชจ๋ธ์˜ ์ƒ์„ฑ ๋ฐ ํŒจํ„ด ํƒ์ง€ ๋Šฅ๋ ฅ์„ ๋‹ค๋ฅธ ๋ฐฉ์‹์œผ๋กœ ํ‰๊ฐ€ํ•œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋ถ„์ž ์„ค๊ณ„ ๋ฐ ํ•ฉ์„ฑ ๊ณ„ํš์— ์–ธ์–ด ๋ชจ๋ธ์„ ํ™œ์šฉํ•˜๋Š” ์œ ์‚ฌํ•œ ์ ‘๊ทผ๋ฒ•์˜ ์—ฐ๊ตฌ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋‹จ๋ฐฑ์งˆ ์„œ์—ด ์ƒ์„ฑ์„ ์œ„ํ•œ LLM ํ™œ์šฉ์˜ ๋Œ€์•ˆ์  ์ ‘๊ทผ๋ฒ•์„ ์ œ์‹œํ•œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋ถ„์ž ๋ฐ ๋‹จ๋ฐฑ์งˆ ์„ค๊ณ„๋ฅผ ์œ„ํ•œ ์–ธ์–ด ๋ชจ๋ธ ๊ธฐ๋ฐ˜ ์ƒ์„ฑ ํ”„๋ ˆ์ž„์›Œํฌ์— ๋Œ€ํ•œ ์œ ์‚ฌํ•œ ์—ฐ๊ตฌ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
How Creative Are Large Language Models in Generating Molecular Structures ๋…ผ๋ฌธ์€ LLM์˜ ๋ถ„์ž ๊ตฌ์กฐ ์ƒ์„ฑ ๋ฐ ์ฐฝ์˜์„ฑ ํ‰๊ฐ€๋ฅผ ๋‹ค๋ฃจ์–ด AI๊ฐ€ ๊ณผํ•™๊ณ„ ๊ตฌ์กฐ์  ๋‹ค์–‘์„ฑ์— ๋ฏธ์น˜๋Š” ์˜ํ–ฅ๊ณผ ๋Œ€๋น„๋ฉ๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
๋Œ€ํ˜• ์–ธ์–ด ๋ชจ๋ธ์ด ๊ณผํ•™์ž ์ƒ์‚ฐ์„ฑ์— ๋ฏธ์น˜๋Š” ์˜ํ–ฅ๊ณผ ์ฐฝ์˜์„ฑ, ๊ณผํ•™์  ๋‹ค์–‘์„ฑ ๋ณ€ํ™”์— ๋Œ€ํ•œ ์‹ค์ฆ๋ถ„์„์„ ๋ณด๋‹ค ์ •๋Ÿ‰์ ์œผ๋กœ ํ™•์žฅํ•ฉ๋‹ˆ๋‹ค.
๋ฐ˜๋ก /๋น„ํŒ
397 ๋…ผ๋ฌธ์€ LLM์˜ ํ™˜๊ฐ ํ˜„์ƒ์ด ์˜คํžˆ๋ ค ํ™”ํ•ฉ๋ฌผ ์ฐฝ์˜์„ฑ์— ๊ธ์ •์ ์œผ๋กœ ์ž‘์šฉํ•  ์ˆ˜ ์žˆ์Œ์„ ๋…ผ์˜ํ•˜๋ฉฐ, 3131์˜ ์ฐฝ์˜์„ฑ ๋ฉ”์ปค๋‹ˆ์ฆ˜ ํ•ด์„๊ณผ ์ƒ๋ฐ˜๋ฉ๋‹ˆ๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •