DrugPlayGround: Benchmarking Large Language Models and Embeddings for Drug Discovery

Authors: | Date: 2026-04-07 | URL: https://www.biorxiv.org/content/10.64898/2026.04.04.716470v1


Essence

Figure 1

Fig. 1: Overview of DrugPlayground. We prepare datasets with molecule-text-paired

DrugPlayGround is a benchmarking platform that systematically evaluates the performance of large language models (LLMs) on four representative drug discovery tasks: drug function analysis, drug-protein interaction prediction, drug synergy combination prediction, and drug perturbation prediction. It objectively measures the quality of LLM-generated text descriptions and embeddings under varied prompt settings and model temperatures, and it incorporates chemist feedback into the evaluation.
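To make the text-quality measurement concrete, here is a minimal sketch of scoring generated drug descriptions against a reference with a BLEU-style metric, the metric named in the Fig. 2 caption. This review does not state the paper's tokenization, smoothing, or n-gram order, so the whitespace tokenizer, the single-reference setup, the function names `ngrams` and `bleu`, and the toy strings and temperatures are all assumptions made for illustration.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count all n-grams (as tuples) in a list of tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def bleu(candidate, reference, max_n=4):
    """Simplified sentence-level BLEU against one reference, without smoothing.

    Assumption: lowercased whitespace tokenization. Returns 0.0 as soon as
    any n-gram order has no overlap with the reference.
    """
    cand = candidate.lower().split()
    ref = reference.lower().split()
    if not cand:
        return 0.0
    log_precisions = []
    for n in range(1, max_n + 1):
        cand_counts, ref_counts = ngrams(cand, n), ngrams(ref, n)
        total = sum(cand_counts.values())
        # Clip each candidate n-gram count by its count in the reference.
        overlap = sum(min(c, ref_counts[g]) for g, c in cand_counts.items())
        if total == 0 or overlap == 0:
            return 0.0
        log_precisions.append(math.log(overlap / total))
    # Brevity penalty: penalize candidates shorter than the reference.
    bp = 1.0 if len(cand) >= len(ref) else math.exp(1 - len(ref) / len(cand))
    # Geometric mean of the clipped n-gram precisions, scaled by the penalty.
    return bp * math.exp(sum(log_precisions) / max_n)


# Toy data (hypothetical): one reference description and two generations
# keyed by the sampling temperature that produced them.
reference = "aspirin inhibits cyclooxygenase and reduces inflammation"
generations = {
    0.0: "aspirin inhibits cyclooxygenase and reduces inflammation",
    1.0: "aspirin reduces inflammation",
}
scores = {t: bleu(text, reference, max_n=2) for t, text in generations.items()}
for temperature, score in scores.items():
    print(f"temperature={temperature}: BLEU-2={score:.3f}")
```

A real evaluation would more likely rely on an established implementation with documented tokenization and smoothing, and would average over many molecule-text pairs rather than a single pair.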

Motivation

Achievement

Figure 2

Fig. 2: Model Performance in Terms of Text Generation (a) Five LLMs’ BLEU scores

How

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 | Technical Soundness: 4/5 | Significance: 4/5 | Clarity: 4/5 | Overall: 4/5

Overall assessment: DrugPlayGround is an integrated platform that systematically benchmarks LLM performance in the highly specialized domain of drug discovery. Its value lies in a comprehensive approach that covers both drug description generation (text evaluation) and embedding-based downstream tasks (drug-protein interaction, synergy prediction, and others). An evaluation that combines varied prompting strategies, temperature settings, and chemist feedback is expected to help clarify how practically useful LLMs are. However, more information and refinement are needed on dataset scale, the chemical validity of the metrics, statistical significance testing, and the systematization of the chemist evaluation. In addition, the excerpt of the paper reviewed here omits the complete embedding-based performance results, which limits the final judgment.

Related papers worth reading

Foundational work
Paper No. 151 (BERT) provides the basis for pretraining large language models on biological data, so it helps in understanding the theoretical foundation of the LLM benchmark presented in paper No. 3080.
Foundational work
MolQuest is a benchmark for evaluating drug-related deductive reasoning, and it underpins the logic of DrugPlayGround's task and embedding quality evaluation structure.
Different approach
Comparing evaluations of the scientific analysis ability of LLMs specialized for code generation and understanding, such as Code Llama, with DrugPlayGround's benchmark (LLM performance on drug discovery tasks) highlights the difference between task specificity and LLM generality.
Different approach
MASSW is a benchmark for AI-assisted chemistry experiments. Its benchmarking goal is similar to DrugPlayGround's, but its task scope differs.
Follow-up work
MOOSE-Chem evaluates the ability of LLMs to rediscover novel compounds, extending DrugPlayGround's evaluation framework for drug discovery tasks to hands-on experimental validation.