AlphaFold Database expands to proteome-scale quaternary structures

์ €์ž: | ๋‚ ์งœ: 2026-03-29 | URL: https://www.biorxiv.org/content/10.64898/2026.03.27.714458v1 📄 PDF


Essence

Figure 2

Figure 2: Homodimer analysis de๏ฌnes a high-con๏ฌdence prediction threshold and assesses structural

AlphaFold Database๋ฅผ ๋‹จ๋ฐฑ์งˆ ๋ณตํ•ฉ์ฒด(homo- ๋ฐ heteromeric)์˜ 3D ๊ตฌ์กฐ ์˜ˆ์ธก์œผ๋กœ ํ™•์žฅํ•˜์˜€์œผ๋ฉฐ, 4,777๊ฐœ ํ”„๋กœํ…Œ์˜ด์—์„œ 3,100๋งŒ ๊ฐœ ์ด์ƒ์˜ ๋ณตํ•ฉ์ฒด๋ฅผ ์˜ˆ์ธกํ•˜๊ณ  ์‹ ๋ขฐ๋„ ๋ฉ”ํŠธ๋ฆญ์„ ๊ธฐ๋ฐ˜์œผ๋กœ 180๋งŒ ๊ฐœ์˜ ๊ณ ์‹ ๋ขฐ ๋ณตํ•ฉ์ฒด๋ฅผ AFDB์— ๊ณต๊ฐœํ•˜์˜€๋‹ค. ์ด๋Š” ๋‹จ๋Ÿ‰์ฒด ์˜ˆ์ธก์—์„œ ๋‚˜ํƒ€๋‚˜์ง€ ์•Š๋Š” emergent structure์™€ topology๋ฅผ ๋ฐœ๊ฒฌํ•˜๊ณ  ์ƒํ˜ธ์ž‘์šฉ ๋„คํŠธ์›Œํฌ์˜ ๊ตฌ์กฐ์  ์ปค๋ฒ„๋ฆฌ์ง€๋ฅผ ๋Œ€ํญ ํ™•๋Œ€ํ•œ๋‹ค.

Motivation

Achievement

Figure 4

Figure 4: Compressibility of predicted complex space. (a) Clustering of 1,811,201 structures (1,754,242

๊ณ ์‹ ๋ขฐ ๋ณตํ•ฉ์ฒด ์˜ˆ์ธก: 4,777๊ฐœ ํ”„๋กœํ…Œ์˜ด์—์„œ 23,441,822๊ฐœ homomeric ๋ฐ 7,620,644๊ฐœ heteromeric ๋ณตํ•ฉ์ฒด๋ฅผ ์˜ˆ์ธกํ•˜์—ฌ 1,754,242๊ฐœ์˜ ๊ณ ์‹ ๋ขฐ ๋ณตํ•ฉ์ฒด๋ฅผ AFDB์— ์ถ”๊ฐ€. ์‹ ๋ขฐ๋„ ๋ฉ”ํŠธ๋ฆญ ๊ฒ€์ฆ: ์‹คํ—˜์  PDB ๊ตฌ์กฐ์™€์˜ ๋น„๊ต๋ฅผ ํ†ตํ•ด ipSAEmin, pLDDTavg, backbone clash์˜ ์กฐํ•ฉ์ด ๊ณ ์‹ ๋ขฐ ์˜ˆ์ธก์˜ ํšจ๊ณผ์ ์ธ ์ง€ํ‘œ์ž„์„ ํ™•์ธ. ๊ตฌ์กฐ์  ๊ฐ„๊ทน ํ•ด์†Œ: ๋Œ€๋ถ€๋ถ„์˜ ํ”„๋กœํ…Œ์˜ด์—์„œ PDB์˜ ์‹คํ—˜์  multimer ๊ตฌ์กฐ๋ณด๋‹ค 1~3 ์ž๋ฆฌ ์ˆ˜ ๋งŽ์€ ๊ณ ์‹ ๋ขฐ ๋ชจ๋ธ ์ œ๊ณต. emergent structure ๋ฐœ๊ฒฌ: ๋‹จ๋Ÿ‰์ฒด ์˜ˆ์ธก์—์„œ๋Š” ๋‚˜ํƒ€๋‚˜์ง€ ์•Š๋Š” oligomeric context์—์„œ์˜ ๊ตฌ์กฐ์  ํŠน์„ฑ ๊ทœ๋ช….

How

Figure 2

Figure 2: Homodimer analysis de๏ฌnes a high-con๏ฌdence prediction threshold and assesses structural

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 5/5 Significance: 5/5 Clarity: 5/5 Overall: 5/5

์ดํ‰: ์ด ์—ฐ๊ตฌ๋Š” AlphaFold-Multimer์˜ ๊ณ„์‚ฐ ๊ฐ€์†ํ™”๋ฅผ ํ†ตํ•ด ์ „๋ก€ ์—†๋Š” ๊ทœ๋ชจ์˜ ํ”„๋กœํ…Œ์˜ด ์ˆ˜์ค€ ๋ณตํ•ฉ์ฒด ๊ตฌ์กฐ ์˜ˆ์ธก์„ ์ˆ˜ํ–‰ํ•˜๊ณ , ์—„๋ฐ€ํ•œ ์‹ ๋ขฐ๋„ ๊ฒ€์ฆ์œผ๋กœ 1.8M์˜ ๊ณ ์‹ ๋ขฐ ๋ณตํ•ฉ์ฒด๋ฅผ AFDB์— ๊ณต๊ฐœํ•จ์œผ๋กœ์จ ๊ตฌ์กฐ ์ƒ๋ฌผํ•™์˜ ๋Œ€๊ทœ๋ชจ ์ž์› ๊ธฐ๋ฐ˜์„ ํ™•๋ฆฝํ•˜์˜€๋‹ค. ๋‹จ๋Ÿ‰์ฒด ์˜ˆ์ธก์˜ ํ•œ๊ณ„๋ฅผ ๋„˜์–ด emergent structure์™€ ์ƒํ˜ธ์ž‘์šฉ ๋ฉ”์ปค๋‹ˆ์ฆ˜์„ ๋ฐœ๊ฒฌํ•  ์ƒˆ๋กœ์šด ๊ธฐ์ดˆ๋ฅผ ์ œ๊ณตํ•˜๋ฉฐ, ๊ณต๊ณต ์ธํ”„๋ผ ํ†ตํ•ฉ์œผ๋กœ ์ƒ๋ฌผํ•™ ์ „๋ฐ˜์˜ AI ์—ฐ๊ตฌ์™€ ํ•จ์ˆ˜ ๋ฐœ๊ฒฌ์„ ๊ฐ€์†ํ™”ํ•  ๋งค์šฐ ๋†’์€ ๊ฐ€์น˜์˜ ์—ฐ๊ตฌ์ด๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
AlphaFold์˜ ๋‹จ๋ฐฑ์งˆ ๊ตฌ์กฐ ์˜ˆ์ธก ์ •ํ™•๋„ ํ‰๊ฐ€ ๋…ผ๋ฌธ์œผ๋กœ, AFDB ํ”„๋กœํ…Œ์˜ด ํ™•์žฅ์˜ ์‹ ๋ขฐ์„ฑ ๊ฒ€ํ† ์— ์ด๋ก ์  ๊ทผ๊ฑฐ๋ฅผ ์ œ๊ณตํ•œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
AlphaFold Database expands ๋…ผ๋ฌธ์€ AlphaFold3์—์„œ ์˜ˆ์ธกํ•˜๋Š” ๋‹ค์–‘ํ•œ ๋ณตํ•ฉ์ฒด ๊ตฌ์กฐ์— ๋Œ€ํ•œ ์‹ ๋ขฐ๋„, emergent structure ๊ฐœ๋…์˜ ์ด๋ก ์  ๊ทผ๊ฑฐ๋ฅผ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
์ƒ๋ช…์ •๋ณดํ•™์—์„œ ํŒŒ์šด๋ฐ์ด์…˜ ๋ชจ๋ธ์ด ๋‹จ๋ฐฑ์งˆ ๋“ฑ ์ƒ๋ฌผํ•™์  ๊ตฌ์กฐ ์˜ˆ์ธก์— ์–ด๋–ป๊ฒŒ ์“ฐ์ด๋Š”์ง€ ์ฒด๊ณ„์ ์œผ๋กœ ๊ฒ€ํ† ํ•ด์ค€๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
3019๋ฒˆ ๋…ผ๋ฌธ์€ AlphaFold ๊ณต๊ฐœ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ํ™•์žฅ ํ˜„ํ™ฉ์„ ๋‹ค๋ฃจ์–ด, 3007๋ฒˆ ๋…ผ๋ฌธ์˜ ํด๋“œ ์‹ ๋ขฐ์„ฑ ํ‰๊ฐ€ ์‹คํ—˜์— ๋ฐ์ดํ„ฐ ์†Œ์Šค๋กœ ํ™œ์šฉ๋ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
3019๋ฒˆ ๋…ผ๋ฌธ์€ AlphaFold DB์˜ ๊ตฌ์กฐ ๋ฐ์ดํ„ฐ ๋ฆฌ์†Œ์Šค๋ฅผ ์ œ๊ณตํ•˜์—ฌ, AF2BIND์˜ ์†Œ๋ถ„์ž ๊ฒฐํ•ฉ๋ถ€์œ„ ์˜ˆ์ธก ์ ์šฉ ๋ฐ์ดํ„ฐ ๊ธฐ๋ฐ˜์ด ๋ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
AlphaFold ๊ตฌ์กฐ DB ๋ฐ ์ฟผํ„ฐ๋„ˆ๋ฆฌ ๋ชจ๋ธ์˜ ์„ค๋ช…๋ ฅ์„ ๋…ผ์˜ํ•˜๋Š” ๋…ผ๋ฌธ์ด ConforNets์˜ conformation ์ œ์–ด ๊ฐœ๋…์— ์ด๋ก ์  ํ† ๋Œ€๋ฅผ ์ œ๊ณตํ•œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
AlphaFold์˜ ๊ตฌ์กฐ ์˜ˆ์ธก ์›๋ฆฌ์™€ ๋Œ€๊ทœ๋ชจ ๋ฐ์ดํ„ฐ์…‹์„ ์ œ๊ณตํ•˜๋ฏ€๋กœ, ์ž ์žฌ๊ณต๊ฐ„ ์กฐ์ž‘ ๊ธฐ๋ฐ˜ ๋‹จ๋ฐฑ์งˆ ๋™์—ญํ•™ ์—ฐ๊ตฌ์˜ ํ•ต์‹ฌ ๊ธฐ๋ฐ˜์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
์ด ๋…ผ๋ฌธ์€ ๋‹จ๋ฐฑ์งˆ ๋ณตํ•ฉ์ฒด ๊ตฌ์กฐ ์˜ˆ์ธก์— AlphaFold ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ํ™•์žฅ์„ ๋‹ค๋ฃจ์–ด, ๋‹จ๋ฐฑ์งˆ ์„ค๊ณ„์™€ ๊ฒฐํ•ฉ ๋ถ€์œ„ ์˜ˆ์ธก์˜ ๋‹ค์–‘ํ•œ ์ตœ์‹  ์ ‘๊ทผ๋ฒ•์„ ๋น„๊ตํ•ด๋ณผ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
์ƒ๋ฌผํ•™์  ํŒŒ์šด๋ฐ์ด์…˜ ๋ชจ๋ธ์ด ์˜ˆ์ธกํ•˜๋Š” ํ† ํด๋กœ์ง€, ๊ตฌ์กฐ์˜ ๊ธฐํ•˜์ ยท์œ„์ƒ์  ํŠน์„ฑ์„ ๋ถ„์„ํ•˜๋Š” ๋…ผ๋ฌธ์œผ๋กœ, ๋Œ€๊ทœ๋ชจ ๊ตฌ์กฐ ์˜ˆ์ธก๊ณผ ์—ฐ๊ณ„ํ•ด๋ณด๋ฉด ์œ ์šฉํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋‹จ๋ฐฑ์งˆ ๋ณตํ•ฉ์ฒด ์˜ˆ์ธก ๋Œ€์‹  de novo ํšจ์†Œ ์„ค๊ณ„์— ์ดˆ์ ์„ ๋งž์ถ˜ EnzyGen2๋Š” ์ƒ๋ฌผ ๊ตฌ์กฐ ์˜ˆ์ธก AI์˜ ๋‹ค์–‘ํ•œ ์ ์šฉ์„ ๋ณด์—ฌ์ค€๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
3019๋Š” ๋‹จ๋ฐฑ์งˆ ๊ตฌ์กฐ ์˜ˆ์ธก์„ ํ”„๋กœํ…Œ์˜ด ์Šค์ผ€์ผ๋กœ ํ™•์žฅํ•˜์—ฌ, 1060์˜ 3-track ์‹ ๊ฒฝ๋ง ๊ธฐ๋ฐ˜ ๊ตฌ์กฐ ์˜ˆ์ธก ๋ฐฉ๋ฒ•์„ ๋Œ€ํ˜• ๋ฐ์ดํ„ฐ์— ์ ์šฉ์‹œํ‚จ ์˜ˆ์ž…๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
AlphaFold์˜ ๊ธฐ๋ณธ ์›๋ฆฌ์™€ ์„ฑ๋Šฅ ์œ„์—, ๋‹จ๋ฐฑ์งˆ ๋ณตํ•ฉ์ฒด ์˜ˆ์ธก์œผ๋กœ ํ™•์žฅํ•œ ๋Œ€๊ทœ๋ชจ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ๊ตฌ์ถ• ์‚ฌ๋ก€์ž…๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
AlphaFold ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ํ™•๋Œ€ ์—ฐ๊ตฌ๋Š” ESMFold์˜ ๋Œ€๊ทœ๋ชจ ์˜ˆ์ธก๋œ ๋‹จ๋ฐฑ์งˆ ๊ตฌ์กฐ ๋ฐ์ดํ„ฐ์…‹๊ณผ ์ƒํ˜ธ๋ณด์™„์ ์ด๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
AlphaFold Database์˜ ๋‹จ๋ฐฑ์งˆ ๋ณตํ•ฉ์ฒด ์˜ˆ์ธก ์—ฐ๊ตฌ๊ฐ€ AlphaFold3 ๋ชจ๋ธ์˜ ํ™•์žฅ๋œ ๊ตฌ์กฐ ์˜ˆ์ธก ๋Šฅ๋ ฅ๊ณผ ์ง์ ‘์ ์œผ๋กœ ์—ฐ๊ฒฐ๋ฉ๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
AlphaFold Database ๊ด€๋ จ ๋…ผ๋ฌธ์€ ๋Œ€ํ˜• ์ƒ๋ฌผ์ •๋ณดํ•™ ํŒŒ์šด๋ฐ์ด์…˜ ๋ชจ๋ธ์˜ ์‹ค์ œ ์‘์šฉ ์‚ฌ๋ก€๋ฅผ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
AlphaFold Database๋Š” ๋Œ€๊ทœ๋ชจ ์ธ๊ฐ„ ๋‹จ๋ฐฑ์งˆ์˜ ๊ตฌ์กฐ ์ •๋ณด๋ฅผ ์ œ๊ณตํ•˜์—ฌ ๋‹จ๋ฐฑ์งˆ ํ˜„๋ฏธ๊ฒฝ ์ด๋ฏธ์ง€ ์ƒ์„ฑ ์—ฐ๊ตฌ์™€ ์ƒํ˜ธ ๋ณด์™„์ ์ž…๋‹ˆ๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •