Highly accurate protein structure prediction with AlphaFold

์ €์ž: John Jumper, Richard Evans, Alexander Pritzel, Tim Green, Michael Figurnov, Olaf Ronneberger, Kathryn Tunyasuvunakool, Russ Bates, Augustin ลฝรญdek, Anna Potapenko, Alex Bridgland, Clemens Meyer, Simon A. A. Kohl, Andrew J. Ballard, Andrew Cowie, Bernardino Romera-Paredes, Stanislav Nikolov, Rishub Jain, Jonas Adler, Trevor Back, Stig Petersen, David Reiman, Ellen Clancy, Michal Zielinski, Martin Steinegger, Michalina Pacholska, Tamas Berghammer, Sebastian Bodenstein, David Silver, Oriol Vinyals, Andrew W. Senior, Koray Kavukcuoglu, Pushmeet Kohli, Demis Hassabis | ๋‚ ์งœ: 2021-08-26 | DOI: 10.1038/s41586-021-03819-2 📄 PDF


Essence

AlphaFold๋Š” ์•„๋ฏธ๋…ธ์‚ฐ ์„œ์—ด๋งŒ์œผ๋กœ ๋‹จ๋ฐฑ์งˆ์˜ 3์ฐจ์› ๊ตฌ์กฐ๋ฅผ ์›์ž ์ˆ˜์ค€์˜ ์ •ํ™•๋„๋กœ ์˜ˆ์ธกํ•˜๋Š” ๋”ฅ๋Ÿฌ๋‹ ๋ชจ๋ธ๋กœ, 50๋…„ ์ด์ƒ์˜ ๋‹จ๋ฐฑ์งˆ ํด๋”ฉ ๋ฌธ์ œ๋ฅผ ๊ทผ๋ณธ์ ์œผ๋กœ ํ•ด๊ฒฐํ•œ ํš๊ธฐ์ ์ธ ์„ฑ๊ณผ์ด๋‹ค.

Motivation

Achievement

Figure 1

AlphaFold๊ฐ€ ์ƒ์„ฑํ•œ ๊ณ ์ •ํ™•๋„ ๊ตฌ์กฐ: (a) CASP14 ๋ฐ์ดํ„ฐ์…‹์—์„œ ๋‹ค๋ฅธ ์ƒ์œ„ 15๊ฐœ ๋ฐฉ๋ฒ•๊ณผ์˜ ์„ฑ๋Šฅ ๋น„๊ต, (b-d) ์ •ํ™•ํ•œ ๋ฐฑ๋ณธ ๋ฐ ์‚ฌ์ด๋“œ ์ฒด์ธ ์˜ˆ์ธก, ํŠนํžˆ ํฐ ๋‹จ๋ฐฑ์งˆ์˜ ๋„๋ฉ”์ธ ํŒจํ‚น ์ •ํ™•๋„ ์‹œ์—ฐ

  1. CASP14 ๋ฒค์น˜๋งˆํฌ์—์„œ์˜ ์••๋„์  ์„ฑ๋Šฅ:
    • ๋ฐฑ๋ณธ ์ •ํ™•๋„(Cฮฑ r.m.s.d.โ‚‰โ‚…): ์ค‘์•™๊ฐ’ 0.96 ร… (์‹ ๋ขฐ ๊ตฌ๊ฐ„ 0.85โ€“1.16 ร…)
    • ์ฐจ์ƒ์œ„ ๋ฐฉ๋ฒ•: 2.8 ร… (2.7โ€“4.0 ร…) โ€” ์•ฝ 3๋ฐฐ ํ–ฅ์ƒ
    • ์ „์ฒด ์›์ž ์ •ํ™•๋„: 1.5 ร… r.m.s.d.โ‚‰โ‚… vs. ์ฐจ์ƒ์œ„ 3.5 ร…
    • ํƒ„์†Œ ์›์ž ํญ(~1.4 ร…)๊ณผ ๋น„์Šทํ•œ ์ˆ˜์ค€์˜ ์ •๋ฐ€๋„ ๋‹ฌ์„ฑ
  2. ์ผ๋ฐ˜ํ™” ๋ฐ ์‹ ๋ขฐ๋„:
    • ์ตœ๊ทผ PDB์— ๋“ฑ๋ก๋œ ๊ตฌ์กฐ(ํ•™์Šต ๋ฐ์ดํ„ฐ ์ปท์˜คํ”„ ์ดํ›„)์—์„œ๋„ ๋†’์€ ์ •ํ™•๋„ ์œ ์ง€ (Figure 2)
    • ์˜ˆ์ธก๋œ ๊ตญ์†Œ ๊ฑฐ๋ฆฌ ์ฐจ์ด ๊ฒ€์‚ฌ(pLDDT) ์ง€ํ‘œ๊ฐ€ ์‹ค์ œ ์ •ํ™•๋„๋ฅผ ์‹ ๋ขฐํ•  ์ˆ˜ ์žˆ๊ฒŒ ์˜ˆ์ธก
    • 2,180๊ฐœ ์ž”๊ธฐ ๋‹จ๋ฐฑ์งˆ์˜ ์ •ํ™•ํ•œ ๋„๋ฉ”์ธ ํŒจํ‚น ์˜ˆ์ธก ๊ฐ€๋Šฅ

How

Figure 3

Evoformer ๋ธ”๋ก์˜ ๊ตฌ์กฐ: MSA ํ‘œํ˜„๊ณผ ํŽ˜์–ด ํ‘œํ˜„ ๊ฐ„์˜ ์ •๋ณด ํ๋ฆ„์„ ๋ณด์—ฌ์ฃผ๋Š” ๊ทธ๋ž˜ํ”„ ์ถ”๋ก  ํ”„๋ ˆ์ž„์›Œํฌ

๋„คํŠธ์›Œํฌ ์•„ํ‚คํ…์ฒ˜์˜ ํ•ต์‹ฌ ํ˜์‹ :

Originality

Limitation & Further Study

Evaluation

Novelty: 5/5 Technical Soundness: 5/5 Significance: 5/5 Clarity: 4/5 Overall: 5/5

์ดํ‰: ์ด ๋…ผ๋ฌธ์€ 50๋…„ ์ด์ƒ ๋ฏธํ•ด๊ฒฐ๋œ ๋‹จ๋ฐฑ์งˆ ํด๋”ฉ ๋ฌธ์ œ๋ฅผ ๋”ฅ๋Ÿฌ๋‹์œผ๋กœ ๊ฑฐ์˜ ์™„์ „ํžˆ ํ•ด๊ฒฐํ•œ ์—ญ์‚ฌ์  ์„ฑ๊ณผ๋กœ, ์ง„ํ™”์  ์ •๋ณด์™€ ๊ธฐํ•˜ํ•™์  ์ œ์•ฝ์„ ์ฐฝ์˜์ ์œผ๋กœ ํ†ตํ•ฉํ•œ ํ˜์‹ ์  ์•„ํ‚คํ…์ฒ˜๋ฅผ ์ œ์‹œํ•˜๋ฉฐ, ๊ตฌ์กฐ ์ƒ๋ฌผํ•™๊ณผ ์ƒ์˜ํ•™ ์—ฐ๊ตฌ์— ํŒจ๋Ÿฌ๋‹ค์ž„ ์ „ํ™˜์„ ๊ฐ€์ ธ์™”๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
AlphaFold ๋…ผ๋ฌธ์€ ๋‹จ๋ฐฑ์งˆ 3์ฐจ์› ๊ตฌ์กฐ ์˜ˆ์ธก์˜ ๋”ฅ๋Ÿฌ๋‹ ๊ธฐ๋ฐ˜ ์ด๋ก ์  ํ† ๋Œ€๋ฅผ ์ œ๊ณตํ•˜์—ฌ, SARS-CoV-2์™€ ๊ฐ™์€ ๋ฐ”์ด๋Ÿฌ์Šค ๋‹จ๋ฐฑ์งˆ ์‹œ๋ฎฌ๋ ˆ์ด์…˜์„ ๋” ํšจ์œจ์ ์œผ๋กœ ์ˆ˜ํ–‰ํ•˜๋Š” ๊ธฐ๋ฐ˜์„ ๋งˆ๋ จํ•˜๊ธฐ ๋•Œ๋ฌธ์ž…๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
AlphaFold3์˜ ์ตœ์‹  ์ ‘๊ทผ์€ AlphaFold์˜ ํ˜์‹ ์  ๋‹จ๋ฐฑ์งˆ 3์ฐจ์› ๊ตฌ์กฐ ์˜ˆ์ธก ๊ธฐ๋ฒ•์„ ์ง์ ‘์ ์œผ๋กœ ๊ธฐ๋ฐ˜ ์‚ผ๊ณ  ์žˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
AlphaFold ๊ธฐ๋ฐ˜ ๊ตฌ์กฐ ์˜ˆ์ธก ๋ฐฉ๋ฒ•, ๊ตฌ์กฐ-๊ธฐ๊ณ„ ์„ค๊ณ„์— ์‘์šฉ๋  ์ˆ˜ ์žˆ๋Š” ๊ตฌ์กฐ์  ํ•™์Šต ์›๋ฆฌ๋ฅผ ์ฒด๊ณ„ํ™”ํ•จ.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
719๋Š” LLM ๊ธฐ๋ฐ˜ ๊ณผํ•™์  ๊ฐ€์„ค ํƒ์ƒ‰์˜ ์›๋ฆฌ๋ฅผ ์ œ์‹œํ•˜์—ฌ AlphaFold์˜ ํ˜์‹ ์  ๋ฐœ๊ฒฌ์—์„œ ๋‚˜ํƒ€๋‚œ AI-์ฃผ๋„ ๊ณผํ•™๋ฐœ๊ฒฌ ํ”„๋ ˆ์ž„์›Œํฌ์˜ ์ด๋ก ์  ํ† ๋Œ€๋ฅผ ์ œ๊ณตํ•œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
AlphaFold ๋…ผ๋ฌธ์€ AlphaFold3 ์œ ์‚ฌ ์•„ํ‚คํ…์ฒ˜ ๊ธฐ๋ฐ˜์˜ ๊ตฌ์กฐ ์ƒ์„ฑ๊ณผ ํ•™์Šต ์ด๋ก ์„ ์ œ๊ณตํ•˜์—ฌ, all-atom generative foundation model ์„ค๊ณ„์— ํ•ต์‹ฌ์ด ๋ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
AlphaFold์˜ ๋‹จ๋ฐฑ์งˆ ๊ตฌ์กฐ ์˜ˆ์ธก ์ •ํ™•๋„ ํ‰๊ฐ€ ๋…ผ๋ฌธ์œผ๋กœ, AFDB ํ”„๋กœํ…Œ์˜ด ํ™•์žฅ์˜ ์‹ ๋ขฐ์„ฑ ๊ฒ€ํ† ์— ์ด๋ก ์  ๊ทผ๊ฑฐ๋ฅผ ์ œ๊ณตํ•œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
403 ๋…ผ๋ฌธ์€ AlphaFold์˜ ๊ตฌ์กฐ ์˜ˆ์ธก์ด ๊ณผํ•™ ์—ฐ๊ตฌ์— ๋ฏธ์นœ ์˜ํ–ฅ์„ ์ด๋ก ์ /๊ธฐ์ˆ ์ ์œผ๋กœ ๋’ท๋ฐ›์นจํ•ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
AlphaFold ๋…ผ๋ฌธ์€ AlphaFold2์™€ ๊ฐ™์€ AI๊ฐ€ ๋‹จ๋ฐฑ์งˆ ๊ตฌ์กฐ ์˜ˆ์ธก์— ๋ฏธ์นœ ์˜ํ–ฅ์˜ ์ด๋ก ์ /๊ธฐ์ˆ ์  ๊ธฐ๋ฐ˜์œผ๋กœ, 3015์˜ AlphaFold2 ๊ธฐ์—ฌ ๋ถ„์„์— ํ•„์ˆ˜์ ์ž…๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
AlphaFold ๋…ผ๋ฌธ์€ ๋‹จ๋ฐฑ์งˆ ๊ตฌ์กฐ ์˜ˆ์ธก์˜ ๊ทผ๊ฐ„์ด ๋˜๋Š” ์•„ํ‚คํ…์ฒ˜์™€ ํ•™์Šต ๋ฐฉ๋ฒ•๋ก ์„ ์„ค๋ช…ํ•˜์—ฌ, AlphaFold 3์˜ ๋‚ด๋ถ€ ํ‘œํ˜„ ๋ฐ ํ•œ๊ณ„ ๋ถ„์„์˜ ์ด๋ก ์  ๋ฐฐ๊ฒฝ์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
AlphaFold ๋…ผ๋ฌธ์€ ๋ฌด์ž‘์œ„ ๋‹จ๋ฐฑ์งˆ ์„œ์—ด์—์„œ์˜ ๊ตฌ์กฐ ๋‹ค์–‘์„ฑ ์˜ˆ์ธก์ด ๊ฐ€๋Šฅํ•จ์„ ๋ณด์—ฌ์ฃผ๋ฉฐ, ML์ด ์ž์—ฐ ๋‹จ๋ฐฑ์งˆ๊ณผ ํ•ฉ์„ฑ ๋‹จ๋ฐฑ์งˆ์˜ ๊ตฌ์กฐ์  ์ด์งˆ์„ฑ ํ•™์Šต์— ์–ด๋–ป๊ฒŒ ๊ธฐ์—ฌํ•  ์ˆ˜ ์žˆ๋Š”์ง€ ์„ค๋ช…ํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
AlphaFold ๋ฐ ESMFold์™€ ๊ฐ™์€ ์ฒจ๋‹จ ๋ชจ๋ธ์„ ๋น„๊ต ์ ์šฉํ•˜์—ฌ ๋‹จ๋ฐฑ์งˆ ๊ตฌ์กฐ ๋ฐ ์ƒํ˜ธ์ž‘์šฉ ์˜ˆ์ธก์˜ ์ƒ๋Œ€์  ๊ฐ•์ ์„ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
403 ๋…ผ๋ฌธ์€ AlphaFold๋กœ ๋Œ€๋Ÿ‰์˜ ๋‹จ๋ฐฑ์งˆ ๊ตฌ์กฐ ์˜ˆ์ธก์„ ์‹œ์—ฐํ•˜์—ฌ, 2196์˜ ESMFold ๊ธฐ๋ฐ˜ ์ง„ํ™”์  ์–ธ์–ด๋ชจ๋ธ๊ณผ ์ง์ ‘์  ๋น„๊ต ๋ถ„์„์ด ๊ฐ€๋Šฅํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
์–‘์ž์—ญํ•™ ๋ฐ ๋ณต์žก๊ณ„ ๋ฌผ๋ฆฌ ์˜์—ญ์˜ ์ฒจ๋‹จ AI ํ™œ์šฉ ํ˜„ํ™ฉ์„ ๋‹ค๋ฃจ๋ฉฐ, ์นด์ด๋ž„ ์Šคํ•€ ๋Œ€์นญ์„ฑ์„ ํฌํ•จํ•œ ์—ด์—ญํ•™ ๋ถ„๊ณผ ์—ฐ๊ตฌ์˜ ํ˜„๋Œ€์  ์ ‘๊ทผ์„ ์†Œ๊ฐœํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
Foundation models in bioinformatics๋Š” ์•ฝ๋ฌผ, ๋‹จ๋ฐฑ์งˆ, ์œ ์ „์ฒด ๋“ฑ ๋‹ค์–‘ํ•œ bio ํŒŒ์šด๋ฐ์ด์…˜ ๋ชจ๋ธ์„ ์ข…ํ•ฉ์ ์œผ๋กœ ๋น„๊ตํ•˜์—ฌ AlphaFold ๊ธฐ๋ฐ˜/๋น„๊ต ๋ชจ๋ธ์„ ํ•œ๋ˆˆ์— ์กฐ๋งํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
AlphaFold3๋Š” AlphaFold ๊ธฐ๋ฐ˜์„ ๋ฐ”ํƒ•์œผ๋กœ ์†Œ๋ถ„์ž, ํ•ต์‚ฐ ๋“ฑ ๋‹ค์–‘ํ•œ ์ƒ์ฒด๋ถ„์ž ๋ณตํ•ฉ์ฒด ์˜ˆ์ธก๊นŒ์ง€ ํ™•์žฅยทํ†ตํ•ฉํ•œ ์ตœ์‹  ์—ฐ๊ตฌ์ด๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
AlphaFold์˜ ๊ธฐ๋ณธ ์›๋ฆฌ์™€ ์„ฑ๋Šฅ ์œ„์—, ๋‹จ๋ฐฑ์งˆ ๋ณตํ•ฉ์ฒด ์˜ˆ์ธก์œผ๋กœ ํ™•์žฅํ•œ ๋Œ€๊ทœ๋ชจ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ๊ตฌ์ถ• ์‚ฌ๋ก€์ž…๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
403(Highly accurate protein structure prediction with AlphaFold)์€ ๋ถ„์ž๊ตฌ์กฐ ์˜ˆ์ธก์˜ ๊ธฐ์ค€์ด ๋˜๋ฉฐ, ChemFlow์˜ ๊ณ„์ธต์  ๋ถ„์žํ‘œํ˜„์ด ๋‹จ๋ฐฑ์งˆ ์˜ˆ์ธก ๋“ฑ ๋‹ค์–‘ํ•œ ํ™”ํ•™ ์‹œ์Šคํ…œ์—์˜ ์‘์šฉ ํ™•์žฅ ๊ฐ€๋Šฅ์„ฑ์„ ์‹œ์‚ฌํ•ฉ๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
AlphaFold ์‹œ๋ฆฌ์ฆˆ์˜ ๋‚ด๋ถ€ ํ‘œํ˜„ ๋ฐ ๊ตฌ์กฐ ์˜ˆ์ธก ๋ฉ”์ปค๋‹ˆ์ฆ˜์— ๋Œ€ํ•œ ๋ถ„์„ ์—ฐ๊ตฌ๋กœ, AlphaFold์˜ ๊ตฌ์กฐ์  ๊ฐ•์ ์„ ์‹ฌ์ธต์ ์œผ๋กœ ์ดํ•ดํ•˜๋Š” ๋ฐ ๋„์›€์ด ๋œ๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
Highly accurate protein structure prediction with AlphaFold(403)๋Š” ์žฌ๋ฃŒ๊ณผํ•™์—์„œ LLM-๊ธฐ๋ฐ˜ ๊ตฌ์กฐ์˜ˆ์ธก ์‹ค์‚ฌ๋ก€๋กœ, 465์—์„œ ์ œ์•ˆํ•œ ์—ญํ• ๋ณ„ LLM ํŒจ๋Ÿฌ๋‹ค์ž„์˜ ์‹ค์ œ ์ ์šฉ ์˜ˆ๋ฅผ ์ œ๊ณตํ•œ๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •