Accurate prediction of protein structures and interactions using a three-track neural network

์ €์ž: Minkyung Baek, Frank DiMaio, Ivan Anishchenko, Justas Dauparas, Sergey Ovchinnikov, Gyu Rie Lee, Jue Wang, Qian Cong, Lisa N. Kinch, R. Dustin Schaeffer, Claudia Millรกn, Hahnbeom Park, Carson Adams, Caleb R. Glassman, Andy DeGiovanni, Jose H. Pereira, Andria V. Rodrigues, Alberdina A. Van Dijk, Ana C. Ebrecht, Diederik J. Opperman, Theo Sagmeister, Christoph Buhlheller, Tea Pavkov-Keller, Manoj K. Rathinaswamy, Udit Dalwadi, Calvin K. Yip, John E. Burke, K. Christopher Garcia, Nick V. Grishin, Paul D. Adams, Randy J. Read, David Baker | ๋‚ ์งœ: 2021-08-20 | DOI: 10.1126/science.abj8754 📄 PDF


Essence

Figure 1

Fig. 1. Network architecture and performance.

3-ํŠธ๋ž™ ์‹ ๊ฒฝ๋ง ์•„ํ‚คํ…์ฒ˜๋ฅผ ์ด์šฉํ•˜์—ฌ 1D ์„œ์—ด, 2D ๊ฑฐ๋ฆฌ ์ง€๋„, 3D ์ขŒํ‘œ ์ •๋ณด๋ฅผ ๋™์‹œ์— ์ฒ˜๋ฆฌํ•จ์œผ๋กœ์จ AlphaFold2์— ๊ทผ์ ‘ํ•œ ๋‹จ๋ฐฑ์งˆ ๊ตฌ์กฐ ์˜ˆ์ธก ์ •ํ™•๋„๋ฅผ ๋‹ฌ์„ฑํ•˜๊ณ  ๋‹จ๋ฐฑ์งˆ-๋‹จ๋ฐฑ์งˆ ๋ณตํ•ฉ์ฒด ๋ชจ๋ธ๋ง์„ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ–ˆ๋‹ค.

Motivation

Achievement

Figure 2

Fig. 2. Enabling experimental structure determination with RoseTTAFold.

How

Figure 1

Fig. 1. Network architecture and performance.

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 4/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: RoseTTAFold๋Š” AlphaFold2์˜ ํ•ต์‹ฌ ๊ฐœ๋…์„ 3-ํŠธ๋ž™ ์•„ํ‚คํ…์ฒ˜๋กœ ์ฐฝ์˜์ ์œผ๋กœ ์žฌ๊ตฌ์„ฑํ•˜์—ฌ ๊ฒฝ์Ÿ๋ ฅ ์žˆ๋Š” ์„ฑ๋Šฅ์„ ๋‹ฌ์„ฑํ–ˆ์œผ๋ฉฐ, ํŠนํžˆ ๊ณต๊ฐœ ๋ฐฉ์‹์œผ๋กœ ์ œ๊ณต๋จ์œผ๋กœ์จ ๋‹จ๋ฐฑ์งˆ ๊ตฌ์กฐ ์˜ˆ์ธก์˜ ๋ฏผ์ฃผํ™”์™€ ๊ตฌ์กฐ์ƒ๋ฌผํ•™ ์—ฐ๊ตฌ ๊ฐ€์†ํ™”์— ํฌ๊ฒŒ ๊ธฐ์—ฌํ•˜๋Š” ํš๊ธฐ์ ์ธ ์—ฐ๊ตฌ๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
๋‹จ๋ฐฑ์งˆ ๊ตฌ์กฐ ์˜ˆ์ธก์˜ ์ด๋ก ์ ยท๋ฐฉ๋ฒ•๋ก ์  ๊ธฐ๋ฐ˜์„ ์ œ๊ณตํ•˜๋Š” ์„ ํ–‰ ์—ฐ๊ตฌ์ด๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
383์˜ geometry-informed tokenization ํ”„๋ ˆ์ž„์›Œํฌ๋Š” ๋‹จ๋ฐฑ์งˆ-๋ฆฌ๊ฐ„๋“œ ์ƒํ˜ธ์ž‘์šฉ ๋ชจ๋ธ ํ•™์Šต์˜ ์ „์ฒ˜๋ฆฌ ๋ฐ ํ”ผ์ฒ˜ ์„ค๊ณ„์— ์ด๋ก ์  ๊ทผ๊ฐ„์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
AlphaFold2 ๊ธฐ๋ฐ˜ ๋‹จ๋ฐฑ์งˆ-๋ฆฌ๊ฐ„๋“œ ๋ณตํ•ฉ์ฒด ์˜ˆ์ธก ์•„ํ‚คํ…์ฒ˜ ๋ฐ ์ •ํ™•๋„ ํ–ฅ์ƒ ์—ฐ๊ตฌ๊ฐ€ BOS-Lig dataset ๊ตฌ์ถ• ๋ฐ ์ ์šฉ์˜ ๊ธฐ์ˆ ์  ๊ธฐ๋ฐ˜์ด ๋œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
๋‹จ๋ฐฑ์งˆ ๊ตฌ์กฐ ๋ถ„์„์˜ ๋ฐฉ๋ฒ•๋ก ์  ๊ธฐ๋ฐ˜์ด ๋˜๋Š” ๊ด€๋ จ ์—ฐ๊ตฌ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋‘ ๋…ผ๋ฌธ ๋ชจ๋‘ AlphaFold2(403) ๋ฐ ๊ทธ ๋ณ€ํ˜•(1060)์œผ๋กœ ๋‹จ๋ฐฑ์งˆ ๊ตฌ์กฐ ์˜ˆ์ธก์˜ ์ดˆ๊ณ ์ •ํ™•์„ฑ ๋‹ฌ์„ฑ์„ ๋‹ค๋ฃจ์ง€๋งŒ, ๋„คํŠธ์›Œํฌ ๊ตฌ์กฐ ๋ฐ multi-track ์‚ฌ์šฉ๋ฒ•์˜ ์ฐจ์ด๋ฅผ ๋น„๊ตํ•  ์ˆ˜ ์žˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋‹จ๋ฐฑ์งˆ ๊ตฌ์กฐ ์˜ˆ์ธก์„ ์œ„ํ•œ ๋”ฅ๋Ÿฌ๋‹ ๊ธฐ๋ฐ˜ ์ ‘๊ทผ๋ฒ•์œผ๋กœ ๋™์ผํ•œ ๋ฌธ์ œ๋ฅผ ๋‹ค๋ฅธ ๋ฐฉ์‹์œผ๋กœ ํ•ด๊ฒฐํ•œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋‹จ๋ฐฑ์งˆ-๋ฆฌ๊ฐ„๋“œ ์ƒํ˜ธ์ž‘์šฉ ๋ชจ๋ธ๋ง์„ ์œ„ํ•œ ๋‹ค๋ฅธ ๋„ํ‚น ์ ‘๊ทผ๋ฒ•์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋‹จ๋ฐฑ์งˆ ๊ตฌ์กฐ ์˜ˆ์ธก ๋ฐ ๋ณตํ•ฉ์ฒด ๋ชจ๋ธ๋ง ๋ฌธ์ œ๋ฅผ ๋‹ค๋ฅธ ์ ‘๊ทผ๋ฒ•์œผ๋กœ ํ•ด๊ฒฐํ•œ ์—ฐ๊ตฌ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋‹จ๋ฐฑ์งˆ ๊ธฐ๋Šฅ ์˜ˆ์ธก ๋ชจ๋ธ์˜ ์„ฑ๋Šฅ์„ ๋‹ค๋ฅธ ๋ฐฉ์‹์œผ๋กœ ํ‰๊ฐ€ํ•˜๋Š” ์—ฐ๊ตฌ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
๋‹จ๋ฐฑ์งˆ ๊ตฌ์กฐ์˜ ๊ธฐ๋Šฅ์  ๋‹จ์œ„ ๋ถ„ํ•  ๋ฐ ํ‘œํ˜„ ํ•™์Šต์— ๋Œ€ํ•œ ๋Œ€์•ˆ์  ์ ‘๊ทผ๋ฒ•์„ ์ œ์‹œํ•˜๋Š” ์—ฐ๊ตฌ์ด๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
์ •ํ™•ํ•œ ๋‹จ๋ฐฑ์งˆ-๋ฆฌ๊ฐ„๋“œ ๋ณตํ•ฉ์ฒด ์˜ˆ์ธก ํ›„ ํ•ด๋‹น ๋ฆฌ๊ฐ„๋“œ์˜ ์ „ํ•˜ ์˜ˆ์ธก ๋ฐ ๊ธฐ๋Šฅ์  ๋ถ„๋ฅ˜(BOS-Lig Dataset)๋กœ ์—ฐ๊ณ„๋˜๋Š” ๋ถ„์„ ํŒŒ์ดํ”„๋ผ์ธ์„ ํ™•์ธํ•  ์ˆ˜ ์žˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
๋ถ„์ž๊ฒฐ์ •๊ตฌ์กฐ ์˜ˆ์ธก์„ ๋ชฉํ‘œ๋กœ ํ•˜๋Š” Flow ๊ธฐ๋ฐ˜ ์ƒ์„ฑ ๋ชจ๋ธ(3173)์€ ๋‹จ๋ฐฑ์งˆ ๊ตฌ์กฐ ์˜ˆ์ธก์˜ ์ตœ์‹  ์ƒ์„ฑ AI ํ๋ฆ„๊ณผ ๋งž๋‹ฟ์•„ ์žˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
3019๋Š” ๋‹จ๋ฐฑ์งˆ ๊ตฌ์กฐ ์˜ˆ์ธก์„ ํ”„๋กœํ…Œ์˜ด ์Šค์ผ€์ผ๋กœ ํ™•์žฅํ•˜์—ฌ, 1060์˜ 3-track ์‹ ๊ฒฝ๋ง ๊ธฐ๋ฐ˜ ๊ตฌ์กฐ ์˜ˆ์ธก ๋ฐฉ๋ฒ•์„ ๋Œ€ํ˜• ๋ฐ์ดํ„ฐ์— ์ ์šฉ์‹œํ‚จ ์˜ˆ์ž…๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
์ƒ์ฒด๋ถ„์ž์˜ ์ „์›์ž ๋ถ„ํฌ ํ•™์Šต๊ณผ ๋‹จ๋ฐฑ์งˆ ๊ตฌ์กฐ์˜ˆ์ธก์˜ ์ตœ์ฒจ๋‹จ ๋ชจ๋ธ์ด ์‹ค์ œ ์ƒํ˜ธ๋ณด์™„์  ๋ฐฉ๋ฒ•์ž„์„ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
RoseTTAFold์˜ ๋ฐฉ๋ฒ•๋ก ์„ ํ™•์žฅํ•˜๊ฑฐ๋‚˜ ์‘์šฉํ•œ ํ›„์† ์—ฐ๊ตฌ์ด๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •