Artificial intelligence advances for de novo molecular structure modeling in cryo-electron microscopy
Corresponding Author
Dong Si
Division of Computing and Software Systems, University of Washington Bothell, Bothell, Washington, USA
Correspondence
Dong Si, Division of Computing and Software Systems, University of Washington Bothell, Bothell, WA 98011, USA.
Email: [email protected]
Contribution: Conceptualization, Data curation, Funding acquisition, Investigation, Resources, Supervision, Visualization, Writing - original draft, Writing - review & editing
Search for more papers by this authorAndrew Nakamura
Division of Computing and Software Systems, University of Washington Bothell, Bothell, Washington, USA
Contribution: Data curation, Formal analysis, Investigation, Methodology, Validation, Visualization, Writing - review & editing
Search for more papers by this authorRunbang Tang
Molecular Engineering and Sciences Institute, University of Washington Seattle, Seattle, Washington, USA
Contribution: Data curation, Formal analysis, Investigation, Validation, Writing - review & editing
Search for more papers by this authorHaowen Guan
Applied and Computational Math Sciences, University of Washington Seattle, Seattle, Washington, USA
Contribution: Data curation, Investigation, Methodology, Validation, Writing - review & editing
Search for more papers by this authorJie Hou
Department of Computer Science, Saint Louis University, Saint Louis, Missouri, USA
Contribution: Investigation, Methodology, Visualization, Writing - review & editing
Search for more papers by this authorAmmaar Firozi
Department of Computer Science, Saint Louis University, Saint Louis, Missouri, USA
Contribution: Investigation, Methodology, Writing - review & editing
Search for more papers by this authorRenzhi Cao
Department of Computer Science, Pacific Lutheran University, Tacoma, Washington, USA
Contribution: Investigation, Methodology, Writing - review & editing
Search for more papers by this authorKyle Hippe
Department of Computer Science, Pacific Lutheran University, Tacoma, Washington, USA
Contribution: Investigation, Writing - review & editing
Search for more papers by this authorMinglei Zhao
Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, Illinois, USA
Contribution: Investigation, Writing - review & editing
Search for more papers by this authorCorresponding Author
Dong Si
Division of Computing and Software Systems, University of Washington Bothell, Bothell, Washington, USA
Correspondence
Dong Si, Division of Computing and Software Systems, University of Washington Bothell, Bothell, WA 98011, USA.
Email: [email protected]
Contribution: Conceptualization, Data curation, Funding acquisition, Investigation, Resources, Supervision, Visualization, Writing - original draft, Writing - review & editing
Search for more papers by this authorAndrew Nakamura
Division of Computing and Software Systems, University of Washington Bothell, Bothell, Washington, USA
Contribution: Data curation, Formal analysis, Investigation, Methodology, Validation, Visualization, Writing - review & editing
Search for more papers by this authorRunbang Tang
Molecular Engineering and Sciences Institute, University of Washington Seattle, Seattle, Washington, USA
Contribution: Data curation, Formal analysis, Investigation, Validation, Writing - review & editing
Search for more papers by this authorHaowen Guan
Applied and Computational Math Sciences, University of Washington Seattle, Seattle, Washington, USA
Contribution: Data curation, Investigation, Methodology, Validation, Writing - review & editing
Search for more papers by this authorJie Hou
Department of Computer Science, Saint Louis University, Saint Louis, Missouri, USA
Contribution: Investigation, Methodology, Visualization, Writing - review & editing
Search for more papers by this authorAmmaar Firozi
Department of Computer Science, Saint Louis University, Saint Louis, Missouri, USA
Contribution: Investigation, Methodology, Writing - review & editing
Search for more papers by this authorRenzhi Cao
Department of Computer Science, Pacific Lutheran University, Tacoma, Washington, USA
Contribution: Investigation, Methodology, Writing - review & editing
Search for more papers by this authorKyle Hippe
Department of Computer Science, Pacific Lutheran University, Tacoma, Washington, USA
Contribution: Investigation, Writing - review & editing
Search for more papers by this authorMinglei Zhao
Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, Illinois, USA
Contribution: Investigation, Writing - review & editing
Search for more papers by this authorEdited by: Peter R. Schreiner, Editor-in-Chief
Funding information: National Science Foundation, Grant/Award Number: 2030381
Abstract
Cryo-electron microscopy (cryo-EM) has become a major experimental technique to determine the structures of large protein complexes and molecular assemblies, as evidenced by the 2017 Nobel Prize. Although cryo-EM has been drastically improved to generate high-resolution three-dimensional maps that contain detailed structural information about macromolecules, the computational methods for using the data to automatically build structure models are lagging far behind. The traditional cryo-EM model building approach is template-based homology modeling. Manual de novo modeling is very time-consuming when no template model is found in the database. In recent years, de novo cryo-EM modeling using machine learning (ML) and deep learning (DL) has ranked among the top-performing methods in macromolecular structure modeling. DL-based de novo cryo-EM modeling is an important application of artificial intelligence, with impressive results and great potential for the next generation of molecular biomedicine. Accordingly, we systematically review the representative ML/DL-based de novo cryo-EM modeling methods. Their significances are discussed from both practical and methodological viewpoints. We also briefly describe the background of cryo-EM data processing workflow. Overall, this review provides an introductory guide to modern research on artificial intelligence for de novo molecular structure modeling and future directions in this emerging field.
This article is categorized under:
- Structure and Mechanism > Molecular Structures
- Structure and Mechanism > Computational Biochemistry and Biophysics
- Data Science > Artificial Intelligence/Machine Learning
Graphical Abstract
CONFLICT OF INTEREST
The authors have declared no conflicts of interest for this article.
Open Research
DATA AVAILABILITY STATEMENT
Data sharing is not applicable to this article as no new data were created or analyzed in this study.
REFERENCES
- 1Murata K, Wolf M. Cryo-electron microscopy for structural analysis of dynamic biological macromolecules. Biochim Biophys Acta Gen Subj. 2018; 1862: 324–34.
- 2Nakane T, Kotecha A, Sente A, McMullan G, Masiulis S, Brown PMGE, et al. Single-particle cryo-EM at atomic resolution. Nature. 2020; 587: 152–6.
- 3Yip KM, Fischer N, Paknia E, Chari A, Stark H. Atomic-resolution protein structure determination by cryo-EM. Nature. 2020; 587: 157–61.
- 4Henderson R. Avoiding the pitfalls of single particle cryo-electron microscopy: Einstein from noise. Proc Natl Acad Sci U S A. 2013; 110: 18037–41.
- 5Berman H, Henrick K, Nakamura H, Markley JL. The worldwide Protein Data Bank (wwPDB): ensuring a single, uniform archive of PDB data. Nucleic Acids Res. 2007; 35(Database issue): D301–3.
- 6Callaway E. Revolutionary cryo-EM is taking over structural biology. Nature. 2020; 578: 201–1.
- 7Patwardhan A. Trends in the Electron Microscopy Data Bank (EMDB). Acta Crystallogr D Struct Biol. 2017; 73(Pt 6): 503–8.
- 8Rappazzo CG, Tse LV, Kaku CI, Wrapp D, Sakharkar M, Huang D, et al. Broad and potent activity against SARS-like viruses by an engineered human monoclonal antibody. Science. 2021; 371: 823–9.
- 9Kokic G, Hillen HS, Tegunov D, Dienemann C, Seitz F, Schmitzova J, et al. Mechanism of SARS-CoV-2 polymerase stalling by remdesivir. Nat Commun. 2021; 12: 279.
- 10Zhou T, Tsybovsky Y, Gorman J, Rapp M, Cerutti G, Chuang G-Y, et al. Cryo-EM structures of SARS-CoV-2 spike without and with ACE2 reveal a pH-dependent switch to mediate endosomal positioning of receptor-binding domains. Cell Host Microbe. 2020; 28: 867–879.e5.
- 11Lv Z, Deng Y-Q, Ye Q, Cao L, Sun C-Y, Fan C, et al. Structural basis for neutralization of SARS-CoV-2 and SARS-CoV by a potent therapeutic antibody. Science. 2020; 369: 1505–9.
- 12Yao H, Sun Y, Deng Y-Q, Wang N, Tan Y, Zhang N-N, et al. Rational development of a human antibody cocktail that deploys multiple functions to confer Pan-SARS-CoVs protection. Cell Res. 2021; 31: 25–36.
- 13Wang N, Sun Y, Feng R, Wang Y, Guo Y, Zhang L, et al. Structure-based development of human antibody cocktails against SARS-CoV-2. Cell Res. 2021; 31: 101–3.
- 14Xu C, Wang Y, Liu C, Zhang C, Han W, Hong X, et al. Conformational dynamics of SARS-CoV-2 trimeric spike glycoprotein in complex with receptor ACE2 revealed by cryo-EM. Sci Adv. 2021; 7:eabe5575.
- 15Turoňová B, Sikora M, Schürmann C, Hagen WJH, Welsch S, Blanc FEC, et al. In situ structural analysis of SARS-CoV-2 spike reveals flexibility mediated by three hinges. Science. 2020; 370: 203–8.
- 16Liu H, Wu NC, Yuan M, Bangaru S, Torres JL, Caniels TG, et al. Cross-neutralization of a SARS-CoV-2 antibody to a functionally conserved site is mediated by avidity. Immunity. 2020; 53: 1272–1280.e5.
- 17Zhang C, Wang Y, Zhu Y, Liu C, Gu C, Xu S, et al. Development and structural basis of a two-MAb cocktail for treating SARS-CoV-2 infections. Nat Commun. 2021; 12: 264.
- 18Guo L, Bi W, Wang X, Xu W, Yan R, Zhang Y, et al. Engineered trimeric ACE2 binds viral spike protein and locks it in “Three-up” conformation to potently inhibit SARS-CoV-2 infection. Cell Res. 2021; 31: 98–100.
- 19Pfab J, Phan NM, Si D. DeepTracer for fast de novo cryo-EM protein structure modeling and special studies on CoV-related complexes. Proc Natl Acad Sci U S A. 2021; 118:e2017525118.
- 20 Press Release: The Nobel Prize in Chemistry 2017. NobelPrize.org.Nobel Media AB. 2021.
- 21Merk A, Bartesaghi A, Banerjee S, Falconieri V, Rao P, Davis MI, et al. Breaking cryo-EM resolution barriers to facilitate drug discovery. Cell. 2016; 165: 1698–707.
- 22Lawson CL, Patwardhan A, Baker ML, Hryc C, Garcia ES, Hudson BP, et al. EMDataBank unified data resource for 3DEM. Nucleic Acids Res. 2016; 44(D1): D396–403.
- 23Si D, Ji S, Nasr KA, He J. A machine learning approach for the identification of protein secondary structure elements from electron cryo-microscopy density maps. Biopolymers. 2012; 97: 698–708.
- 24Maddhuri Venkata Subramaniya SR, Terashi G, Kihara D. Protein secondary structure detection in intermediate-resolution cryo-EM maps using deep learning. Nat Methods. 2019; 16: 911–7.
- 25Ma L, Reisert M, Burkhardt H. RENNSH: a novel α-helix identification approach for intermediate resolution electron density maps. IEEE/ACM Trans Comput Biol Bioinform. 2012; 9: 228–39.
- 26Jiang W, Baker ML, Ludtke SJ, Chiu W. Bridging the information gap: computational tools for intermediate resolution structure interpretation. J Mol Biol. 2001; 308: 1033–44.
- 27Li R, Si D, Zeng T, Ji S, He J. Deep convolutional neural networks for detecting secondary structures in protein density maps from cryo-electron microscopy. In: 2016 IEEE international conference on bioinformatics and biomedicine (BIBM); 2016. p. 41–46.
- 28Li P-N, de Oliveira SHP, Wakatsuki S, van den Bedem H. Sequence-guided protein structure determination using graph convolutional and recurrent networks. In: 2020 IEEE 20th international conference on bioinformatics and bioengineering (BIBE); 2020. p. 122–127.
- 29Xu K, Wang Z, Shi J, Li H, Zhang QC. A2-Net: molecular structure estimation from cryo-EM density volumes. AAAI. 2019; 33: 1230–7.
- 30Chen M, Baldwin PR, Ludtke SJ, Baker ML. De novo modeling in cryo-EM density maps with Pathwalking. J Struct Biol. 2016; 196: 289–98.
- 31Volkmann N, Hanein D. Quantitative fitting of atomic models into observed densities derived by electron microscopy. J Struct Biol. 1999; 125: 176–84.
- 32Wriggers W, Milligan RA, McCammon JA. Situs: a package for docking crystal structures into low-resolution maps from electron microscopy. J Struct Biol. 1999; 125: 185–95.
- 33Roseman AM. Docking structures of domains into maps from cryo-electron microscopy using local correlation. Acta Crystallogr D Biol Crystallogr. 2000; 56: 1332–40.
- 34Rossmann MG, Bernal R, Pletnev SV. Combining electron microscopic with X-ray crystallographic structures. J Struct Biol. 2001; 136: 190–200.
- 35Chacón P, Wriggers W. Multi-resolution contour-based fitting of macromolecular structures. J Mol Biol. 2002; 317: 375–84.
- 36Navaza J, Lepault J, Rey FA, Álvarez-Rúa C, Borge J. On the fitting of model electron densities into EM reconstructions: a reciprocal-space formulation. Acta Crystallogr D Biol Crystallogr. 2002; 58: 1820–5.
- 37Wu X, Milne JLS, Borgnia MJ, Rostapshov AV, Subramaniam S, Brooks BR. A core-weighted fitting method for docking atomic structures into low-resolution maps: application to cryo-electron microscopy. J Struct Biol. 2003; 141: 63–76.
- 38Kovacs JA, Chacón P, Cong Y, Metwally E, Wriggers W. Fast rotational matching of rigid bodies by fast Fourier transform acceleration of five degrees of freedom. Acta Crystallogr D Biol Crystallogr. 2003; 59: 1371–6.
- 39Ceulemans H, Russell RB. Fast fitting of atomic structures to low-resolution electron density maps by surface overlap maximization. J Mol Biol. 2004; 338: 783–93.
- 40Topf M, Baker ML, John B, Chiu W, Sali A. Structural characterization of components of protein assemblies by comparative modeling and electron cryo-microscopy. J Struct Biol. 2005; 149: 191–203.
- 41Garzón JI, Kovacs J, Abagyan R, Chacón P. ADP_EM: fast exhaustive multi-resolution docking for high-throughput coverage. Bioinformatics. 2007; 23: 427–33.
- 42Birmanns S, Wriggers W. Multi-resolution anchor-point registration of biomolecular assemblies and their components. J Struct Biol. 2007; 157: 271–80.
- 43Lasker K, Dror O, Shatsky M, Nussinov R, Wolfson HJ. EMatch: discovery of high resolution structural homologues of protein domains in intermediate resolution cryo-EM maps. IEEE/ACM Trans Comput Biol Bioinform. 2007; 4: 28–39.
- 44Kawabata T. Multiple subunit fitting into a low-resolution density map of a macromolecular complex using a Gaussian mixture model. Biophys J. 2008; 95: 4643–58.
- 45Lasker K, Topf M, Sali A, Wolfson HJ. Inferential optimization for simultaneous fitting of multiple components into a cryoEM map of their assembly. J Mol Biol. 2009; 388: 180–94.
- 46Pintilie GD, Zhang J, Goddard TD, Chiu W, Gossard DC. Quantitative analysis of cryo-EM density map segmentation by watershed and scale-space filtering, and fitting of structures by alignment to regions. J Struct Biol. 2010; 170: 427–38.
- 47Zhang S, Vasishtan D, Xu M, Topf M, Alber F. A fast mathematical programming procedure for simultaneous fitting of assembly components into cryoEM density maps. Bioinformatics. 2010; 26: i261–8.
- 48Woetzel N, Lindert S, Stewart PL, Meiler J. BCL::EM-Fit: rigid body fitting of atomic structures into density maps using geometric hashing and real space refinement. J Struct Biol. 2011; 175: 264–76.
- 49Birmanns S, Rusu M, Wriggers W. Using Sculptor and Situs for simultaneous assembly of atomic components into low-resolution shapes. J Struct Biol. 2011; 173: 428–35.
- 50de Vries SJ, Zacharias M. ATTRACT-EM: a new method for the computational assembly of large molecular machines using cryo-EM maps. PLoS ONE. 2012; 7:e49733.
- 51Esquivel-Rodríguez J, Kihara D. Fitting multimeric protein complexes into electron microscopy maps using 3D Zernike descriptors. J Phys Chem B. 2012; 116: 6854–61.
- 52Hoang TV, Cavin X, Ritchie DW. gEMfitter: a highly parallel FFT-based 3D density fitting tool with GPU texture memory acceleration. J Struct Biol. 2013; 184: 348–54.
- 53Derevyanko G, Grudinin S. HermiteFit: fast-fitting atomic structures into a low-resolution density map using three-dimensional orthogonal Hermite functions. Acta Crystallogr D Biol Crystallogr. 2014; 70: 2069–84.
- 54Pandurangan AP, Vasishtan D, Alber F, Topf M. γ-TEMPy: simultaneous fitting of components in 3D-EM maps of their assembly using a genetic algorithm. Structure. 2015; 23: 2365–76.
- 55van Zundert GCP, Melquiond ASJ, Bonvin AMJJ. Integrative modeling of biomolecular complexes: HADDOCKing with cryo-electron microscopy data. Structure. 2015; 23: 949–60.
- 56Bettadapura R, Rasheed M, Vollrath A, Bajaj C. PF2 fit: polar fast Fourier matched alignment of atomistic structures with 3D electron microscopy maps. PLoS Comput Biol. 2015; 11:e1004289.
- 57Kuzu G, Keskin O, Nussinov R, Gursoy A. PRISM-EM: template interface-based modelling of multi-protein complexes guided by cryo-electron microscopy density maps. Acta Crystallogr D Struct Biol. 2016; 72: 1137–48.
- 58Wriggers W, Birmanns S. Using Situs for flexible and rigid-body fitting of multiresolution single-molecule data. J Struct Biol. 2001; 133: 193–202.
- 59Chen JZ, Fürst J, Chapman MS, Grigorieff N. Low-resolution structure refinement in electron microscopy. J Struct Biol. 2003; 144: 144–51.
- 60Tama F, Miyashita O, Brooks CL III. Normal mode based flexible fitting of high-resolution structure into low-resolution experimental data from cryo-EM. J Struct Biol. 2004; 147: 315–26.
- 61Delarue M, Dumas P. On the use of low-frequency normal modes to enforce collective movements in refining macromolecular structural models. Proc Natl Acad Sci U S A. 2004; 101: 6957–62.
- 62Hinsen K, Reuter N, Navaza J, Stokes DL, Lacapère J-J. Normal mode-based fitting of atomic structure into electron density maps: application to sarcoplasmic reticulum Ca-ATPase. Biophys J. 2005; 88: 818–27.
- 63Velazquez-Muriel J-Á, Valle M, Santamaría-Pang A, Kakadiaris IA, Carazo J-M. Flexible fitting in 3D-EM guided by the structural variability of protein superfamilies. Structure. 2006; 14: 1115–26.
- 64Suhre K, Navaza J, Sanejouand Y-H. NORMA: a tool for flexible fitting of high-resolution protein structures into low-resolution electron-microscopy-derived density maps. Acta Crystallogr D Biol Crystallogr. 2006; 62: 1098–100.
- 65Topf M, Baker ML, Marti-Renom MA, Chiu W, Sali A. Refinement of protein structures by iterative comparative modeling and cryoEM density fitting. J Mol Biol. 2006; 357: 1655–68.
- 66Velazquez-Muriel JA, Carazo J-M. Flexible fitting in 3D-EM with incomplete data on superfamily variability. J Struct Biol. 2007; 158: 165–81.
- 67Topf M, Lasker K, Webb B, Wolfson H, Chiu W, Sali A. Protein structure fitting and refinement guided by cryo-EM density. Structure. 2008; 16: 295–307.
- 68Jolley CC, Wells SA, Fromme P, Thorpe MF. Fitting low-resolution cryo-EM maps of proteins using constrained geometric simulations. Biophys J. 2008; 94: 1613–21.
- 69Tan RK-Z, Devkota B, Harvey SC. YUP.SCX: coaxing atomic models into medium resolution electron density maps. J Struct Biol. 2008; 163: 163–74.
- 70Trabuco LG, Villa E, Mitra K, Frank J, Schulten K. Flexible fitting of atomic structures into electron microscopy maps using molecular dynamics. Structure. 2008; 16: 673–83.
- 71Orzechowski M, Tama F. Flexible fitting of high-resolution X-ray structures into cryoelectron microscopy maps using biased molecular dynamics simulations. Biophys J. 2008; 95: 5692–705.
- 72Siebert X, Navaza J. UROX 2.0: an interactive tool for fitting atomic models into electron-microscopy reconstructions. Acta Crystallogr D Biol Crystallogr. 2009; 65: 651–8.
- 73Zhu J, Cheng L, Fang Q, Zhou ZH, Honig B. Building and refining protein models within cryo-electron microscopy density maps based on homology modeling and multiscale structure refinement. J Mol Biol. 2010; 397: 835–51.
- 74Grubisic I, Shokhirev MN, Orzechowski M, Miyashita O, Tama F. Biased coarse-grained molecular dynamics simulation approach for flexible fitting of X-ray structure into cryo electron microscopy maps. J Struct Biol. 2010; 169: 95–105.
- 75Zheng W. Accurate flexible fitting of high-resolution protein structures into cryo-electron microscopy maps using coarse-grained pseudo-energy minimization. Biophys J. 2011; 100: 478–88.
- 76Chan K-Y, Gumbart J, McGreevy R, Watermeyer JM, Sewell BT, Schulten K. Symmetry-restrained flexible fitting for symmetric EM maps. Structure. 2011; 19: 1211–8.
- 77Whitford PC, Ahmed A, Yu Y, Hennelly SP, Tama F, Spahn CMT, et al. Excited states of ribosome translocation revealed through integrative molecular modeling. Proc Natl Acad Sci U S A. 2011; 108: 18943–8.
- 78Wang Z, Schröder GF. Real-space refinement with DireX: from global fitting to side-chain improvements. Biopolymers. 2012; 97: 687–97.
- 79Vashisth H, Skiniotis G, Brooks CL. Using enhanced sampling and structural restraints to refine atomic structures into low-resolution electron microscopy maps. Structure. 2012; 20: 1453–62.
- 80Ahmed A, Whitford PC, Sanbonmatsu KY, Tama F. Consensus among flexible fitting approaches improves the interpretation of cryo-EM data. J Struct Biol. 2012; 177: 561–70.
- 81Lopéz-Blanco JR, Chacón P. iMODFIT: efficient and robust flexible fitting based on vibrational analysis in internal coordinates. J Struct Biol. 2013; 184: 261–70.
- 82Singharoy A, Teo I, McGreevy R, Stone JE, Zhao J, Schulten K. Molecular dynamics-based refinement and validation for sub-5 Å cryo-electron microscopy maps. eLife. 2016; 5:e16105.
- 83Miyashita O, Kobayashi C, Mori T, Sugita Y, Tama F. Flexible fitting to cryo-EM density map using ensemble molecular dynamics simulations. J Comput Chem. 2017; 38: 1447–61.
- 84Dou H, Burrows DW, Baker ML, Ju T. Flexible fitting of atomic models into cryo-EM density maps guided by helix correspondences. Biophys J. 2017; 112: 2479–93.
- 85Croll TI. ISOLDE: a physically realistic environment for model building into low-resolution electron-density maps. Acta Crystallogr D Struct Biol. 2018; 74: 519–30.
- 86Kovacs JA, Galkin VE, Wriggers W. Accurate flexible refinement of atomic models against medium-resolution cryo-EM maps using damped dynamics. BMC Struct Biol. 2018; 18: 12.
- 87Kim DN, Moriarty NW, Kirmizialtin S, Afonine PV, Poon B, Sobolev OV, et al. Cryo_fit: democratization of flexible fitting for cryo-EM. J Struct Biol. 2019; 208: 1–6.
- 88Kidmose RT, Juhl J, Nissen P, Boesen T, Karlsen JL, Pedersen BP. Namdinator – automatic molecular dynamics flexible fitting of structural models into cryo-EM and crystallography experimental maps. IUCrJ. 2019; 6: 526–31.
- 89Mori T, Kulik M, Miyashita O, Jung J, Tama F, Sugita Y. Acceleration of cryo-EM flexible fitting for large biomolecular systems by efficient space partitioning. Structure. 2019; 27: 161–174.e3.
- 90Igaev M, Kutzner C, Bock LV, Vaiana AC, Grubmüller H. Automated cryo-EM structure refinement using correlation-driven molecular dynamics. eLife. 2019; 8:e43542.
- 91Zhang B, Zhang W, Pearce R, Zhang Y, Shen H-B. Fitting low-resolution protein structures into cryo-EM density maps by multiobjective optimization of global and local correlations. J Phys Chem B. 2021; 235: 528–38.
- 92van Zundert GCP, Bonvin AM. Fast and sensitive rigid-body fitting into cryo-EM density maps with PowerFit. AIMS Biophys. 2015; 2: 73–87.
- 93Kim DN, Gront D, Sanbonmatsu KY. Practical considerations for atomistic structure modeling with cryo-EM maps. J Chem Inf Model. 2020; 60: 2436–42.
- 94Wang RY-R, Kudryashev M, Li X, Egelman EH, Basler M, Cheng Y, et al. De novo protein structure determination from near-atomic-resolution cryo-EM maps. Nat Methods. 2015; 12: 335–8.
- 95Senior AW, Evans R, Jumper J, Kirkpatrick J, Sifre L, Green T, et al. Improved protein structure prediction using potentials from deep learning. Nature. 2020; 577: 706–10.
- 96Yang J, Anishchenko I, Park H, Peng Z, Ovchinnikov S, Baker D. Improved protein structure prediction using predicted interresidue orientations. Proc Natl Acad Sci U S A. 2020; 117: 1496–503.
- 97Xu J. Distance-based protein folding powered by deep learning. Proc Natl Acad Sci U S A. 2019; 116: 16856–65.
- 98Abriata LA, Tamò GE, Dal PM. A further leap of improvement in tertiary structure prediction in CASP13 prompts new routes for future assessments. Proteins. 2019; 87: 1100–12.
- 99Chen H, Skolnick J. M-TASSER: an algorithm for protein quaternary structure prediction. Biophys J. 2008; 94: 918–28.
- 100Mukherjee S, Zhang Y. Protein–protein complex structure predictions by multimeric threading and template recombination. Structure. 2011; 19: 955–66.
- 101Guerler A, Govindarajoo B, Zhang Y. Mapping monomeric threading to protein–protein structure prediction. J Chem Inf Model. 2013; 53: 717–25.
- 102Kundrotas PJ, Zhu Z, Janin J, Vakser IA. Templates are available to model nearly all complexes of structurally characterized proteins. Proc Natl Acad Sci U S A. 2012; 109: 9438–41.
- 103Rozanov M, Wolfson HJ. AAnchor: CNN guided detection of anchor amino acids in high resolution cryo-EM density maps. In: 2018 IEEE international conference on bioinformatics and biomedicine (BIBM). 2018. p. 88–91.
- 104Sigworth FJ. Principles of cryo-EM single-particle image processing. Microscopy (Oxf). 2016; 65: 57–67.
- 105Bhamre T, Zhang T, Singer A. Denoising and covariance estimation of single particle cryo-EM images. J Struct Biol. 2016; 195: 72–81.
- 106Cong Y, Ludtke SJ. Chapter eight – Single Particle Analysis at High Resolution. In: GJ Jensen, editor. Methods in enzymology. Academic Press; 2010; 482: p. 211–35.
- 107Penczek PA, Fang J, Li X, Cheng Y, Loerke J, Spahn CMT. CTER—rapid estimation of CTF parameters with error assessment. Ultramicroscopy. 2014; 140: 9–19.
- 108Zivanov J, Nakane T, Forsberg BO, Kimanius D, Hagen WJ, Lindahl E, et al. New tools for automated high-resolution cryo-EM structure determination in RELION-3. eLife. 2018; 7:e42166.
- 109Heimowitz A, Andén J, Singer A. Reducing bias and variance for CTF estimation in single particle cryo-EM. Ultramicroscopy. 2020; 212: 112950.
- 110Arbeláez P, Han B-G, Typke D, Lim J, Glaeser RM, Malik J. Experimental evaluation of support vector machine-based and correlation-based approaches to automatic particle selection. J Struct Biol. 2011; 175: 319–28.
- 111Heimowitz A, Andén J, Singer A. APPLE picker: automatic particle picking, a low-effort cryo-EM framework. J Struct Biol. 2018; 204: 215–27.
- 112Wang F, Gong H, Liu G, Li M, Yan C, Xia T, et al. DeepPicker: a deep learning approach for fully automated particle picking in cryo-EM. J Struct Biol. 2016; 195: 325–36.
- 113Xiao Y, Yang G. A fast method for particle picking in cryo-electron micrographs based on fast R-CNN. AIP Conf Proc. 2017; 1836:020080.
- 114Zhu Y, Ouyang Q, Mao Y. A deep convolutional neural network approach to single-particle recognition in cryo-electron microscopy. BMC Bioinform. 2017; 18: 348.
- 115Zhang J, Wang Z, Chen Y, Han R, Liu Z, Sun F, et al. PIXER: an automated particle-selection method based on segmentation using a deep neural network. BMC Bioinform. 2019; 20: 41.
- 116Bepler T, Morin A, Rapp M, Brasch J, Shapiro L, Noble AJ, et al. Positive-unlabeled convolutional neural networks for particle picking in cryo-electron micrographs. Nat Methods. 2019; 16: 1153–60.
- 117Wagner T, Merino F, Stabrin M, Moriya T, Antoni C, Apelbaum A, et al. SPHIRE-crYOLO is a fast and accurate fully automated particle picker for cryo-EM. Commun Biol. 2019; 2: 1–13.
- 118Tegunov D, Cramer P. Real-time cryo-electron microscopy data preprocessing with Warp. Nat Methods. 2019; 16: 1146–52.
- 119Al-Azzawi A, Ouadou A, Tanner JJ, Cheng J. AutoCryoPicker: an unsupervised learning approach for fully automated single particle picking in cryo-EM images. BMC Bioinform. 2019; 20: 326.
- 120Tang G, Peng L, Baldwin PR, Mann DS, Jiang W, Rees I, et al. EMAN2: an extensible image processing suite for electron microscopy. J Struct Biol. 2007; 157: 38–46.
- 121Sanchez-Garcia R, Segura J, Maluenda D, Sorzano COS, Carazo JM. MicrographCleaner: a python package for cryo-EM micrograph cleaning using deep learning. J Struct Biol. 2020; 210:107498.
- 122Chen JZ, Grigorieff N. SIGNATURE: a single-particle selection system for molecular electron microscopy. J Struct Biol. 2007; 157: 168–73.
- 123Yang Z, Fang J, Chittuluru J, Asturias FJ, Penczek PA. Iterative stable alignment and clustering of 2D transmission electron microscope images. Structure. 2012; 20: 237–47.
- 124Chung S-C, Lin H-H, Niu P-Y, Huang S-H, Tu I-P, Chang W-H. Pre-pro is a fast pre-processor for single-particle cryo-EM by enhancing 2D classification. Commun Biol. 2020; 3: 1–12.
- 125Miolane N, Poitevin F, Li Y, Holmes S. Estimation of orientation and camera parameters from cryo-electron microscopy images with variational autoencoders and generative adversarial networks. In: 2020 IEEE/CVF conference on computer vision and pattern recognition workshops (CVPRW). 2020. p. 4174–4183.
- 126Martin TG, Bharat TAM, Joerger AC, Bai X, Praetorius F, Fersht AR, et al. Design of a molecular support for cryo-EM structure determination. Proc Natl Acad Sci U S A. 2016; 113: E7456–63.
- 127Scheres SHW. A Bayesian view on cryo-EM structure determination. J Mol Biol. 2012; 415: 406–18.
- 128Zhong ED, Bepler T, Berger B, Davis JH. CryoDRGN: reconstruction of heterogeneous cryo-EM structures using neural networks. Nat Methods. 2021; 18: 176–85.
- 129Punjani A, Zhang H, Fleet DJ. Non-uniform refinement: adaptive regularization improves single-particle cryo-EM reconstruction. Nat Methods. 2020; 17: 1214–21.
- 130Terashi G, Kihara D. De novo main-chain modeling for EM maps using MAINMAST. Nat Commun. 2018; 9: 1618.
- 131Lindert S, Alexander N, Wötzel N, Karakaş M, Stewart PL, Meiler J. EM-fold: de novo atomic-detail protein structure determination from medium-resolution density maps. Structure. 2012; 20: 464–78.
- 132Baker ML, Abeysinghe SS, Schuh S, Coleman RA, Abrams A, Marsh MP, et al. Modeling protein structure at near atomic resolutions with gorgon. J Struct Biol. 2011; 174: 360–73.
- 133DiMaio F, Leaver-Fay A, Bradley P, Baker D, André I. Modeling symmetric macromolecular structures in Rosetta3. PLoS ONE. 2011; 6:e20450.
- 134Terashi G, Kagaya Y, Kihara D. MAINMASTseg: automated map segmentation method for cryo-EM density maps with symmetry. J Chem Inf Model. 2020; 60: 2634–43.
- 135Liebschner D, Afonine PV, Baker ML, Bunkóczi G, Chen VB, Croll TI, et al. Macromolecular structure determination using X-rays, neutrons and electrons: recent developments in Phenix. Acta Crystallogr D Struct Biol. 2019; 75(Pt 10): 861–77.
- 136Kuhlman B, Bradley P. Advances in protein structure prediction and design. Nat Rev Mol Cell Biol. 2019; 20: 681–97.
- 137Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, et al. The Protein Data Bank. Nucleic Acids Res. 2000; 28: 235–42.
- 138He J, Wang J, Tao H, Xiao Y, Huang S-Y. HNADOCK: a nucleic acid docking server for modeling RNA/DNA–RNA/DNA 3D complex structures. Nucleic Acids Res. 2019; 47(W1): W35–42.
- 139Li B, Cao Y, Westhof E, Miao Z. Advances in RNA 3D structure modeling using experimental data. Front Genet. 2020; 11:574485.
- 140Terwilliger TC, Grosse-Kunstleve RW, Afonine PV, Moriarty NW, Zwart PH, Hung L-W, et al. Iterative model building, structure refinement and density modification with the PHENIX AutoBuild wizard. Acta Crystallogr D Biol Crystallogr. 2008; 64: 61–9.
- 141Langer G, Cohen SX, Lamzin VS, Perrakis A. Automated macromolecular model building for X-ray crystallography using ARP/wARP version 7. Nat Protoc. 2008; 3: 1171–9.
- 142AlQuraishi M. ProteinNet: a standardized data set for machine learning of protein structure. BMC Bioinform. 2019; 20: 311.
- 143Suzek BE, Wang Y, Huang H, McGarvey PB, Wu CH, UniProt Consortium. UniRef clusters: a comprehensive and scalable alternative for improving sequence similarity searches. Bioinformatics. 2015; 31: 926–32.
- 144Si D, Moritz SA, Pfab J, Hou J, Cao R, Wang L, et al. Deep learning to predict protein backbone structure from high-resolution cryo-EM density maps. Sci Rep. 2020; 10: 4282.
- 145Ronneberger O, Fischer P, Brox T. U-Net: convolutional networks for biomedical image segmentation. In: N Navab, J Hornegger, WM Wells, AF Frangi, editors. Medical image computing and computer-assisted intervention – MICCAI 2015. Lecture notes in computer science. Cham: Springer International Publishing; 2015. p. 234–41.
- 146Krivov GG, Shapovalov MV, Dunbrack RL. Improved prediction of protein side-chain conformations with SCWRL4. Proteins. 2009; 77: 778–95.
- 147Adams PD, Grosse-Kunstleve RW, Hung LW, Ioerger TR, McCoy AJ, Moriarty NW, et al. PHENIX: building new software for automated crystallographic structure determination. Acta Crystallogr D Biol Crystallogr. 2002; 58(Pt 11): 1948–54.
- 148Singh J, Hanson J, Paliwal K, Zhou Y. RNA secondary structure prediction using an ensemble of two-dimensional deep neural networks and transfer learning. Nat Commun. 2019; 10: 5407.
- 149Mostosi P, Schindelin H, Kollmannsberger P, Thorn A. Haruspex: a neural network for the automatic identification of oligonucleotides and protein secondary structure in cryo-electron microscopy maps. Angew Chem Int Ed. 2020; 59: 14788–95.
- 150Hollingsworth SA, Karplus PA. A fresh look at the Ramachandran plot and the occurrence of standard structures in proteins. Biomol Concepts. 2010; 1: 271–83.
- 151Kappel K, Zhang K, Su Z, Watkins AM, Kladwang W, Li S, et al. Accelerated cryo-EM-guided determination of three-dimensional RNA-only structures. Nat Methods. 2020; 17: 699–707.
- 152Sato K, Akiyama M, Sakakibara Y. RNA secondary structure prediction using deep learning with thermodynamic integration. Nat Commun. 2021; 12: 941.
- 153Chen VB, Arendall WB, Headd JJ, Keedy DA, Immormino RM, Kapral GJ, et al. MolProbity: all-atom structure validation for macromolecular crystallography. Acta Crystallogr D Biol Crystallogr. 2010; 66: 12–21.
- 154Barad BA, Echols N, Wang RY-R, Cheng Y, DiMaio F, Adams PD, et al. EMRinger: side chain-directed model and map validation for 3D cryo-electron microscopy. Nat Methods. 2015; 12: 943–6.
- 155Cao R, Bhattacharya D, Hou J, Cheng J. DeepQA: improving the estimation of single protein model quality with deep belief networks. BMC Bioinform. 2016; 17: 1–9.
- 156Smith J, Conover M, Stephenson N, Eickholt J, Si D, Sun M, et al. TopQA: a topological representation for single-model protein quality assessment with machine learning. Int J Comput Biol Drug Des. 2020; 13: 144–53.
- 157Conover M, Staples M, Si D, Sun M, Cao R. Angular QA. Protein model quality assessment with LSTM networks. bioRxiv. 2019:560995.
- 158Uziela K, Menendez Hurtado D, Shu N, Wallner B, Elofsson A. ProQ3D: improved model quality assessments using deep learning. Bioinformatics. 2017; 33: 1578–80.
- 159Lawson CL, Kryshtafovych A, Adams PD, Afonine PV, Baker ML, Barad BA, et al. Cryo-EM model validation recommendations based on outcomes of the 2019 EMDataResource challenge. Nat Methods. 2021; 18: 156–64.
- 160Rohou A. Improving cryo-EM structure validation. Nat Methods. 2021; 18: 130–1.
- 161Matsumoto S, Ishida S, Araki M, Kato T, Terayama K, Okuno Y. Extraction of protein dynamics information from cryo-EM maps using deep learning. Nat Mach Intell. 2021; 4: 1–8.
- 162Du Y, An W, Zhu X, Sun Q, Qi J, Ye K. Cryo-EM structure of 90S small ribosomal subunit precursors in transition states. Science. 2020; 369: 1477–81.
- 163Yang H, Wang M, Yu Z, Zhao X, Li A. GANcon: protein contact map prediction with deep generative adversarial network. IEEE Access. 2020; 8: 80899–907.
- 164Madani A, McCann B, Naik N, Keskar NS, Anand N, Eguchi RR, et al. ProGen: language modeling for protein generation. bioRxiv. 2020:2020.03.07.982272.
- 165Rives A, Meier J, Sercu T, Goyal S, Lin Z, Liu J, et al. Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences. bioRxiv. 2020:622803.
- 166Gao W, Mahajan SP, Sulam J, Gray JJ. Deep learning in protein structural modeling and design. Patterns. 2020; 1: 100142.