Automated model building and protein identification in cryo-EM maps. Jamali K, Käll L, Zhang R, Brown A, Kimanius D, Scheres SHW. Nature. 2024 Nature. Apr 11;628(8007):450–457.
[github] (open-source MIT license)
(published in past 2 years and citations from May/June 2025 put it in the
top 0.1% in the field of Biology & Biochemistry) as well as "highly cited"
(top 1% of field, not necessarily from the last 2 years)
– checked September 2025
Must have been delusional when I picked out this paper with 500 steps I don't understand.
ModelAngelo was trained on 3715 map-model pairs: maps with resolution <4Å deposited before Apr 1 2022, resampled to pixel size 1Å, paired with PDB models covering the entire maps correctly. (Compare to 117 pairs for training findMySequence and 1400 for DeepTracer.)
ModelAngelo generates residue (amino acid) type predictions and converts them into HMM profiles for HMMER search against a set of sequences. The sequences are supplied by the user if the proteins in the cryo-EM experiment are known.
To enable model building for structures with unknown sequences, however, a version of ModelAngelo was also trained without the sequence module. That version also generates amino acid type probabilities and converts them into HMM profiles, but in that case, the HMMER search is against a larger proteome instead of just the user-supplied sequences.