Automated domain annotation is an important tool for structural informatics. These pipelines typically involve searching query sequences against hidden Markov model (HMM) profiles, yielding matches to profiles for various domains. However, domain annotation can be ambiguous or inaccurate when proteins contain domains with non-contiguous residue ranges, and especially when insertional domains are hosted within them. Here, we present DomainMapper, an algorithm that accurately assigns a unique domain structure annotation to a query sequence, including those with complex topologies. We validate our domain assignments using the AlphaFold database and confirm that non-contiguity is pervasive (10.74% of all domains in yeast and 4.52% in human). Using this resource, we find that certain folds have strong propensities to be non-contiguous or insertional across the Tree of Life. DomainMapper is freely available and can be ran as a single command-line function.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9601794PMC
http://dx.doi.org/10.1002/pro.4465DOI Listing

Publication Analysis

Top Keywords

domain structure
8
structure annotation
8
domain annotation
8
domain
5
domainmapper accurate
4
accurate domain
4
annotation
4
annotation including
4
including non-contiguous
4
non-contiguous topologies
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!