MrParse: finding homologues in the PDB and the EBI AlphaFold database for molecular replacement and more.

Acta Crystallogr D Struct Biol

Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom.

Published: May 2022

Crystallographers have an array of search-model options for structure solution by molecular replacement (MR). The well-established options of homologous experimental structures and regular secondary-structure elements or motifs are increasingly supplemented by computational modelling. Such modelling may be carried out locally or may use pre-calculated predictions retrieved from databases such as the EBI AlphaFold database. MrParse is a new pipeline that helps to streamline the decision process in MR by consolidating bioinformatic predictions in one place. When reflection data are provided, MrParse can rank any experimental homologues it finds by eLLG, which indicates the likelihood that a given search model will work in MR. Inbuilt displays of predicted secondary structure, coiled-coil and transmembrane regions further inform the choice of MR protocol. MrParse can also identify and rank homologues in the EBI AlphaFold database, a function that will also interest other structural biologists and bioinformaticians.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9063843
DOI: http://dx.doi.org/10.1107/S2059798322003576

Similar Publications

The evolutionary classification of protein domains (ECOD) classifies protein domains using a combination of sequence and structural data (http://prodata.swmed.edu/ecod).

The Pfam protein families database: embracing AI/ML.

Nucleic Acids Res

January 2025

European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK.

The Pfam protein families database is a comprehensive collection of protein domains and families used for genome annotation and protein structure and function analysis (https://www.ebi.ac.

Defining unique structural features in the MAFA and MAFB transcription factors that control Insulin gene activity.

J Biol Chem

December 2024

Department of Molecular Physiology & Biophysics, Vanderbilt University, Nashville, Tennessee, USA.

MAFA and MAFB are related basic-leucine-zipper domain-containing transcription factors which have important overlapping and distinct regulatory roles in a variety of cellular contexts, including hormone production in pancreatic islet cells. Here, we first examined how mutating conserved MAF protein-DNA contact sites obtained from X-ray crystal structure analysis impacted their DNA-binding and Insulin enhancer-driven activity. While most of these interactions were essential and their disruption severely compromised activity, we identified that regions outside of these contact sites also contributed to transcriptional activity.

Macromolecular protein complexes carry out most functions in the cell, including essential functions required for cell survival. Unfortunately, we lack the subunit composition for all human protein complexes. To address this gap, we integrated >25,000 mass spectrometry experiments using a machine learning approach to identify >15,000 human protein complexes.
