Determining the atomic-level structure of a protein has been a decades-long challenge. However, recent advances in transformers and related neural network architectures have enabled researchers to significantly improve solutions to this problem. These methods use large datasets of sequence information and corresponding known protein template structures, if available. Yet, such methods only focus on sequence information. Other available prior knowledge could also be utilized, such as constructs derived from x-ray crystallography experiments and the known structures of the most common conformations of amino acid residues, which we refer to as partial structures. To the best of our knowledge, we propose the first transformer-based model that directly utilizes experimental protein crystallographic data and partial structure information to calculate electron density maps of proteins. In particular, we use Patterson maps, which can be directly obtained from x-ray crystallography experimental data, thus bypassing the well-known crystallographic phase problem. We demonstrate that our method, CrysFormer, achieves precise predictions on two synthetic datasets of peptide fragments in crystalline forms, one with two residues per unit cell and the other with fifteen. These predictions can then be used to generate accurate atomic models using established crystallographic refinement programs.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11326852 | PMC |
http://dx.doi.org/10.1063/4.0000252 | DOI Listing |
J Vet Med Educ
November 2024
Center for Survey Statistics and Methodology, Iowa State University, 2401 Osborn Drive, Ames, IA 50011 USA.
Curriculum review is a required and essential part of the continuous improvement process to ensure that all elements of the curriculum are integrated to help students achieve intended outcomes. It is also an effective way of avoiding a disparity between the knowledge and skills students gain throughout their education and the knowledge and skills required in practice. One commonly used curriculum analysis approach is curriculum mapping, which requires extensive labor and time commitment.
View Article and Find Full Text PDFMol Ecol Resour
October 2024
Institute of Ecology and Evolution, University of Oregon, Eugene, Oregon, USA.
A fundamental goal in population genetics is to understand how variation is arrayed over natural landscapes. From first principles we know that common features such as heterogeneous population densities and barriers to dispersal should shape genetic variation over space, however there are few tools currently available that can deal with these ubiquitous complexities. Geographically referenced single nucleotide polymorphism (SNP) data are increasingly accessible, presenting an opportunity to study genetic variation across geographic space in myriad species.
View Article and Find Full Text PDFStruct Dyn
July 2024
Department of Computer Science, Rice University, Houston, Texas 77005, USA.
Determining the atomic-level structure of a protein has been a decades-long challenge. However, recent advances in transformers and related neural network architectures have enabled researchers to significantly improve solutions to this problem. These methods use large datasets of sequence information and corresponding known protein template structures, if available.
View Article and Find Full Text PDFA fundamental goal in population genetics is to understand how variation is arrayed over natural landscapes. From first principles we know that common features such as heterogeneous population densities and barriers to dispersal should shape genetic variation over space, however there are few tools currently available that can deal with these ubiquitous complexities. Geographically referenced single nucleotide polymorphism (SNP) data are increasingly accessible, presenting an opportunity to study genetic variation across geographic space in myriad species.
View Article and Find Full Text PDFGenes (Basel)
December 2023
Department of Computer Science, Georgia State University, Atlanta, GA 30303, USA.
Protein structure analysis is essential in various bioinformatics domains such as drug discovery, disease diagnosis, and evolutionary studies. Within structural biology, the classification of protein structures is pivotal, employing machine learning algorithms to categorize structures based on data from databases like the Protein Data Bank (PDB). To predict protein functions, embeddings based on protein sequences have been employed.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!