Genetic Algorithm Embedded with a Search Space Dimension Reduction Scheme for Efficient Peptide Structure Predictions.

J Phys Chem B

Hefei National Research Center for Physical Sciences at Microscales & CAS Key Laboratory of Strongly-Coupled Quantum Matter Physics, Department of Physics, University of Science and Technology of China, Hefei 230026, China.

Published: April 2021

The computational determination of peptide conformations is a challenging task of finding minima in a high dimensional space. By combining the sampling efficiency of the genetic algorithm (GA) and the dimensionality reduction resulted from the backbone dihedral angle correlations, named as the path matrix (PM) method, a new searching algorithm, parallel microgenetic algorithm (PMGA), is proposed. Meanwhile, PMGA employs the density functional theory based energy as the fitness function and performs local geometry optimizations to enhance the reliability of its GA encoding strategy. Tests on peptides with up to eight amino-acid residues show PMGA is quite efficient for providing high-quality conformational coverages. The computational cost of the PMGA search increases slowly with the number of amino-acid residues in a peptide, with no sign of deterioration on the searching results for the increased length of the peptide. The PMGA method should therefore be useful for determining the conformations of oligopeptide, studying the protein-ligand interactions, and designing the peptide-based drugs.

Download full-text PDF

Source
http://dx.doi.org/10.1021/acs.jpcb.1c01255DOI Listing

Publication Analysis

Top Keywords

genetic algorithm
8
amino-acid residues
8
pmga
5
algorithm embedded
4
embedded search
4
search space
4
space dimension
4
dimension reduction
4
reduction scheme
4
scheme efficient
4

Similar Publications

Genome-wide association studies are enriched for interacting genes.

BioData Min

January 2025

The Department of Computational Biomedicine, Cedars-Sinai Medical Center, Los Angeles, CA, 90069, USA.

Background: With recent advances in single cell technology, high-throughput methods provide unique insight into disease mechanisms and more importantly, cell type origin. Here, we used multi-omics data to understand how genetic variants from genome-wide association studies influence development of disease. We show in principle how to use genetic algorithms with normal, matching pairs of single-nucleus RNA- and ATAC-seq, genome annotations, and protein-protein interaction data to describe the genes and cell types collectively and their contribution to increased risk.

View Article and Find Full Text PDF

Background: Preeclampsia significantly impacts maternal and perinatal health. Early screening using advanced models and primary prevention with low-dose acetylsalicylic acid for high-risk populations is crucial to reduce the disease's incidence. This study assesses the feasibility of implementing preterm preeclampsia screening and prevention by leveraging information from our current aneuploidy screening program in a real-world setting with geographic separation clinical site and laboratory analysis site.

View Article and Find Full Text PDF

Exploring Brain Imaging and Genetic Risk Factors in Different Progression States of Alzheimer's Disease Through OSnetNMF-Based Methods.

J Mol Neurosci

January 2025

Bio-Med Big Data Center, CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China.

Alzheimer's disease (AD) is a neurodegenerative disease with no effective treatment, often preceded by mild cognitive impairment (MCI). Multimodal imaging genetics integrates imaging and genetic data to gain a deeper understanding of disease progression and individual variations. This study focuses on exploring the mechanisms that drive the transition from normal cognition to MCI and ultimately to AD.

View Article and Find Full Text PDF

Resolving tissue complexity by multimodal spatial omics modeling with MISO.

Nat Methods

January 2025

Statistical Center for Single-Cell and Spatial Genomics, Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.

Spatial molecular profiling has provided biomedical researchers valuable opportunities to better understand the relationship between cellular localization and tissue function. Effectively modeling multimodal spatial omics data is crucial for understanding tissue complexity and underlying biology. Furthermore, improvements in spatial resolution have led to the advent of technologies that can generate spatial molecular data with subcellular resolution, requiring the development of computationally efficient methods that can handle the resulting large-scale datasets.

View Article and Find Full Text PDF

Preeclampsia (PE) is a major pregnancy-specific cardiovascular complication posing latent life-threatening risks to mothers and neonates. The contribution of immune dysregulation to PE is not fully understood, highlighting the need to explore molecular markers and their relationship with immune infiltration to potentially inform therapeutic strategies. We used bioinformatics tools to analyze gene expression data from the Gene Expression Omnibus (GEO) database using the GEOquery package in R.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!