Background: Phylogenetic study of protein sequences provides unique and valuable insights into the molecular and genetic basis of important medical and epidemiological problems, as well as into the origins and development of physiological features in present-day organisms. Consensus phylogenies based on the bootstrap and other resampling methods play a crucial part in analyzing the robustness of the trees produced for these analyses.

Methodology: Our focus was to increase the number of bootstrap replications that can be performed on large protein datasets using the maximum parsimony, distance matrix, and maximum likelihood methods. We modified the PHYLIP package using MPI so that large-scale phylogenetic studies of protein sequences, with a statistically robust number of bootstrapped datasets, can be performed in a moderate amount of time. This paper discusses the methodology used to parallelize the PHYLIP programs and reports the performance, on several protein datasets, of the parallel PHYLIP programs that are relevant to the study of protein evolution.
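
The abstract does not spell out how the bootstrap work is divided among MPI processes. As an illustration only, the sketch below assumes the simplest scheme: each replicate is an independent task, and replicates are dealt out round-robin to workers. The function names (`bootstrap_alignment`, `replicates_for_rank`, `run_rank`) are hypothetical and are not taken from PHYLIP; no MPI library is used, and `rank` and `size` stand in for the values an MPI communicator would supply.

```python
import random


def bootstrap_alignment(alignment, rng):
    """Build one bootstrap replicate by resampling alignment columns
    (sites) with replacement. `alignment` is a list of equal-length
    strings, one per sequence."""
    n_sites = len(alignment[0])
    cols = [rng.randrange(n_sites) for _ in range(n_sites)]
    return ["".join(seq[c] for c in cols) for seq in alignment]


def replicates_for_rank(n_replicates, rank, size):
    """Replicate indices assigned to worker `rank` out of `size` workers.
    Assumption: a static round-robin split, not the paper's documented scheme."""
    return list(range(rank, n_replicates, size))


def run_rank(alignment, n_replicates, rank, size, base_seed=0):
    """Generate the replicates owned by one worker.
    Seeding per replicate (not per worker) keeps the resulting datasets
    identical no matter how many workers share the job."""
    results = {}
    for rep in replicates_for_rank(n_replicates, rank, size):
        rng = random.Random(base_seed + rep)
        results[rep] = bootstrap_alignment(alignment, rng)
    return results


if __name__ == "__main__":
    # Toy alignment of three short protein sequences (made-up data).
    toy = ["MKVLA", "MKILA", "MRVLG"]
    for rank in range(4):
        print(rank, sorted(run_rank(toy, 10, rank, 4)))
```

In a real run, each worker would pass its replicates to a tree-building method and the resulting trees would be gathered to form the consensus phylogeny. Because replicates are independent of one another, this kind of split needs communication only at the final gather step.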

Conclusions: Calculations that currently take a few days on a state-of-the-art desktop workstation are reduced to calculations that can be performed over a lunch break on a modern parallel computer. Of the three protein methods tested, the maximum likelihood method scales best, followed by the distance method and then the maximum parsimony method. However, the maximum likelihood method requires significant memory resources, which limits its application to more moderately sized protein datasets.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2981553
PLOS: http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0013999
