Ultra-large alignments using phylogeny-aware profiles.

Genome Biol

Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, 1206 West Gregory Drive, Urbana, 61801, Illinois, USA.

Published: June 2015

Many biological questions, including the estimation of deep evolutionary histories and the detection of remote homology between protein sequences, rely upon multiple sequence alignments and phylogenetic trees of large datasets. However, accurate large-scale multiple sequence alignment is very difficult, especially when the dataset contains fragmentary sequences. We present UPP, a multiple sequence alignment method that uses a new machine learning technique, the ensemble of hidden Markov models, which we propose here. UPP produces highly accurate alignments for both nucleotide and amino acid sequences, even on ultra-large datasets or datasets containing fragmentary sequences. UPP is available at https://github.com/smirarab/sepp .

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4492008PMC
http://dx.doi.org/10.1186/s13059-015-0688-zDOI Listing

Publication Analysis

Top Keywords

multiple sequence
12
sequence alignment
8
fragmentary sequences
8
sequences upp
8
ultra-large alignments
4
alignments phylogeny-aware
4
phylogeny-aware profiles
4
profiles biological
4
biological questions
4
questions including
4

Similar Publications

We report a case of Acanthamoeba infection in an HCT recipient with steroid-refractory GVHD. We highlight the multiple challenges that free-living ameba infections present to the clinician, the clinical laboratory, transplant infectious disease for review, hospital epidemiology if nosocomial transmission is considered, and public health officials, as exposure source identification can be a significant challenge. Transplant physicians should include Acanthamoeba infections in their differential diagnosis of a patient with skin, sinus, lung, and/or brain involvement.

View Article and Find Full Text PDF

Genetic architecture of Multiple Myeloma and its prognostic implications - An updated review.

Malays J Pathol

December 2024

Universiti Sains Malaysia, School of Medical Sciences, Human Genome Centre, Health Campus, Kelantan, Malaysia.

Multiple myeloma (MM), a clonal B-cell neoplasia, is an incurable and heterogeneous disease where survival ranges from a few months to more than 10 years. The clinical heterogeneity of MM arises from multiple genomic events that result in tumour development and progression. Recurring genomic abnormalities including cytogenetic abnormalities, gene mutations and abnormal gene expression profiles in myeloma cells have a strong prognostic power.

View Article and Find Full Text PDF

Background: The regulatory role of the apolipoprotein E (APOE) ε4 allele in the clinical manifestations of spinocerebellar ataxia type 3 (SCA3) remains unclear. This study aimed to evaluate the impact of the APOE ε4 allele on cognitive and motor functions in SCA3 patients.

Methods: This study included 281 unrelated SCA3 patients and 182 controls.

View Article and Find Full Text PDF

Therapeutic effect of novel drug candidate, PRG-N-01, on NF2 syndrome-related tumor.

Neuro Oncol

December 2024

Department of Molecular Biology, College of Natural Science, Pusan National University, Busan, Republic of Korea.

Background: NF2-related schwannomatosis (NF2-SWN) is associated with multiple benign tumors in the nervous system. NF2-SWN, caused by mutations in the NF2 gene, has developed into intracranial and spinal schwannomas. Because of the high surgical risk and frequent recurrence of multiple tumors, targeted therapy is necessary.

View Article and Find Full Text PDF

Familial hypercholesterolemia in Chinese children and adolescents: a multicenter study.

Lipids Health Dis

December 2024

Department of Endocrinology, Children's Hospital of Zhejiang University School of Medicine, National Clinical Research Center for Child Health, Hangzhou, Zhejiang, 310052, China.

Background: Familial hypercholesterolemia (FH) is an inherited disorder mainly marked by increased low-density lipoprotein cholesterol (LDL-C) concentrations and a heightened risk of early-onset arteriosclerotic cardiovascular disease (ASCVD). This study seeks to characterize the genetic spectrum and genotype‒phenotype correlations of FH in Chinese pediatric individuals.

Methods: Data were gathered from individuals diagnosed with FH either clinically or genetically at multiple hospitals across mainland China from January 2016 to June 2024.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!