Bayesian nonparametrics in protein remote homology search.

Bioinformatics

Institute of Biotechnology, Vilnius University, Vilnius 10257, Lithuania.

Published: September 2016

Motivation: Wide application of modeling of three-dimensional protein structures in biomedical research motivates developing protein sequence alignment computer tools featuring high alignment accuracy and sensitivity to remotely homologous proteins. In this paper, we aim at improving the quality of alignments between sequence profiles, encoded multiple sequence alignments. Modeling profile contexts, fixed-length profile fragments, is engaged to achieve this goal.

Results: We develop a hierarchical Dirichlet process mixture model to describe the distribution of profile contexts, which is able to capture dependencies between amino acids in each context position. The model represents an attempt at modeling profile fragments at several hierarchical levels, within the profile and among profiles. Even modeling unit-length contexts leads to greater improvements than processing 13-length contexts previously. We develop a new profile comparison method, called COMER, integrating the model. A benchmark with three other profile-to-profile comparison methods shows an increase in both sensitivity and alignment quality.

Availability And Implementation: COMER is open-source software licensed under the GNU GPLv3, available at https://sourceforge.net/projects/comer

Contact: mindaugas.margelevicius@bti.vu.lt

Supplementary Information: Supplementary data are available at Bioinformatics online.

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/btw213DOI Listing

Publication Analysis

Top Keywords

modeling profile
8
profile contexts
8
profile fragments
8
profile
6
bayesian nonparametrics
4
nonparametrics protein
4
protein remote
4
remote homology
4
homology search
4
search motivation
4

Similar Publications

Objectives: To investigate the clinical and laboratory features of Sjögren's syndrome-associated autoimmune liver disease (SS-ALD) patients and identify potential risk and prognostic factors.

Methods: SS patients with or without ALD, who visited Tongji Hospital between the years 2011 and 2021 and met the 2012 American College of Rheumatology (ACR) classification criteria for Sjögren's syndrome, were retrospectively enrolled. The clinical and laboratory data of the enrolled patients, including autoimmune antibodies, were collected and analyzed with principal component analysis, correlation analysis, LASSO regression, and Cox regression.

View Article and Find Full Text PDF

This study aimed to identify shared gene expression related to circadian rhythm disruption in polycystic ovary syndrome (PCOS) and non-alcoholic fatty liver disease (NAFLD) to discover common diagnostic biomarkers. Visceral fat RNA samples were collected from 12 PCOS and 14 non-PCOS patients, a sample size representing the clinical situation and sufficient to capture PCOS gene expression profiles. Along with liver transcriptome profiles from NAFLD patients, these data were analyzed to identify crosstalk circadian rhythm-related genes (CRRGs) between the diseases.

View Article and Find Full Text PDF

Cells are subjected to dynamic mechanical environments which impart forces and induce cellular responses. In age-related conditions like pulmonary fibrosis, there is both an increase in tissue stiffness and an accumulation of senescent cells. While senescent cells produce a senescence-associated secretory phenotype (SASP), the impact of physical stimuli on both cellular senescence and the SASP is not well understood.

View Article and Find Full Text PDF

Prediction of pre-eclampsia using maternal hemodynamic parameters at 12 + 0 to 15 + 6 weeks.

Ultrasound Obstet Gynecol

January 2025

Department of Obstetrics and Gynaecology, Prince of Wales Hospital, The Chinese University of Hong Kong, Hong Kong, SAR, China.

Objectives: To compare the maternal hemodynamic profile at 12 + 0 to 15 + 6 weeks' gestation in women who subsequently developed pre-eclampsia (PE) and those who did not, and to assess the screening performance of maternal hemodynamic parameters for PE in combination with the Fetal Medicine Foundation (FMF) triple test, including maternal factors (MF), mean arterial pressure (MAP), uterine artery pulsatility index and placental growth factor.

Methods: This was a prospective case-control study involving Chinese women with a singleton pregnancy who underwent preterm PE screening at 11 + 0 to 13 + 6 weeks' gestation using the FMF triple test, between February 2020 and February 2023. Women identified as being at high risk (≥ 1:100) for preterm PE by the FMF triple test were matched 1:1 with women identified as low risk (< 1:100) for maternal age ± 3 years, maternal weight ± 5 kg and date of screening ± 14 days.

View Article and Find Full Text PDF

The neurobiological mechanisms driving the ictal-interictal fluctuations and the chronification of migraine remain elusive. We aimed to construct a composite genetic-microRNA model that could reflect the dynamic perturbations of the disease course and inform the pathogenesis of migraine. We prospectively recruited four groups of participants, including interictal episodic migraine (i.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!