Combining phylogenetic and hidden Markov models in biosequence analysis.

J Comput Biol

Center for Biomolecular Science and Engineering, University of California, 1156 High Street, Santa Cruz, CA 95064, USA.

Published: August 2004

A few models have appeared in recent years that consider not only the way substitutions occur through evolutionary history at each site of a genome, but also the way the process changes from one site to the next. These models combine phylogenetic models of molecular evolution, which apply to individual sites, and hidden Markov models, which allow for changes from site to site. Besides improving the realism of ordinary phylogenetic models, they are potentially very powerful tools for inference and prediction--for example, for gene finding or prediction of secondary structure. In this paper, we review progress on combined phylogenetic and hidden Markov models and present some extensions to previous work. Our main result is a simple and efficient method for accommodating higher-order states in the HMM, which allows for context-dependent models of substitution--that is, models that consider the effects of neighboring bases on the pattern of substitution. We present experimental results indicating that higher-order states, autocorrelated rates, and multiple functional categories all lead to significant improvements in the fit of a combined phylogenetic and hidden Markov model, with the effect of higher-order states being particularly pronounced.

Download full-text PDF

Source
http://dx.doi.org/10.1089/1066527041410472DOI Listing

Publication Analysis

Top Keywords

hidden markov
16
phylogenetic hidden
12
markov models
12
higher-order states
12
models
9
changes site
8
phylogenetic models
8
combined phylogenetic
8
combining phylogenetic
4
hidden
4

Similar Publications

Polariton lattices as binarized neuromorphic networks.

Light Sci Appl

January 2025

Spin-Optics laboratory, St. Petersburg State University, St. Petersburg, 198504, Russia.

We introduce a novel neuromorphic network architecture based on a lattice of exciton-polariton condensates, intricately interconnected and energized through nonresonant optical pumping. The network employs a binary framework, where each neuron, facilitated by the spatial coherence of pairwise coupled condensates, performs binary operations. This coherence, emerging from the ballistic propagation of polaritons, ensures efficient, network-wide communication.

View Article and Find Full Text PDF

RNA-specific nucleotidyltransferases (rNTrs) add nontemplated nucleotides to the 3 end of RNA. Two noncanonical rNTRs that are thought to be poly(A) polymerases (PAPs) have been identified in the mitochondria of trypanosomes - KPAP1 and KPAP2. KPAP1 is the primary polymerase that adds adenines (As) to trypanosome mitochondrial mRNA 3 tails, while KPAP2 is a non-essential putative polymerase whose role in the mitochondria is ambiguous.

View Article and Find Full Text PDF

Uncovering dissipation from coarse observables: A case study of a random walk with unobserved internal states.

J Chem Phys

January 2025

Department of Chemistry and Oden Institute for Computational Engineering and Sciences, University of Texas at Austin, Austin, Texas 78712, USA.

Inferring underlying microscopic dynamics from low-dimensional experimental signals is a central problem in physics, chemistry, and biology. As a trade-off between molecular complexity and the low-dimensional nature of experimental data, mesoscopic descriptions such as the Markovian master equation are commonly used. The states in such descriptions usually include multiple microscopic states, and the ensuing coarse-grained dynamics are generally non-Markovian.

View Article and Find Full Text PDF

Pretrained Deep Neural Network Kin-SiM for Single-Molecule FRET Trace Idealization.

J Phys Chem B

January 2025

Single Molecule Analysis Group, Department of Chemistry, The University of Michigan, Ann Arbor, Michigan 48109, United States.

Single-molecule fluorescence resonance energy transfer (smFRET) has emerged as a pivotal technique for probing biomolecular dynamics over time at nanometer scales. Quantitative analyses of smFRET time traces remain challenging due to confounding factors such as low signal-to-noise ratios, photophysical effects such as bleaching and blinking, and the complexity of modeling the underlying biomolecular states and kinetics. The dynamic distance information shaping the smFRET trace powerfully uncovers even transient conformational changes in single biomolecules both at or far from equilibrium, relying on trace idealization to identify specific interconverting states.

View Article and Find Full Text PDF

The heat shock protein 70 (HSP70) family plays an important role in the growth and development of lettuce and in the defense response to high-temperature stress; however, its bioinformatics analysis in lettuce has been extremely limited. Genome-wide bioinformatics analysis methods such as chromosome location, phylogenetic relationships, gene structure, collinearity analysis, and promoter analysis were performed in the gene family, and the expression patterns in response to high-temperature stress were analyzed. The mechanism of in heat resistance in lettuce was studied by virus-induced gene silencing (VIGS) and transient overexpression techniques.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!