Picking alignments from (Steiner) trees.

J Comput Biol

Department of Mathematics, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.

Published: March 2004

The application of Needleman-Wunsch alignment techniques to biological sequences is complicated by two serious problems when the sequences are long: the running time, which scales as the product of the lengths of sequences, and the difficulty in obtaining suitable parameters that produce meaningful alignments. The running time problem is often corrected by reducing the search space, using techniques such as banding, or chaining of high-scoring pairs. The parameter problem is more difficult to fix, partly because the probabilistic model, which Needleman-Wunsch is equivalent to, does not capture a key feature of biological sequence alignments, namely the alternation of conserved blocks and seemingly unrelated nonconserved segments. We present a solution to the problem of designing efficient search spaces for pair hidden Markov models that align biological sequences by taking advantage of their associated features. Our approach leads to an optimization problem, for which we obtain a 2-approximation algorithm, and that is based on the construction of Manhattan networks, which are close relatives of Steiner trees. We describe the underlying theory and show how our methods can be applied to alignment of DNA sequences in practice, successfully reducing the Viterbi algorithm search space of alignment PHMMs by three orders of magnitude.

Download full-text PDF

Source
http://dx.doi.org/10.1089/10665270360688156DOI Listing

Publication Analysis

Top Keywords

steiner trees
8
biological sequences
8
running time
8
search space
8
sequences
5
picking alignments
4
alignments steiner
4
trees application
4
application needleman-wunsch
4
needleman-wunsch alignment
4

Similar Publications

Background And Aims: Hepatocellular carcinoma (HCC) exhibits a propensity for early recurrence following liver resection, resulting in a bleak prognosis. At present, majority of the predictive models for the early postoperative recurrence of HCC rely on the linear assumption of the Cox Proportional Hazard (CPH) model. However, the predictive efficacy of this model is constrained by the intricate nature of clinical data.

View Article and Find Full Text PDF

The Steiner tree Prosecutor: Revealing and disrupting criminal networks through a single suspect.

PLoS One

December 2024

Departamento de Ingeniería Industrial, FCFM, Universidad de Chile, Santiago, Chile.

Disrupting a criminal organization requires a significant deployment of human resources, time, information, and financial investment. In the early stages of an investigation, details about a specific crime are typically scarce, often with no known suspect. The literature has shown that an effective approach for analyzing criminal organizations is social network analysis.

View Article and Find Full Text PDF

Monkeypox (Mpox) is a growing public health concern, with complex interactions within host systems contributing to its impact. This study employs multi-omics approaches to uncover therapeutic targets and potential drug repurposing opportunities to better understand Mpox's molecular pathogenesis. We developed an in silico host-pathogen interaction (HPI) network and applied weighted gene co-expression network analysis (WGCNA) to explore interactions between Mpox and host proteins.

View Article and Find Full Text PDF

A heuristic method for solving the Steiner tree problem in graphs using network centralities.

PLoS One

June 2024

Department of Management Science, Graduate School of Engineering, Tokyo University of Science, Katsushika-ku, Tokyo, Japan.

We propose a heuristic method of using network centralities for constructing small-weight Steiner trees in this paper. The Steiner tree problem in graphs is one of the practical NP-hard combinatorial optimization problems. Given a graph and a set of vertices called terminals in the graph, the objective of the Steiner tree problem in graphs is to find a minimum weight Steiner tree that is a tree containing all the terminals.

View Article and Find Full Text PDF

The continuous spread of Bursaphelenchus xylophilus (Steiner and Buhrer) Nickle, commonly known as the organism that causes pine wilt disease (PWD), has become a notable threat to forest security in East Asia and southern Europe, and an assessment of the carbon loss caused by PWD damage is important to achieving carbon neutrality. This study used satellite remote sensing and 15-year ground monitoring data to measure the impact of PWD on the carbon storage of Pinus massoniana Lamb. (P.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!