Phylogenetic distances are encoded in networks of interacting pathways.

Bioinformatics

Systems Biology Group, Institut Pasteur, 25 rue du Docteur Roux, 75015 Paris, France.

Published: November 2008

Motivation: Although metabolic reactions are unquestionably shaped by evolutionary processes, the degree to which the overall structure and complexity of their interconnections are linked to the phylogeny of species has not been evaluated in depth. Here, we apply an original metabolome representation, termed Network of Interacting Pathways or NIP, with a combination of graph theoretical and machine learning strategies, to address this question. NIPs compress the information of the metabolic network exhibited by a species into much smaller networks of overlapping metabolic pathways, where nodes are pathways and links are the metabolites they exchange.

Results: Our analysis shows that a small set of descriptors of the structure and complexity of the NIPs combined into regression models reproduce very accurately reference phylogenetic distances derived from 16S rRNA sequences (10-fold cross-validation correlation coefficient higher than 0.9). Our method also showed better scores than previous work on metabolism-based phylogenetic reconstructions, as assessed by branch distances score, topological similarity and second cousins score. Thus, our metabolome representation as network of overlapping metabolic pathways captures sufficient information about the underlying evolutionary events leading to the formation of metabolic networks and species phylogeny. It is important to note that precise knowledge of all of the reactions in these pathways is not required for these reconstructions. These observations underscore the potential for the use of abstract, modular representations of metabolic reactions as tools in studying the evolution of species.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2579716PMC
http://dx.doi.org/10.1093/bioinformatics/btn503DOI Listing

Publication Analysis

Top Keywords

phylogenetic distances
8
interacting pathways
8
metabolic reactions
8
structure complexity
8
metabolome representation
8
overlapping metabolic
8
metabolic pathways
8
pathways
6
metabolic
6
distances encoded
4

Similar Publications

Computational Methods for Lineage Reconstruction.

Methods Mol Biol

January 2025

Centro Nacional de Análisis Genómico, Barcelona, Spain.

The recent development of genetic lineage recorders, designed to register the genealogical history of cells using induced somatic mutations, has opened the possibility of reconstructing complete animal cell lineages. To reconstruct a cell lineage tree from a molecular recorder, it is crucial to use an appropriate reconstruction algorithm. Current approaches include algorithms specifically designed for cell lineage reconstruction and the repurposing of phylogenetic algorithms.

View Article and Find Full Text PDF

Backtracking Cell Phylogenies in the Human Brain with Somatic Mosaic Variants.

Methods Mol Biol

January 2025

Sorbonne Université, Institut du Cerveau (Paris Brain Institute) ICM, Inserm, CNRS, Hôpital de la Pitié Salpêtrière, Paris, France.

Somatic mosaic variants, and especially somatic single nucleotide variants (sSNVs), occur in progenitor cells in the developing human brain frequently enough to provide permanent, unique, and cumulative markers of cell divisions and clones. Here, we describe an experimental workflow to perform lineage studies in the human brain using somatic variants. The workflow consists in two major steps: (1) sSNV calling through whole-genome sequencing (WGS) of bulk (non-single-cell) DNA extracted from human fresh-frozen tissue biopsies, and (2) sSNV validation and cell phylogeny deciphering through single nuclei whole-genome amplification (WGA) followed by targeted sequencing of sSNV loci.

View Article and Find Full Text PDF

Measurements of cell phylogeny based on natural or induced mutations, known as lineage barcodes, in conjunction with molecular phenotype have become increasingly feasible for a large number of single cells. In this chapter, we delve into Quantitative Fate Mapping (QFM) and its computational pipeline, which enables the interrogation of the dynamics of progenitor cells and their fate restriction during development. The methods described here include inferring cell phylogeny with the Phylotime model, and reconstructing progenitor state hierarchy, commitment time, population size, and commitment bias with the ICE-FASE algorithm.

View Article and Find Full Text PDF

Bayesian Phylogenetic Lineage Reconstruction with Loss of Heterozygosity Mutations Derived from Single-Cell RNA Sequencing.

Methods Mol Biol

January 2025

Allen Discovery Center for Lineage Tracing and Department of Laboratory Medicine & Pathology, University of Washington, Seattle, WA, USA.

Mutations are acquired frequently, such t`hat each cell's genome inscribes its history of cell divisions. Loss of heterozygosity (LOH) accumulates throughout the genome, offering large encoding capacity for phylogenetic inference of cell lineage.In this chapter, we demonstrate a method, using single-cell RNA sequencing, for reconstructing cell lineages from inferred LOH events in a Bayesian manner, annotating the lineage with cell phenotypes, and marking developmental time points based on X-chromosome inactivation.

View Article and Find Full Text PDF

Unlabelled: Thousands of complete genome sequences for strains of a species that are now available enable the advancement of pangenome analytics to a new level of sophistication. We collected 2,377 publicly available complete genomes of for detailed pangenome analysis. The core genome and accessory genomes consisted of 2,398 and 5,182 genes, respectively.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!