Treelink: data integration, clustering and visualization of phylogenetic trees.

BMC Bioinformatics

Faculty of Engineering and Sciences, Universidad Adolfo Ibañez, Diagonal las Torres 2640, Santiago, 7941169, Chile.

Published: December 2015

Background: Phylogenetic trees are central to a wide range of biological studies. In many of these studies, tree nodes need to be associated with a variety of attributes. For example, in studies concerned with viral relationships, tree nodes are associated with epidemiological information, such as location, age and subtype. Gene trees used in comparative genomics are usually linked with taxonomic information, such as functional annotations and events. A wide variety of tree visualization and annotation tools have been developed in the past, however none of them are intended for an integrative and comparative analysis.

Results: Treelink is a platform-independent software for linking datasets and sequence files to phylogenetic trees. The application allows an automated integration of datasets to trees for operations such as classifying a tree based on a field or showing the distribution of selected data attributes in branches and leafs. Genomic and proteonomic sequences can also be linked to the tree and extracted from internal and external nodes. A novel clustering algorithm to simplify trees and display the most divergent clades was also developed, where validation can be achieved using the data integration and classification function. Integrated geographical information allows ancestral character reconstruction for phylogeographic plotting based on parsimony and likelihood algorithms.

Conclusion: Our software can successfully integrate phylogenetic trees with different data sources, and perform operations to differentiate and visualize those differences within a tree. File support includes the most popular formats such as newick and csv. Exporting visualizations as images, cluster outputs and genomic sequences is supported. Treelink is available as a web and desktop application at http://www.treelinkapp.com .

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4696249PMC
http://dx.doi.org/10.1186/s12859-015-0860-1DOI Listing

Publication Analysis

Top Keywords

phylogenetic trees
16
data integration
8
tree nodes
8
nodes associated
8
trees
7
tree
6
treelink data
4
integration clustering
4
clustering visualization
4
phylogenetic
4

Similar Publications

The Hepatincolaceae (Alphaproteobacteria) are a group of bacteria that inhabit the gut of arthropods and other ecdysozoans, associating extracellularly with microvilli. Previous phylogenetic studies, primarily single-gene analyses, suggested their relationship to the Holosporales, which includes intracellular bacteria in protist hosts. However, the genomics of Hepatincolaceae is still in its early stages.

View Article and Find Full Text PDF

Comparative Analysis of Protist Communities in Oilsands Tailings Using Amplicon Sequencing and Metagenomics.

Environ Microbiol

January 2025

Division of Infectious Diseases, Department of Medicine, and Department of Biological Sciences, University of Alberta, Edmonton, Alberta, Canada.

The Canadian province of Alberta contains substantial oilsands reservoirs, consisting of bitumen, clay and sand. Extracting oil involves separating bitumen from inorganic particles using hot water and chemical diluents, resulting in liquid tailings waste with ecotoxicologically significant compounds. Ongoing efforts aim to reclaim tailings-affected areas, with protist colonisation serving as one assessment method of reclamation progress.

View Article and Find Full Text PDF

Genome-Wide Identification and Functional Characterization of Gene Family Reveal Its Involvement in Response to Stress in Cotton.

Int J Mol Sci

January 2025

Institute of Cotton, Hebei Academy of Agriculture and Forestry Sciences/Key Laboratory of Cotton Biology and Genetic Breeding in Huanghuaihai Semiarid Area, Ministry of Agriculture and Rural Affairs, Shijiazhuang 050000, China.

SKP1 constitutes the Skp1-Cullin-F-box ubiquitin E3 ligase (SCF), which plays a role in plant growth and development and biotic and abiotic stress in ubiquitination. However, the response of the gene family to abiotic and biotic stresses in cotton has not been well characterized. In this study, a total of 72 genes with the conserved domain of SKP1 were identified in four Gossypium species.

View Article and Find Full Text PDF

Systematic Analysis of Cotton RING E3 Ubiquitin Ligase Genes Reveals Their Potential Involvement in Salt Stress Tolerance.

Int J Mol Sci

January 2025

Key Laboratory of Cotton Breeding and Cultivation in Huang-Huai-Hai Plain, Ministry of Agriculture and Rural Affairs, Institute of Industrial Crops Shandong Academy of Agricultural Sciences, Jinan 250100, China.

The Really Interesting New Gene (RING) E3 ubiquitin ligases represent the largest class of E3 ubiquitin ligases involved in protein degradation and play a pivotal role in plant growth, development, and environmental responses. Despite extensive studies in numerous plant species, the functions of RING E3 ligases in cotton remain largely unknown. In this study, we performed systematic identification, characterization, and expression analysis of genes in cotton.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!