The evolutionary histories of genes can differ greatly from one another, a variation that can be explained by horizontal gene transfer or recombination events. The phylogenetic tree of a gene therefore represents that gene's own evolutionary history, which may show patterns different from those of the species tree that defines the main evolutionary patterns. Phylogenetic trees of closely related species are usually merged so as to minimize the topological conflicts among them. The traditional approaches are consensus tree inference (when all trees are defined on the same set of species, i.e. homogeneous data) and supertree inference (when the trees are defined on different but overlapping sets of species, i.e. heterogeneous data). Both produce a single tree and therefore lose precision when the input trees reflect different evolutionary histories. Other approaches preserve this variability by clustering the trees with the k-means or k-medoids algorithm. Using a new method, we determine all possible consensus trees and supertrees that best represent the most significant evolutionary models in a set of phylogenetic trees, thereby increasing the precision of the results and decreasing the time required. This paper presents in detail a new method for predicting the number of clusters in a Robinson and Foulds (RF) distance matrix using a convolutional neural network (CNN). We developed a new CNN approach (called CNNTrees) for multiple tree classification. This strategy returns the number of clusters of the input phylogenetic trees for sets of trees of different sizes, which makes the approach more stable and more robust.
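The RF distance matrix mentioned above can be illustrated with a minimal sketch. It assumes trees are given as nested Python tuples of taxon names and that all trees share the same taxa (the consensus setting); the function names `bipartitions`, `rf_distance` and `rf_matrix` are hypothetical and are not taken from the CNNTrees source code.

```python
from itertools import combinations


def bipartitions(tree):
    """Return the non-trivial splits of a nested-tuple tree.

    Each split is stored as the side that does NOT contain a fixed
    reference taxon, so the same split always has the same representation.
    """
    splits = set()

    def visit(node):
        # Leaves are strings; internal nodes are tuples of children.
        if isinstance(node, str):
            return frozenset([node])
        return frozenset().union(*(visit(child) for child in node))

    taxa = visit(tree)
    anchor = min(taxa)  # reference taxon (illustrative choice)

    def collect(node):
        if isinstance(node, str):
            return frozenset([node])
        clade = frozenset().union(*(collect(child) for child in node))
        side = taxa - clade if anchor in clade else clade
        # Skip trivial splits (a single leaf against everything else).
        if 1 < len(side) < len(taxa) - 1:
            splits.add(side)
        return clade

    collect(tree)
    return splits


def rf_distance(tree_a, tree_b):
    """Robinson and Foulds distance: number of splits found in only one tree."""
    return len(bipartitions(tree_a) ^ bipartitions(tree_b))


def rf_matrix(trees):
    """Symmetric matrix of pairwise RF distances."""
    n = len(trees)
    splits = [bipartitions(t) for t in trees]
    matrix = [[0] * n for _ in range(n)]
    for i, j in combinations(range(n), 2):
        matrix[i][j] = matrix[j][i] = len(splits[i] ^ splits[j])
    return matrix


trees = [
    (("A", "B"), ("C", "D")),
    (("A", "C"), ("B", "D")),
    ((("A", "B"), "C"), "D"),
]
matrix = rf_matrix(trees)
print(matrix)  # [[0, 2, 0], [2, 0, 2], [0, 2, 0]]
```

The first and third trees differ only in where the root is placed, so their unrooted RF distance is 0. For the supertree setting, where trees have different but overlapping taxa, each pair of trees would first have to be reduced to its common taxa; that step is omitted here.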
The paper provides an in-depth analysis of the relevant, but very difficult, problem of constructing alternative supertrees from phylogenies with different but overlapping sets of taxa. This new model will play an important role in the inference of Trees of Life (ToL). CNNTrees is available through a web server at https://tahirinadia.github.io/. The source code, data and information about installation procedures are also available at https://github.com/TahiriNadia/CNNTrees. Supplementary data are available on the GitHub platform.

The evolutionary history of species is not unique but is specific to sets of genes. Indeed, each gene has its own evolutionary history, which can differ considerably from that of other genes. For example, some individual genes or operons may be affected by specific horizontal gene transfer and recombination events. Thus, the evolutionary history of each gene must be represented by its own phylogenetic tree, which may exhibit evolutionary patterns different from those of the species tree that accounts for the major vertical descent patterns. Traditional consensus tree and supertree inference methods return a single consensus tree or supertree. In this paper, we present in detail a new method for predicting the number of clusters in a Robinson and Foulds (RF) distance matrix using a convolutional neural network (CNN). We developed a new CNN approach (CNNTrees) for multiple tree classification. This strategy returns a number of clusters in the order of the input trees, which makes the approach more stable and more robust.
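To make the task of "predicting the number of clusters in a distance matrix" concrete, the sketch below shows a classical baseline of the kind the abstract contrasts with: k-medoids clustering combined with the mean silhouette width to pick the number of clusters. It is not the CNNTrees method. The brute-force medoid search is only practical for very small inputs, and all function names and the example matrix are assumptions made for illustration.

```python
from itertools import combinations


def k_medoids_exhaustive(dist, k):
    """Brute-force k-medoids: try every medoid set and keep the cheapest."""
    n = len(dist)
    best_cost, best_medoids = None, None
    for medoids in combinations(range(n), k):
        cost = sum(min(dist[i][m] for m in medoids) for i in range(n))
        if best_cost is None or cost < best_cost:
            best_cost, best_medoids = cost, medoids
    # Assign every item to its nearest medoid (first one wins on ties).
    labels = [
        min(range(k), key=lambda c: dist[i][best_medoids[c]])
        for i in range(n)
    ]
    return labels, best_medoids, best_cost


def mean_silhouette(dist, labels):
    """Mean silhouette width for a labelling of a distance matrix."""
    clusters = {}
    for i, lab in enumerate(labels):
        clusters.setdefault(lab, []).append(i)
    if len(clusters) < 2:
        return 0.0  # silhouette is undefined for a single cluster
    total = 0.0
    for i, lab in enumerate(labels):
        own = clusters[lab]
        if len(own) == 1:
            continue  # a singleton contributes a silhouette of 0
        a = sum(dist[i][j] for j in own if j != i) / (len(own) - 1)
        b = min(
            sum(dist[i][j] for j in members) / len(members)
            for other, members in clusters.items()
            if other != lab
        )
        if max(a, b) > 0:
            total += (b - a) / max(a, b)
    return total / len(labels)


def choose_k(dist, k_max):
    """Return the k in 2..k_max with the highest mean silhouette width."""
    scores = {}
    for k in range(2, k_max + 1):
        labels, _, _ = k_medoids_exhaustive(dist, k)
        scores[k] = mean_silhouette(dist, labels)
    best_k = max(scores, key=scores.get)
    return best_k, scores


# Hypothetical RF-like distances: two groups of three trees each.
dist = [
    [0, 2, 2, 8, 8, 8],
    [2, 0, 2, 8, 8, 8],
    [2, 2, 0, 8, 8, 8],
    [8, 8, 8, 0, 2, 2],
    [8, 8, 8, 2, 0, 2],
    [8, 8, 8, 2, 2, 0],
]
best_k, scores = choose_k(dist, k_max=4)
print(best_k)  # 2
```

A baseline like this has to cluster the trees once for every candidate k, which is the kind of repeated work a direct prediction of the number of clusters from the RF matrix would avoid.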

Source: http://dx.doi.org/10.1142/S0219720022500123
