Nowadays defying homology relationships among sequences is essential for biological research. Within homology the analysis of orthologs sequences is of great importance for computational biology, annotation of genomes and for phylogenetic inference. Since 2007, with the increase in the number of new sequences being deposited in large biological databases, researchers have begun to analyse computerized methodologies and tools aimed at selecting the most promising ones in the prediction of orthologous groups. Literature in this field of research describes the problems that the majority of available tools show, such as those encountered in accuracy, time required for analysis (especially in light of the increasing volume of data being submitted, which require faster techniques) and the automatization of the process without requiring manual intervention. Conducting our search through BMC, Google Scholar, NCBI PubMed, and Expasy, we examined more than 600 articles pursuing the most recent techniques and tools developed to solve most the problems still existing in orthology detection. We listed the main computational tools created and developed between 2011 and 2017, taking into consideration the differences in the type of orthology analysis, outlining the main features of each tool and pointing to the problems that each one tries to address. We also observed that several tools still use as their main algorithm the BLAST "all-against-all" methodology, which entails some limitations, such as limited number of queries, computational cost, and high processing time to complete the analysis. However, new promising tools are being developed, like OrthoVenn (which uses the Venn diagram to show the relationship of ortholog groups generated by its algorithm); or proteinOrtho (which improves the accuracy of ortholog groups); or ReMark (tackling the integration of the pipeline to turn the entry process automatic); or OrthAgogue (using algorithms developed to minimize processing time); and proteinOrtho (developed for dealing with large amounts of biological data). We made a comparison among the main features of four tool and tested them using four for prokaryotic genomas. We hope that our review can be useful for researchers and will help them in selecting the most appropriate tool for their work in the field of orthology.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5674930 | PMC |
http://dx.doi.org/10.3389/fgene.2017.00165 | DOI Listing |
Microbial Genome Database for Comparative Analysis (MBGD) is a comprehensive ortholog database encompassing published complete microbial genomes. The ortholog tables in MBGD are constructed in a hierarchical manner. The top-level ortholog table is now constructed from 1,812 genus-level pan-genomes, 6,268 species-level pan-genomes, and 34,079 genomes in total.
View Article and Find Full Text PDFImeta
December 2024
Children's Hospital Zhejiang University School of Medicine, National Clinical Research Center for Child Health Hangzhou China.
First introduced in 2021, MetOrigin has quickly established itself as a powerful web server to distinguish microbial metabolites and identify the bacteria responsible for specific metabolic processes. Building on the growing understanding of the interplay between the microbiome and metabolome, and in response to user feedback, MetOrigin has undergone a significant upgrade to version 2.0.
View Article and Find Full Text PDFFront Genet
December 2024
Department of Agricultural Sciences, University of Naples 'Federico II', Portici, Italy.
Microbiome
December 2024
Department of Biochemistry, Western University, Middlesex Drive, London, N6G 2V4, Ontario, Canada.
Background: The application of '-omics' technologies to study bacterial vaginosis (BV) has uncovered vast differences in composition and scale between the vaginal microbiomes of healthy and BV patients. Compared to amplicon sequencing and shotgun metagenomic approaches focusing on a single or few species, investigating the transcriptome of the vaginal microbiome at a system-wide level can provide insight into the functions which are actively expressed and differential between states of health and disease.
Results: We conducted a meta-analysis of vaginal metatranscriptomes from three studies, split into exploratory (n = 42) and validation (n = 297) datasets, accounting for the compositional nature of sequencing data and differences in scale between healthy and BV microbiomes.
Genetics
December 2024
Instituto de Biología Molecular de Barcelona (IBMB), CSIC, Parc Científic de Barcelona, C. Baldiri Reixac, 4-8, 08028 Barcelona, Spain.
Transcription factors (TFs) play a pivotal role in orchestrating critical intricate patterns of gene regulation. Although gene expression is complex, differential expression of hundreds of genes is often due to regulation by just a handful of TFs. Despite extensive efforts to elucidate TF-target regulatory relationships in Caenorhabditis elegans, existing experimental datasets cover distinct subsets of TFs and leave data integration challenging.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!