Orthology detection combining clustering and synteny for very large datasets.

PLoS One

Bioinformatics Group, Department of Computer Science, Universität Leipzig, Leipzig, Germany; Interdisciplinary Center for Bioinformatics, Universität Leipzig, Leipzig, Germany; Max Planck Institute for Mathematics in the Sciences, Leipzig, Germany; Institute for Theoretical Chemistry, University of Vienna, Vienna, Austria; Center for non-coding RNA in Technology and Health, University of Copenhagen, Frederiksberg, Denmark; The Santa Fe Institute, Santa Fe, New Mexico, United States of America; RNomics Group, Fraunhofer Institut for Cell Therapy and Immunology, Leipzig, Germany.

Published: May 2015

The elucidation of orthology relationships is an important step both in gene function prediction as well as towards understanding patterns of sequence evolution. Orthology assignments are usually derived directly from sequence similarities for large data because more exact approaches exhibit too high computational costs. Here we present PoFF, an extension for the standalone tool Proteinortho, which enhances orthology detection by combining clustering, sequence similarity, and synteny. In the course of this work, FFAdj-MCS, a heuristic that assesses pairwise gene order using adjacencies (a similarity measure related to the breakpoint distance) was adapted to support multiple linear chromosomes and extended to detect duplicated regions. PoFF largely reduces the number of false positives and enables more fine-grained predictions than purely similarity-based approaches. The extension maintains the low memory requirements and the efficient concurrency options of its basis Proteinortho, making the software applicable to very large datasets.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4138177PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0105015PLOS

Publication Analysis

Top Keywords

orthology detection
8
detection combining
8
combining clustering
8
large datasets
8
orthology
4
clustering synteny
4
synteny large
4
datasets elucidation
4
elucidation orthology
4
orthology relationships
4

Similar Publications

Phytocytokines belong to a category of small secreted peptides with signaling functions that play pivotal roles in diverse plant physiological processes. However, due to low levels of sequence conservation across plant species and poorly understood biological functions, the accurate detection and annotation of corresponding genes is challenging. The availability of a high-quality apple (Malus domestica) genome has enabled the exploration of five phytocytokine gene families, selected on the basis of their altered expression profiles in response to biotic stresses.

View Article and Find Full Text PDF

Light sensing is a critical function in most organisms and is mediated by photoreceptor proteins and phototransduction. Although most nematodes lack eyes, some species exhibit phototaxis. In the nematode Caenorhabditis elegans, the unique photoreceptor protein Cel-LITE-1, its downstream G proteins, and cyclic GMP (cGMP)-dependent pathways are required for phototransduction.

View Article and Find Full Text PDF
Article Synopsis
  • Multiple sequence alignments and phylogenetic trees are essential tools in biology research, providing detailed biological insights.
  • PhyKIT is a software tool designed to streamline the analysis of these alignments and trees, offering various functionalities like constructing supermatrices and identifying errors in orthology inference.
  • The text outlines several protocols for using PhyKIT, covering installation, data processing, and features that aid in understanding gene functions and evolutionary relationships.
View Article and Find Full Text PDF

Single-cell transcriptome sequencing technology has been applied to decode the cell types and functional states of immune cells, revealing their tissue-specific gene expression patterns and functions in cancer immunity. Comprehensive assessments of immune cells within and across tissues will provide us with a deeper understanding of the tumor immune system in general. Here, we present Cross-tissue Immune cell type or state Enrichment analysis of gene lists for Cancer (CIEC), the first web-based application that integrates database and enrichment analysis to estimate the cross-tissue immune cell type or state.

View Article and Find Full Text PDF

Analysis and identification of mRNAsi‑related expression signatures via RNA sequencing in lung cancer.

Oncol Lett

November 2024

Clinical Research Unit, Shanghai Chest Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai 200050, P.R. China.

High stemness index scores are associated with poor survival in patients with lung cancer. Studies on the mRNA expression-based stemness index (mRNAsi) are typically conducted using tumor tissues; however, mRNAsi-related expression signatures based on cell-free RNA (cfRNA) are yet to be comprehensively investigated. The present study aimed to elucidate the gene expression profiles of tumor stemness in lung cancer tissues and corresponding cfRNAs in blood, and to assess their links with immune infiltration.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!