Using shared genomic synteny and shared protein functions to enhance the identification of orthologous gene pairs.

Bioinformatics

Assays and Bioinformatics, Celera Genomics Corporation, 45 West Gude Drive, Rockville, MD 20850, USA.

Published: March 2005

Motivation: The identification of orthologous gene pairs is generally based on sequence similarity. Gene pairs that are mutually 'best hits' between the genomes being compared are asserted to be orthologs. Although this method identifies most orthologous gene pairs with high confidence, it will miss a fraction of them, especially genes in duplicated gene families. In addition, the approach depends heavily on the completeness and quality of gene annotation. When the gene sequences are not correctly represented the approach is unlikely to find the correct ortholog. To overcome these limitations, we have developed an approach to identify orthologous gene pairs using shared chromosomal synteny and the annotation of protein function.

Results: Assembled mouse and human genomes were used to identify the regions of conserved synteny between these genomes. 'Syntenic anchors' are conserved non-repetitive locations between mouse and human genomes. Using these anchors, we identified blocks of sequences that contain consistently ordered anchors between the two genomes (syntenic blocks). The synteny information has been used to help us identify orthologous gene pairs between mouse and human genomes. The approach combines the mutual selection of the best tBlastX hits between human and mouse transcripts, and inferring gene orthologous relationships based on sharing syntenic anchors, collocating in the same syntenic blocks and sharing the same annotated protein function. Using this approach, we were able to find 19,357 orthologous gene pairs between human and mouse genomes, a 20% increase in the number of orthologs identified by conventional approaches.

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/bti045DOI Listing

Publication Analysis

Top Keywords

gene pairs
28
orthologous gene
24
mouse human
12
human genomes
12
gene
11
identification orthologous
8
approach find
8
identify orthologous
8
syntenic blocks
8
human mouse
8

Similar Publications

To protect the body from infections, the brain has evolved the ability to coordinate behavioral and immunological responses. The conditioned immune response (CIR) is a form of Pavlovian conditioning wherein a sensory (for example, taste) stimulus, when paired with an immunomodulatory agent, evokes aversive behavior and an anticipatory immune response after re-experiencing the taste. Although taste and its valence are represented in the anterior insular cortex and immune response in the posterior insula and although the insula is pivotal for CIRs, the precise circuitry underlying CIRs remains unknown.

View Article and Find Full Text PDF

Purpose: This work described a new species of Ceratomyxa, based on morphological and phylogenetic analyzes of myxospores collected from the gallbladder of the fish Astyanax mexicanus.

Methods: Sixty-two specimens were captured, between December 2022 and February 2024, in the Flexal River, in the community of Tessalônica, state of Amapá. The specimens were transported alive to the Laboratory of Morphophysiology and Animal Health, at the State University of Amapá, where the studies were carried out.

View Article and Find Full Text PDF

Recent barcoding technologies allow reconstructing lineage trees while capturing paired single-cell RNA-sequencing (scRNA-seq) data. Such datasets provide opportunities to compare gene expression memory maintenance through lineage branching and pinpoint critical genes in these processes. Here we develop Permutation, Optimization, and Representation learning based single Cell gene Expression and Lineage ANalysis (PORCELAN) to identify lineage-informative genes or subtrees where lineage and expression are tightly coupled.

View Article and Find Full Text PDF

Obstacles in quantifying A-to-I RNA editing by Sanger sequencing.

Methods Enzymol

January 2025

Faculty of Biology, Technion - Israel Institute of Technology, Technion City, Haifa, Israel. Electronic address:

Adenosine-to-Inosine (A-to-I) RNA editing is the most prevalent type of RNA editing, in which adenosine within a completely or largely double-stranded RNA (dsRNA) is converted to inosine by deamination. RNA editing was shown to be involved in many neurological diseases and cancer; therefore, detection of A-to-I RNA editing and quantitation of editing levels are necessary for both basic and clinical biomedical research. While high-throughput sequencing (HTS) is widely used for global detection of editing events, Sanger sequencing is the method of choice for precise characterization of editing site clusters (hyper-editing) and for comparing levels of editing at a particular site under different environmental conditions, developmental stages, genetic backgrounds, or disease states.

View Article and Find Full Text PDF

CellREADR: An ADAR-based RNA sensor-actuator device.

Methods Enzymol

January 2025

Department of Neurobiology, Duke University School of Medicine, Durham, NC, United States; Department of Biomedical Engineering, Duke University, Durham, NC, United States. Electronic address:

RNAs are central mediators of genetic information flow and gene regulation that underlie diverse cell types and cell states across species. Thus, methods that can sense and respond to RNA profiles in living cells will have broad applications in biology and medicine. CellREADR - Cell access through RNA sensing by Endogenous ADAR (adenosine deaminase acting on RNA), is a programmable RNA sensor-actuator technology that couples the detection of a cell-defining RNA to the translation of an effector protein to monitor and manipulate the cell.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!