One of the major challenges in genome research is the identification of the complete set of genes in a genome. Alignments of expressed sequences (RNA and EST) with genomic sequences have been used to characterize genes. However, the number of alignments far exceeds the likely number of genes in a genome, suggesting that, for many genes, two or more alignments can be joined through overlapping sequences to yield accurate gene structures. High-throughput EST sequencing becomes less efficient in closing those alignment gaps due to its nonselective nature. We sought to bridge these alignments through a novel approach: targeted cDNA sequencing. Human expressed sequences from GenBank version 124 were aligned with the genomic sequence from NCBI build 24 using LEADS, Compugen's EST and RNA clustering and assembly software system. Nine hundred forty-eight pairs of alignments were selected based on EST clone information and/or their homology to the same known proteins. Reverse transcriptase PCR and sequencing yielded sequences for 363 of those pairs. These sequences helped characterize over 60 novel or otherwise incomplete genes in the recent UniGene build 153, which included over 1 million additional ESTs. These results indicate that this integrated and targeted strategy, combining computational prediction and experimental cDNA sequencing, can efficiently generate the overlapping sequences and enable the full characterization of genomes. Additional information about the contig pairs, the resultant overlapping sequences, tissue sources, and tissue profiles are available in a supplemental file.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.ygeno.2003.07.003DOI Listing

Publication Analysis

Top Keywords

cdna sequencing
12
overlapping sequences
12
targeted cdna
8
genes genome
8
sequences
8
expressed sequences
8
alignments
6
sequencing
5
genes
5
bridging expressed
4

Similar Publications

Variation of gene ratios in mock communities constructed with purified 16S rRNA during processing.

Sci Rep

December 2024

Department of Chemical Engineering, Polytechnic School, University of São Paulo, Av. Prof. Luciano Gualberto, Travessa 3, n. 380., São Paulo, SP, CEP 05508-900, Brazil.

16S ribosomal nucleic acid (16S rRNA) analysis allows to specifically target the metabolically active members of microbial communities. The stability of the ratios between target genes in the workflow, which is essential for the bioprocess-relevance of the data derived from this analysis, was investigated using synthetic mock communities constructed by mixing purified 16S rRNA from Bacillus subtilis (Bs), Staphylococcus aureus (Sa), Pseudomonas aeruginosa (Pa), Klebsiella pneumoniae (Kp) and Burkholderia cepacia (Bc) in different proportions. The RT reaction yielded one copy of cDNA per rRNA molecule for Pa, Bc and Sa but only 2/3 of the expected cDNA from 16S rRNAs of Bs and Kp.

View Article and Find Full Text PDF

Background & objectives Alkaptonuria (AKU) is an autosomal recessive disease wherein biallelic pathogenic variants in the homogentisate 1,2- dioxygenase (HGD) gene encoding the enzyme homogentisate 1,2 dioxygenase cause high levels of homogentisic acid (HGA) to circulate within the body leading to its deposition in connective tissues and excretion in urine. A homozygous splice donor variant (c.87+1G>A) has been identified to be the founder variant causing alkaptonuria among Narikuravars, a group of gypsies settled in Tamil Nadu.

View Article and Find Full Text PDF

Background: Myogenic factor 6 (Myf6) plays an important role in muscle growth and differentiation. In aquatic animals and livestock, Myf6 contributes to improving meat quality and strengthening the accumulation of muscle flavor substances. However, studies on Myf6 gene polymorphisms in crustaceans have not been reported.

View Article and Find Full Text PDF

We developed a versatile 'IHC/LCM-Seq' method for spatial transcriptomics of immunohistochemically detected neurons collected with laser-capture microdissection (LCM). IHC/LCM-Seq uses aluminon and polyvinyl sulfonic acid for inventive RNA-preserving strategies to maintain RNA integrity in free-floating sections of 4% formaldehyde-fixed brains. To validate IHC/LCM-Seq, we first immunostained and harvested striatal cholinergic interneurons with LCM.

View Article and Find Full Text PDF

Outdoor microcosms, metabarcoding with next-generation sequencing of the 16S rRNA bacterial gene, total body score (TBS) and physicochemical analyses were used to monitor Mus musculus decomposition aboveground (A) and in the subsurface (S), and compared to soil-only controls (C). As determined by MaAsLin2 analysis, significant shifts in bacterial communities at 30 cm depths within the A, S and C treatments distinguished control from experimental soils, and between aboveground and subsurface deposition, demonstrating the potential for gravesoil discrimination during the first 90 days. For example, Dokdonella (p = 0.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!