Clustering and assembly of expressed sequence tags (ESTs) constitute the basis for most genomewide descriptions of a transcriptome. This approach is limited by the decline in sequence quality toward the end of each EST, impacting both sequence clustering and assembly. Here, we exploit the available draft genome sequence of the unicellular green alga Chlamydomonas reinhardtii to guide clustering and to correct errors in the ESTs. We have grouped all available EST and cDNA sequences into 12,063 ACEGs (assembly of contiguous ESTs based on genome) and generated 15,857 contigs of average length 934 nt. We predict that roughly 3000 of our contigs represent full-length transcripts. Compared to previous assemblies, ACEGs show extended contig length, increased accuracy and a reduction in redundancy. Because our assembly protocol also uses ESTs with no corresponding genomic sequences, it provides sequence information for genes interrupted by sequence gaps. Detailed analysis of randomly sampled ACEGs reveals several hundred putative cases of alternative splicing, many overlapping transcription units and new genes not identified by gene prediction algorithms. Our protocol, although developed for and tailored to the C. reinhardtii dataset, can be exploited by any eukaryotic genome project for which both a draft genome sequence and ESTs are available.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1874618PMC
http://dx.doi.org/10.1093/nar/gkm081DOI Listing

Publication Analysis

Top Keywords

draft genome
12
genome sequence
12
sequence
8
chlamydomonas reinhardtii
8
clustering assembly
8
genome
5
ests
5
est assembly
4
assembly supported
4
supported draft
4

Similar Publications

High-quality draft genomes of six subspecies strains from Cambodian poultry marketplaces were sequenced. The strains were identified as Corvallis-, Monschaui-, and Kentucky-serovars. The fluoroquinolone resistance gene, was found in three strains in different Cambodian provinces.

View Article and Find Full Text PDF

To address food and nutrition security in the face of burgeoning global populations and erratic climatic conditions there is a need to include nutrient dense, climatic resilient but neglected indigenous fruit trees in agrifood systems. Here we present the draft genome sequence of Kei Apple, Dovyalis afra, a neglected indigenous African fruit tree with untapped potential to contribute to nutrient security and improved livelihoods. Our long-read-based genome assembly comprises 440 Mbp sequence across 1190 contigs with a N50 and L50 of 13.

View Article and Find Full Text PDF

sp. nov., a crude oil aggregation-forming anaerobic bacterium isolated from marine sediment.

Int J Syst Evol Microbiol

January 2025

Graduate School of Marine Science and Technology, Tokyo University of Marine Science and Technology, 4-5-7 Konan, Minato, Tokyo 108-8477, Japan.

A crude oil aggregation-forming, strictly anaerobic, Gram-stain-positive, spore-forming, rod-shaped, motile and mesophilic bacterium, named strain SH18-2, was isolated from marine sediment near Sado Island in the Sea of Japan. The temperature, salinity and pH ranges of this strain for the growth were 15-40 °C (optimum 35 °C), 0.5-6.

View Article and Find Full Text PDF

Draft genome sequence of glyphosate-degrading sp. strain GPK 3 (VKM B-2554D) isolated from agricultural soil.

Microbiol Resour Announc

January 2025

G.K. Skryabin Institute of Biochemistry and Physiology of Microorganisms, Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences, Federal Research Center, Pushchino, Russia.

The genome of sp. isolated from agricultural soil polluted with the herbicide glyphosate is reported. The genome size is 5.

View Article and Find Full Text PDF

The draft genome sequence of DS1 was isolated for high-acidity vinegar fermentation. The DS1 draft genome has a length of 8,793,442 bp, with 44.27% GC content, 3,053 proteins, 3,169 genes, and 15 genes that play an important role in acetic acid production.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!