De novo transcriptome assembly of Sorghum bicolor variety Taejin.

Genom Data

Department of Agricultural Biotechnology, College of Agriculture and Life Sciences, Seoul National University, Seoul 151-921, Republic of Korea; The Taejin Genome Institute, Gadam-gil 61, Hoengseong, 25239, Republic of Korea.

Published: June 2016

Sorghum (Sorghum bicolor), also known as great millet, is one of the most popular cultivated grass species in the world. It is consumed as food by humans and animals and is also used for ethanol production. In this study, we conducted de novo transcriptome assembly for sorghum variety Taejin by next-generation sequencing, obtaining 8.748 GB of raw data. The raw data are available in the NCBI SRA database under accession number SRX1715644. Using the Trinity program, we identified 222,161 transcripts from sorghum variety Taejin. We then predicted coding regions within the assembled transcripts with the TransDecoder program, yielding a total of 148,531 proteins. We carried out BLASTP searches against the Swiss-Prot protein sequence database to annotate the functions of the identified proteins. To our knowledge, this is the first transcriptome data set for a sorghum variety derived from Korea, and it can be applied to the generation of genetic markers.
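
The three steps named in the abstract (Trinity assembly, TransDecoder coding-region prediction, BLASTP against Swiss-Prot) can be sketched as a list of command lines. This is only an illustrative sketch, not the authors' actual invocation: the abstract does not state read layout, parameters, or file names, so the paired-end input, the file paths, the thread and memory settings, and the 1e-5 E-value cutoff below are all assumptions. The location of the Trinity output FASTA and the name of the TransDecoder peptide file also vary between tool versions, so both are passed in as parameters rather than fixed.

```python
import shlex
from typing import List


def build_pipeline(
    left: str,
    right: str,
    out_dir: str = "trinity_out",                   # hypothetical output directory
    transcripts: str = "trinity_out/Trinity.fasta",  # assumed path; differs by Trinity version
    peptides: str = "Trinity.fasta.transdecoder.pep",  # assumed TransDecoder output name
    swissprot_db: str = "uniprot_sprot",            # hypothetical local BLAST database name
    threads: int = 8,                               # assumed, not from the abstract
    max_memory: str = "50G",                        # assumed, not from the abstract
) -> List[List[str]]:
    """Return the command lines for assembly, ORF prediction, and annotation.

    Nothing is executed here; the function only assembles argument lists.
    """
    return [
        # Step 1: de novo assembly of (assumed paired-end) FASTQ reads.
        ["Trinity", "--seqType", "fq", "--left", left, "--right", right,
         "--max_memory", max_memory, "--CPU", str(threads),
         "--output", out_dir],
        # Step 2a: extract long open reading frames from the transcripts.
        ["TransDecoder.LongOrfs", "-t", transcripts],
        # Step 2b: predict the likely coding regions.
        ["TransDecoder.Predict", "-t", transcripts],
        # Step 3: search predicted proteins against Swiss-Prot (tabular output).
        ["blastp", "-query", peptides, "-db", swissprot_db,
         "-outfmt", "6", "-evalue", "1e-5",
         "-num_threads", str(threads), "-out", "blastp.outfmt6"],
    ]


if __name__ == "__main__":
    # Print the commands for inspection instead of running them.
    for cmd in build_pipeline("reads_1.fq", "reads_2.fq"):
        print(shlex.join(cmd))
```

Building the commands as argument lists keeps the sketch free of shell quoting issues; each list could be passed to `subprocess.run(cmd, check=True)` once the paths and settings have been adjusted to the actual data.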

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4878842
DOI: http://dx.doi.org/10.1016/j.gdata.2016.05.002
