The Physalis peruviana leaf transcriptome: assembly, annotation and gene model prediction.

BMC Genomics

Plant Molecular Genetics Laboratory, Center of Biotechnology and Bioindustry (CBB), Colombian Corporation for Agricultural Research (CORPOICA), Bogota, Colombia.

Published: April 2012

AI Article Synopsis

  • Cape gooseberry (Physalis peruviana) is gaining attention for its nutritional and medicinal properties, but genomic resources for it are limited compared to other Solanaceae family members like tomato and potato.
  • Researchers generated 652,614 Expressed Sequence Tags (ESTs) from Cape gooseberry leaves, identifying significant biological functions and developing genetic markers to enhance the understanding of its genomic features.
  • This study offers the first detailed analysis of Cape gooseberry's leaf transcriptome, providing essential resources for future genetic research and potential applications in improving the species' productivity and fruit quality.

Article Abstract

Background: Physalis peruviana commonly known as Cape gooseberry is a member of the Solanaceae family that has an increasing popularity due to its nutritional and medicinal values. A broad range of genomic tools is available for other Solanaceae, including tomato and potato. However, limited genomic resources are currently available for Cape gooseberry.

Results: We report the generation of a total of 652,614 P. peruviana Expressed Sequence Tags (ESTs), using 454 GS FLX Titanium technology. ESTs, with an average length of 371 bp, were obtained from a normalized leaf cDNA library prepared using a Colombian commercial variety. De novo assembling was performed to generate a collection of 24,014 isotigs and 110,921 singletons, with an average length of 1,638 bp and 354 bp, respectively. Functional annotation was performed using NCBI's BLAST tools and Blast2GO, which identified putative functions for 21,191 assembled sequences, including gene families involved in all the major biological processes and molecular functions as well as defense response and amino acid metabolism pathways. Gene model predictions in P. peruviana were obtained by using the genomes of Solanum lycopersicum (tomato) and Solanum tuberosum (potato). We predict 9,436 P. peruviana sequences with multiple-exon models and conserved intron positions with respect to the potato and tomato genomes. Additionally, to study species diversity we developed 5,971 SSR markers from assembled ESTs.

Conclusions: We present the first comprehensive analysis of the Physalis peruviana leaf transcriptome, which will provide valuable resources for development of genetic tools in the species. Assembled transcripts with gene models could serve as potential candidates for marker discovery with a variety of applications including: functional diversity, conservation and improvement to increase productivity and fruit quality. P. peruviana was estimated to be phylogenetically branched out before the divergence of five other Solanaceae family members, S. lycopersicum, S. tuberosum, Capsicum spp, S. melongena and Petunia spp.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3488962PMC
http://dx.doi.org/10.1186/1471-2164-13-151DOI Listing

Publication Analysis

Top Keywords

physalis peruviana
12
peruviana leaf
8
leaf transcriptome
8
gene model
8
solanaceae family
8
average length
8
peruviana
6
transcriptome assembly
4
assembly annotation
4
gene
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!