TASSEL-GBS: a high capacity genotyping by sequencing analysis pipeline.

PLoS One

Institute for Genomic Diversity, Cornell University, Ithaca, New York, United States of America ; USDA Agricultural Research Service, Ithaca, New York, United States of America.

Published: October 2014

Genotyping by sequencing (GBS) is a next generation sequencing based method that takes advantage of reduced representation to enable high throughput genotyping of large numbers of individuals at a large number of SNP markers. The relatively straightforward, robust, and cost-effective GBS protocol is currently being applied in numerous species by a large number of researchers. Herein we describe a bioinformatics pipeline, TASSEL-GBS, designed for the efficient processing of raw GBS sequence data into SNP genotypes. The TASSEL-GBS pipeline successfully fulfills the following key design criteria: (1) Ability to run on the modest computing resources that are typically available to small breeding or ecological research programs, including desktop or laptop machines with only 8-16 GB of RAM, (2) Scalability from small to extremely large studies, where hundreds of thousands or even millions of SNPs can be scored in up to 100,000 individuals (e.g., for large breeding programs or genetic surveys), and (3) Applicability in an accelerated breeding context, requiring rapid turnover from tissue collection to genotypes. Although a reference genome is required, the pipeline can also be run with an unfinished "pseudo-reference" consisting of numerous contigs. We describe the TASSEL-GBS pipeline in detail and benchmark it based upon a large scale, species wide analysis in maize (Zea mays), where the average error rate was reduced to 0.0042 through application of population genetic-based SNP filters. Overall, the GBS assay and the TASSEL-GBS pipeline provide robust tools for studying genomic diversity.

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3938676 (PMC)
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0090346 (PLOS)

Publication Analysis

Top Keywords

tassel-gbs pipeline: 12
genotyping sequencing: 8
individuals large: 8
large number: 8
pipeline: 6
large: 6
tassel-gbs: 5
tassel-gbs high: 4
high capacity: 4
capacity genotyping: 4

Similar Publications

Exploring genetic diversity of wild and related tetraploid wheat species Triticum turgidum and Triticum timopheevii.

J Adv Res

June 2023

Department of Plant Sciences and Landscape Architecture, University of Maryland, College Park, MD 20742, USA.

Introduction: The domestication bottleneck has reduced genetic diversity in wheat, necessitating the use of wild relatives in breeding programs. Wild tetraploid wheats are widely used in breeding programs, but they are difficult to distinguish by morphological characters alone, which results in misclassification, mislabeling, or duplication of accessions in the gene bank.

Objectives: The study aims to explore genotyping by sequencing (GBS) to characterize wild and domesticated tetraploid wheat accessions and to generate a core set of accessions to be used in the breeding program.

Genotyping-by-sequencing (GBS) allows rapid identification of markers for use in the development of linkage maps, which expedite efficient breeding programs. In the present study, we used a GBS approach to identify and genotype single-nucleotide polymorphism (SNP) markers in an inter-specific RIL population of Cicer arietinum L. X C.

 has been targeted for domestication as a future oilseed and catch crop. Three hundred eighty plants comprising genotypes of , , and their interspecific F mapping population were genotyped using genotyping by sequencing (GBS), and the generated polymorphic markers were used to construct a high-density genetic linkage map. TASSEL-GBS, a reference genome-based pipeline, was used for this analysis with a draft whole genome sequence.

Background: Genotyping-by-sequencing (GBS) has been used broadly in genetic studies for several species, especially those with agricultural importance. However, its use is still limited in autopolyploid species because genotype calling software generally fails to properly distinguish heterozygous classes based on allele dosage.

Results: VCF2SM is a Python script that integrates sequencing depth information of polymorphisms in variant call format (VCF) files and SUPERMASSA software for quantitative genotype calling.

Verticillium wilt (VW) of alfalfa is a soilborne disease causing severe yield loss in alfalfa. To identify molecular markers associated with VW resistance, we used an integrated framework of genome-wide association study (GWAS) with high-throughput genotyping by sequencing (GBS) to identify loci associated with VW resistance in an F1 full-sib alfalfa population. Phenotyping was performed using manual inoculation of the pathogen to cloned plants of each individual and disease severity was scored using a standard scale.
