AI Article Synopsis

  • Advances in nucleotide sequencing tech are leading to new genomic methods for studying natural populations, but researchers must manage increasingly large data sets and understand potential biases and errors in this data.
  • The study focuses on genomic sampling biases in Populus species, highlighting a tendency to favor coding regions when using the ApeKI enzyme during genotyping-by-sequencing.
  • The researchers identify effective data filtering techniques that enhance the reliability of population genetic analyses, producing results consistent with previous studies and demonstrating that straightforward approaches can be applied successfully in population genomics.

Article Abstract

Continuing advances in nucleotide sequencing technology are inspiring a suite of genomic approaches in studies of natural populations. Researchers are faced with data management and analytical scales that are increasing by orders of magnitude. With such dramatic advances comes a need to understand biases and error rates, which can be propagated and magnified in large-scale data acquisition and processing. Here we assess genomic sampling biases and the effects of various population-level data filtering strategies in a genotyping-by-sequencing (GBS) protocol. We focus on data from two species of Populus, because this genus has a relatively small genome and is emerging as a target for population genomic studies. We estimate the proportions and patterns of genomic sampling by examining the Populus trichocarpa genome (Nisqually-1), and demonstrate a pronounced bias towards coding regions when using the methylation-sensitive ApeKI restriction enzyme in this species. Using population-level data from a closely related species (P. tremuloides), we also investigate various approaches for filtering GBS data to retain high-depth, informative SNPs that can be used for population genetic analyses. We find a data filter that includes the designation of ambiguous alleles resulted in metrics of population structure and Hardy-Weinberg equilibrium that were most consistent with previous studies of the same populations based on other genetic markers. Analyses of the filtered data (27,910 SNPs) also resulted in patterns of heterozygosity and population structure similar to a previous study using microsatellites. Our application demonstrates that technically and analytically simple approaches can readily be developed for population genomics of natural populations.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3991623PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0095292PLOS

Publication Analysis

Top Keywords

population genomics
8
natural populations
8
data
8
genomic sampling
8
population-level data
8
population structure
8
population
6
genotyping-by-sequencing populus
4
populus population
4
genomics assessment
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!