A beginner's guide to low-coverage whole genome sequencing for population genomics.

Mol Ecol

Department of Natural Resources and the Environment, Cornell University, Ithaca, New York, USA.

Published: December 2021

AI Article Synopsis

  • Low-coverage whole genome sequencing (lcWGS) is a valuable and cost-effective method for studying population genomics, but it requires special tools due to its low read depth, which complicates genotype accuracy.
  • The guide provides a comparison of lcWGS costs to RAD-seq and Pool-seq, introduces software that helps manage genotype uncertainty, and evaluates the accuracy of various genetic analyses based on different sequencing strategies.
  • Results suggest that distributing sequencing effort across more samples with lower individual depth generally enhances inference accuracy, although some exceptions exist; the guide also explores the potential of imputation in improving analysis for nonmodel species and outlines future research directions.

Article Abstract

Low-coverage whole genome sequencing (lcWGS) has emerged as a powerful and cost-effective approach for population genomic studies in both model and nonmodel species. However, with read depths too low to confidently call individual genotypes, lcWGS requires specialized analysis tools that explicitly account for genotype uncertainty. A growing number of such tools have become available, but it can be difficult to get an overview of what types of analyses can be performed reliably with lcWGS data, and how the distribution of sequencing effort between the number of samples analysed and per-sample sequencing depths affects inference accuracy. In this introductory guide to lcWGS, we first illustrate how the per-sample cost for lcWGS is now comparable to RAD-seq and Pool-seq in many systems. We then provide an overview of software packages that explicitly account for genotype uncertainty in different types of population genomic inference. Next, we use both simulated and empirical data to assess the accuracy of allele frequency, genetic diversity, and linkage disequilibrium estimation, detection of population structure, and selection scans under different sequencing strategies. Our results show that spreading a given amount of sequencing effort across more samples with lower depth per sample consistently improves the accuracy of most types of inference, with a few notable exceptions. Finally, we assess the potential for using imputation to bolster inference from lcWGS data in nonmodel species, and discuss current limitations and future perspectives for lcWGS-based population genomics research. With this overview, we hope to make lcWGS more approachable and stimulate its broader adoption.

Download full-text PDF

Source
http://dx.doi.org/10.1111/mec.16077DOI Listing

Publication Analysis

Top Keywords

low-coverage genome
8
genome sequencing
8
population genomics
8
population genomic
8
nonmodel species
8
explicitly account
8
account genotype
8
genotype uncertainty
8
lcwgs data
8
sequencing effort
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!