Revealing DNA sequence variation within the Lolium perenne genepool is important for genetic analysis and development of breeding applications. We reviewed current literature on plant development to select candidate genes in pathways that control agronomic traits, and identified 503 orthologues in L. perenne. Using targeted resequencing, we constructed a comprehensive catalogue of genomic variation for a L. perenne germplasm collection of 736 genotypes derived from current cultivars, breeding material and wild accessions. To overcome challenges of variant calling in heterogeneous outbreeding species, we used two complementary strategies to explore sequence diversity. First, four variant calling pipelines were integrated with the VariantMetaCaller to reach maximal sensitivity. Additional multiplex amplicon sequencing was used to empirically estimate an appropriate precision threshold. Second, a de novo assembly strategy was used to reconstruct divergent alleles for each gene. The advantage of this approach was illustrated by discovery of 28 novel alleles of LpSDUF247, a polymorphic gene co-segregating with the S-locus of the grass self-incompatibility system. Our approach is applicable to other genetically diverse outbreeding species. The resulting collection of functionally annotated variants can be mined for variants causing phenotypic variation, either through genetic association studies, or by selecting carriers of rare defective alleles for physiological analyses.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6379033PMC
http://dx.doi.org/10.1093/dnares/dsy033DOI Listing

Publication Analysis

Top Keywords

variant calling
12
challenges variant
8
sequence diversity
8
candidate genes
8
plant development
8
lolium perenne
8
outbreeding species
8
overcoming challenges
4
calling exploring
4
exploring sequence
4

Similar Publications

Background: Current clinical sequencing methods cannot effectively detect DNA methylation and allele-specific variation to provide parent-of-origin information from the proband alone. Parent-of-origin effects can lead to differential disease and the inability to assign this in de novo cases limits prognostication in the majority of affected individuals with retinoblastoma, a hereditary cancer with suspected parent-of-origin effects.

Methods: To directly assign parent-of-origin in retinoblastoma patients, genomic DNA was extracted from blood samples for sequencing using a programmable, targeted single-molecule long-read DNA genomic and epigenomic approach.

View Article and Find Full Text PDF

Motivation: The Variant Call Format (VCF) is widely used in genome sequencing but scales poorly. For instance, we estimate a 150,000 genome VCF would occupy 900 TiB, making it costly and complicated to produce, analyze, and store. The issue stems from VCF's requirement to densely represent both reference-genotypes and allele-indexed arrays.

View Article and Find Full Text PDF

Background: The high burden of malaria in Africa is largely due to the presence of competent and adapted Anopheles vector species. With invasive Anopheles stephensi implicated in malaria outbreaks in Africa, understanding the genomic basis of vector-parasite compatibility is essential for assessing the risk of future outbreaks due to this mosquito. Vector compatibility with P.

View Article and Find Full Text PDF

Protocol for the assessment, improvement, and harmonization of somatic variant calling using ONCOLINER.

STAR Protoc

December 2024

Life Sciences Department, Barcelona Supercomputing Center (BSC), Barcelona, Spain; Institució Catalana per la Recerca i Estudis Avançats (ICREA), Barcelona, Spain. Electronic address:

The interoperability of variant identification pipelines is fundamental for achieving consistent clinical care across oncology research centers and hospitals. Here, we present a protocol for using ONCOLINER, a platform for the assessment, improvement, and harmonization of somatic variant discovery of multiple pipelines. We describe steps for acquiring benchmarking datasets and executing the user variant calling pipeline.

View Article and Find Full Text PDF

The incorporation of sequencing technologies in frontline and public health healthcare settings was vital in developing virus surveillance programs during the Coronavirus Disease 2019 (COVID-19) pandemic caused by transmission of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). However, increased data acquisition poses challenges for both rapid and accurate analyses. To overcome these hurdles, we developed the SARS-CoV-2 Illumina GeNome Assembly Line (SIGNAL) for quick bulk analyses of Illumina short-read sequencing data.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!