Despite decreasing sequencing costs, whole-genome sequencing for population-based genome scans for selection is still prohibitively expensive for organisms with large genomes. Moreover, the repetitive nature of large genomes often represents a challenge in bioinformatic and downstream analyses. Here, we use in-depth transcriptome sequencing to design probes for exome capture in Swiss stone pine (Pinus cembra), a conifer with an estimated genome size of 29.3 Gbp and no reference genome available. We successfully applied around 55,000 self-designed probes, targeting 25,000 contigs, to DNA pools of seven populations from the Swiss Alps and identified >160,000 SNPs in around 15,000 contigs. The probes performed equally well in pools of the closely related species Pinus sibirica; in both species, more than 70% of the targeted contigs were sequenced at a depth ≥40× (number of haplotypes in the pool). However, a thorough analysis of individually sequenced P. cembra samples indicated that a majority of the contigs (63%) represented multi-copy genes. We therefore removed paralogous contigs based on heterozygote excess and deviation from allele balance. Without putatively paralogous contigs, allele frequencies of population pools represented accurate estimates of individually determined allele frequencies. We show that inferences of neutral and adaptive genetic variation may be biased when not accounting for such multi-copy genes. Without individual genotype data, it would have been nearly impossible to recognize and deal with the problem of multi-copy contigs. We advocate to put more emphasis on identifying paralogous loci, which will be facilitated by the establishment of additional high-quality reference genomes.

Download full-text PDF

Source
http://dx.doi.org/10.1111/1755-0998.12986DOI Listing

Publication Analysis

Top Keywords

transcriptome sequencing
8
exome capture
8
pinus cembra
8
large genomes
8
multi-copy genes
8
paralogous contigs
8
allele frequencies
8
contigs
7
sequencing pooled
4
pooled exome
4

Similar Publications

BACKGROUND Cleidocranial dysplasia (CCD) is a rare (1: 1 000 000) autosomal dominant congenital skeletal dysplasia characterized by widely patent calvarial sutures, clavicular hypoplasia, supernumerary teeth, and short stature. Only a minority of the cases are diagnosed early after birth. We present another case of proven CCD presenting with typical neonatal phenotype to promote awareness of this rare disorder.

View Article and Find Full Text PDF

Recent studies have suggested that sVEGFR3 is involved in cardiac diseases by regulating lymphangiogenesis; however, results are inconsistent. The aim of this study was to investigate the function and mechanism of sVEGFR3 in myocardial ischemia/reperfusion injury (MI/RI). sVEGFR3 effects were evaluated in vivo in mice subjected to MI/RI, and in vitro using HL-1 cells exposed to oxygen-glucose deprivation/reperfusion.

View Article and Find Full Text PDF

In our research, we performed temporal transcriptomic profiling of host cells infected with Equid alphaherpesvirus 1 (EHV-1) by utilizing direct cDNA sequencing based on nanopore MinION technology. The sequencing reads were harnessed for transcript quantification at various time points. Viral infection-induced differential gene expression was identified through the edgeR package.

View Article and Find Full Text PDF

Evaluation of transcriptomic changes after photobiomodulation in spinal cord injury.

Sci Rep

January 2025

Neuroscience and Ophthalmology, Department of Inflammation and Ageing, School of Infection, Inflammation and Immunology, College of Medicine and Health, University of Birmingham, Edgbaston, Birmingham, B15 2TT, UK.

Spinal cord injury (SCI) is a significant cause of lifelong disability, with no available disease-modifying treatments to promote neuroprotection and axon regeneration after injury. Photobiomodulation (PBM) is a promising therapy which has proven effective at restoring lost function after SCI in pre-clinical models. However, the precise mechanism of action is yet to be determined.

View Article and Find Full Text PDF

Osteoarthritis (OA) is a complex, degenerative, multi-factorial joint disease. Because of the difficulty in treating OA, developing new targeting strategies that can be used to understand its molecular mechanisms is critical. Protaetia brevitarsis seulensis larvae offer much therapeutic value; however, the presence of various active compounds and the multi-factorial risk factors for OA render the precise mechanisms of action unclear.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!