Falling fowl of the chicken reference genome: pitfalls of studying polymorphic endogenous retroviruses.

Retrovirology

Jack Birch Unit for Molecular Carcinogenesis, The Department of Biology and York Biomedical Research Institute, The University of York, York, YO10 5DD, UK.

Published: April 2021

High quality reference genomes have facilitated the study of endogenous retroviruses (ERVs). However, there are an increasing number of published works which assume the ERVs in reference genomes are universal; even those of evolutionarily recent integrations. Consequently, these studies fail to properly characterise polymorphic ERVs, and even propose biological functions for ERVs that may not actually be present in the genomes of interest. Here, I outline the pitfalls of three studies of chicken endogenous Avian Leukosis Viruses (ALVEs or "ev genes": the "original" ERVs), all confounded by the assumption that the reference genome provides a representative ALVE baseline.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8059273PMC
http://dx.doi.org/10.1186/s12977-021-00555-3DOI Listing

Publication Analysis

Top Keywords

reference genome
8
endogenous retroviruses
8
reference genomes
8
ervs
5
falling fowl
4
fowl chicken
4
reference
4
chicken reference
4
genome pitfalls
4
pitfalls studying
4

Similar Publications

Background: Treponemal diseases are a significant global health risk, presenting challenges to public health and severe consequences to individuals if left untreated. Despite numerous genomic studies on Treponema pallidum and the known possible biases introduced by the choice of the reference genome used for mapping, few investigations have addressed how these biases affect phylogenetic and evolutionary analysis of these bacteria. In this study, we ascertain the importance of selecting an appropriate genomic reference on phylogenetic and evolutionary analyses of T.

View Article and Find Full Text PDF

The influx of whole genome sequencing (WGS) data in the public health and clinical diagnostic sectors has created a need for data analysis methods and bioinformatics expertise, which can be a bottleneck for many laboratories. At Sciensano, the Belgian national public health institute, an intuitive and user-friendly bioinformatics tool portal was implemented using Galaxy, an open-source platform for data analysis and workflow creation. The Galaxy @Sciensano instance is available to both internal and external scientists and offers a wide range of tools provided by the community, complemented by over 50 custom tools and pipelines developed in-house.

View Article and Find Full Text PDF

Segmental duplications (SDs) contribute significantly to human disease, evolution and diversity but have been difficult to resolve at the sequence level. We present a population genetics survey of SDs by analyzing 170 human genome assemblies (from 85 samples representing 38 Africans and 47 non-Africans) in which the majority of autosomal SDs are fully resolved using long-read sequence assembly. Excluding the acrocentric short arms and sex chromosomes, we identify 173.

View Article and Find Full Text PDF

Telomere-to-telomere sheep genome assembly identifies variants associated with wool fineness.

Nat Genet

January 2025

Frontiers Science Center for Molecular Design Breeding (MOE); State Key Laboratory of Animal Biotech Breeding; College of Animal Science and Technology, China Agricultural University, Beijing, China.

Ongoing efforts to improve sheep reference genome assemblies still leave many gaps and incomplete regions, resulting in a few common failures and errors in genomic studies. Here, we report a 2.85-Gb gap-free telomere-to-telomere genome of a ram (T2T-sheep1.

View Article and Find Full Text PDF

Complete datasets of genetic variants are key to biodiversity genomic studies. Long-read sequencing technologies allow the routine assembly of highly contiguous, haplotype-resolved reference genomes. However, even when complete, reference genomes from a single individual may bias downstream analyses and fail to adequately represent genetic diversity within a population or species.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!