Cattle have been selectively bred for coat color, spotting, and depigmentation patterns. The assumed autosomal dominant inherited genetic variants underlying the characteristic white head of Fleckvieh, Simmental, and Hereford cattle have not been identified yet, although the contribution of structural variation upstream the gene has been proposed. Here, we construct a graph pangenome from 24 haplotype assemblies representing seven taurine cattle breeds to identify and characterize the white head-associated locus for the first time based on long-read sequencing data and pangenome analyses.
View Article and Find Full Text PDFGenomics Proteomics Bioinformatics
July 2024
Sheep were domesticated in the Fertile Crescent and then spread globally, where they have been encountering various environmental conditions. The Tibetan sheep has adapted to high altitudes on the Qinghai-Tibet Plateau over the past 3000 years. To explore genomic variants associated with high-altitude adaptation in Tibetan sheep, we analyzed Illumina short-reads of 994 whole genomes representing ∼ 60 sheep breeds/populations at varied altitudes, PacBio High fidelity (HiFi) reads of 13 breeds, and 96 transcriptomes from 12 sheep organs.
View Article and Find Full Text PDFBackground: Association testing between molecular phenotypes and genomic variants can help to understand how genotype affects phenotype. RNA sequencing provides access to molecular phenotypes such as gene expression and alternative splicing while DNA sequencing or microarray genotyping are the prevailing options to obtain genomic variants.
Results: We genotype variants for 74 male Braunvieh cattle from both DNA (~ 13-fold coverage) and deep total RNA sequencing from testis, vas deferens, and epididymis tissue (~ 250 million reads per tissue).
Background: Mastitis is a disease that incurs significant costs in the dairy industry. A promising approach to mitigate its negative effects is to genetically improve the resistance of dairy cattle to mastitis. A meta-analysis of genome-wide association studies (GWAS) across multiple breeds for clinical mastitis (CM) and its indicator trait, somatic cell score (SCS), is a powerful method to identify functional genetic variants that impact mastitis resistance.
View Article and Find Full Text PDFEmbryonic diapause in mammals is a temporary developmental delay occurring at the blastocyst stage. In contrast to other diapausing species displaying a full arrest, the blastocyst of the European roe deer (Capreolus capreolus) proliferates continuously and displays considerable morphological changes in the inner cell mass. We hypothesised that developmental progression also continues during this period.
View Article and Find Full Text PDFExpression and splicing quantitative trait loci (e/sQTL) are large contributors to phenotypic variability. Achieving sufficient statistical power for e/sQTL mapping requires large cohorts with both genotypes and molecular phenotypes, and so, the genomic variation is often called from short-read alignments, which are unable to comprehensively resolve structural variation. Here we build a pangenome from 16 HiFi haplotype-resolved cattle assemblies to identify small and structural variation and genotype them with PanGenie in 307 short-read samples.
View Article and Find Full Text PDFBreeding bulls are well suited to investigate inherited variation in male fertility because they are genotyped and their reproductive success is monitored through semen analyses and thousands of artificial inseminations. However, functional data from relevant tissues are lacking in cattle, which prevents fine-mapping fertility-associated genomic regions. Here, we characterize gene expression and splicing variation in testis, epididymis, and vas deferens transcriptomes of 118 mature bulls and conduct association tests between 414,667 molecular phenotypes and 21,501,032 genome-wide variants to identify 41,156 regulatory loci.
View Article and Find Full Text PDFThe branch point sequence is a degenerate intronic heptamer required for the assembly of the spliceosome during pre-mRNA splicing. Disruption of this motif may promote alternative splicing and eventually cause phenotype variation. Despite its functional relevance, the branch point sequence is not included in most genome annotations.
View Article and Find Full Text PDFBackground: Combining the results of within-population genome-wide association studies (GWAS) based on whole-genome sequences into a single meta-analysis (MA) is an accurate and powerful method for identifying variants associated with complex traits. As part of the H2020 BovReg project, we performed sequence-level MA for beef production traits. Five partners from France, Switzerland, Germany, and Canada contributed summary statistics from sequence-based GWAS conducted with 54,782 animals from 15 purebred or crossbred populations.
View Article and Find Full Text PDFDespite passing routine laboratory tests for semen quality, bulls used in artificial insemination exhibit significant variation in fertility. Routine analysis of fertility data identified a dairy bull with extreme subfertility (10% pregnancy rate). To characterize the subfertility phenotype, a range of in vitro, in vivo, and molecular assays were carried out.
View Article and Find Full Text PDFBackground: Structural variations (SVs) in individual genomes are major determinants of complex traits, including adaptability to environmental variables. The Mongolian and Hainan cattle breeds in East Asia are of taurine and indicine origins that have evolved to adapt to cold and hot environments, respectively. However, few studies have investigated SVs in East Asian cattle genomes and their roles in environmental adaptation, and little is known about adaptively introgressed SVs in East Asian cattle.
View Article and Find Full Text PDFStructural variants (SVs) and short tandem repeats (STRs) are significant sources of genetic variation. However, the impacts of these variants on gene regulation have not been investigated in cattle. Here, we genotyped and characterized 19,408 SVs and 374,821 STRs in 183 bovine genomes and investigated their impact on molecular phenotypes derived from testis transcriptomes.
View Article and Find Full Text PDFCattle are a well-suited "model organism" to study the genetic underpinnings of variation in male reproductive performance. The adoption of artificial insemination and genomic prediction in many cattle breeds provide access to microarray-derived genotypes and repeated measurements for semen quality and insemination success in several thousand bulls. Similar-sized mapping cohorts with phenotypes for male fertility are not available for most other species precluding powerful association testing.
View Article and Find Full Text PDFThe Bovine Pangenome Consortium (BPC) is an international collaboration dedicated to the assembly of cattle genomes to develop a more complete representation of cattle genomic diversity. The goal of the BPC is to provide genome assemblies and a community-agreed pangenome representation to replace breed-specific reference assemblies for cattle genomics. The BPC invites partners sharing our vision to participate in the production of these assemblies and the development of a common, community-approved, pangenome reference as a public resource for the research community ( https://bovinepangenome.
View Article and Find Full Text PDFBackground: Several models and algorithms have been proposed to build pangenomes from multiple input assemblies, but their impact on variant representation, and consequently downstream analyses, is largely unknown.
Results: We create multi-species super-pangenomes using pggb, cactus, and minigraph with the Bos taurus taurus reference sequence and eleven haplotype-resolved assemblies from taurine and indicine cattle, bison, yak, and gaur. We recover 221 k nonredundant structural variations (SVs) from the pangenomes, of which 135 k (61%) are common to all three.
Background: Low-pass sequencing followed by sequence variant genotype imputation is an alternative to the routine microarray-based genotyping in cattle. However, the impact of haplotype reference panels and their interplay with the coverage of low-pass whole-genome sequencing data have not been sufficiently explored in typical livestock settings where only a small number of reference samples is available.
Methods: Sequence variant genotyping accuracy was compared between two variant callers, GATK and DeepVariant, in 50 Brown Swiss cattle with sequencing coverages ranging from 4- to 63-fold.
Selection for system-wide morphological, physiological, and metabolic adaptations has led to extreme athletic phenotypes among geographically diverse horse breeds. Here, we identify genes contributing to exercise adaptation in racehorses by applying genomics approaches for racing performance, an end-point athletic phenotype. Using an integrative genomics strategy to first combine population genomics results with skeletal muscle exercise and training transcriptomic data, followed by whole-genome resequencing of Asian horses, we identify protein-coding variants in genes of interest in galloping racehorse breeds (Arabian, Mongolian and Thoroughbred).
View Article and Find Full Text PDFUnderstanding the genetic mechanism of how animals adapt to extreme conditions is fundamental to determine the relationship between molecular evolution and changing environments. Goat is one of the first domesticated species and has evolved rapidly to adapt to diverse environments, including harsh high-altitude conditions with low temperature and poor oxygen supply but strong ultraviolet radiation. Here, we analyzed 331 genomes of domestic goats and wild caprid species living at varying altitudes (high > 3000 m above sea level and low < 1200 m), along with a reference-guided chromosome-scale assembly (contig-N50: 90.
View Article and Find Full Text PDFUndisturbed reproduction is key for successful breeding of beef and dairy cattle. Improving reproductive ability can be difficult because of antagonistic relationships with other economically relevant traits. In cattle, thorough investigation of female fertility revealed unfavorable genetic correlations with various production phenotypes.
View Article and Find Full Text PDFAdvantages of pangenomes over linear reference assemblies for genome research have recently been established. However, potential effects of sequence platform and assembly approach, or of combining assemblies created by different approaches, on pangenome construction have not been investigated. Here we generate haplotype-resolved assemblies from the offspring of three bovine trios representing increasing levels of heterozygosity that each demonstrate a substantial improvement in contiguity, completeness, and accuracy over the current Bos taurus reference genome.
View Article and Find Full Text PDF