The gyrfalcon (Falco rusticolus) genome.

G3 (Bethesda)

Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia.

Published: March 2023

High-quality genome assemblies are characterized by high-sequence contiguity, completeness, and a low error rate, thus providing the basis for a wide array of studies focusing on natural species ecology, conservation, evolution, and population genomics. To provide this valuable resource for conservation projects and comparative genomics studies on gyrfalcon (Falco rusticolus), we sequenced and assembled the genome of this species using third-generation sequencing strategies and optical maps. Here, we describe a highly contiguous and complete genome assembly comprising 20 scaffolds and 13 contigs with a total size of 1.193 Gbp, including 8,064 complete Benchmarking Universal Single-Copy Orthologs (BUSCOs) of the total 8,338 BUSCO groups present in the library aves_odb10. Of these BUSCO genes, 96.7% were complete, 96.1% were present as a single copy, and 0.6% were duplicated. Furthermore, 0.8% of BUSCO genes were fragmented and 2.5% (210) were missing. A de novo search for transposable elements (TEs) identified 5,716 TEs that masked 7.61% of the F. rusticolus genome assembly when combined with publicly available TE collections. Long interspersed nuclear elements, in particular, the element Chicken-repeat 1 (CR1), were the most abundant TEs in the F. rusticolus genome. A de novo first-pass gene annotation was performed using 293,349 PacBio Iso-Seq transcripts and 496,195 transcripts derived from the assembly of 42,429,525 Illumina PE RNA-seq reads. In all, 19,602 putative genes, of which 59.31% were functionally characterized and associated with Gene Ontology terms, were annotated. A comparison of the gyrfalcon genome assembly with the publicly available assemblies of the domestic chicken (Gallus gallus), zebra finch (Taeniopygia guttata), and hummingbird (Calypte anna) revealed several genome rearrangements. In particular, nine putative chromosome fusions were identified in the gyrfalcon genome assembly compared with those in the G. gallus genome assembly. This genome assembly, its annotation for TEs and genes, and the comparative analyses presented, complement and strength the base of high-quality genome assemblies and associated resources available for comparative studies focusing on the evolution, ecology, and conservation of Aves.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9997569PMC
http://dx.doi.org/10.1093/g3journal/jkad001DOI Listing

Publication Analysis

Top Keywords

genome assembly
24
genome
12
rusticolus genome
12
gyrfalcon falco
8
falco rusticolus
8
high-quality genome
8
genome assemblies
8
studies focusing
8
ecology conservation
8
busco genes
8

Similar Publications

In single cells, variably sized nanoscale chromatin structures are observed, but it is unknown whether these form a cohesive framework that regulates RNA transcription. Here, we demonstrate that the human genome is an emergent, self-assembling, reinforcement learning system. Conformationally defined heterogeneous, nanoscopic packing domains form by the interplay of transcription, nucleosome remodeling, and loop extrusion.

View Article and Find Full Text PDF

Introduction: Varenicline is an α4β2 nicotinic acetylcholine receptor partial agonist with the highest therapeutic efficacy of any pharmacological smoking cessation aid and a 12-month cessation rate of 26%. Genetic variation may be associated with varenicline response, but to date no genome-wide association studies of varenicline response have been published.

Methods: In this study, we investigated the genetic contribution to varenicline effectiveness using two electronic health record-derived phenotypes.

View Article and Find Full Text PDF

Genes encoding OXA-48-like carbapenem-hydrolyzing enzymes are often located on plasmids and are abundant among carbapenemase-producing (CPE) worldwide. After a large plasmid-mediated outbreak in 2011, routine screening of patients at risk of CPE carriage on admission and every 7 days during hospitalization was implemented in a large hospital in the Netherlands. The objective of this study was to investigate the dynamics of the hospitals' 2011 outbreak-associated plasmid among CPE collected from 2011 to 2021.

View Article and Find Full Text PDF

Transposable elements (TEs) are significant drivers of genome evolution, yet their recent dynamics and impacts within and among species, as well as the roles of host genes and non-coding RNAs in the transposition process, remain elusive. With advancements in large-scale pan-genome sequencing and the development of open data sharing, large-scale comparative genomics studies have become feasible. Here, we performed complete de novo TE annotations and identified active TEs in 310 plant genome assemblies across 119 species and seven crop populations.

View Article and Find Full Text PDF

Coronaviruses (CoVs) encode non-structural proteins (nsp's) 1-16, which assemble to form replication-transcription complexes that function in viral RNA synthesis. All CoVs encode a proofreading 3'-5' exoribonuclease in non-structural protein 14 (nsp14-ExoN) that mediates proofreading and high-fidelity replication and is critical for other roles in replication and pathogenesis. The enzymatic activity of nsp14-ExoN is enhanced in the presence of the cofactor nsp10.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!