Background: Nematode model organisms such as Caenorhabditis elegans and Pristionchus pacificus are powerful systems for studying the evolution of gene function at a mechanistic level. However, the identification of P. pacificus orthologs of candidate genes known from C. elegans is complicated by the discrepancy in the quality of gene annotations, a common problem in nematode and invertebrate genomics.

Results: Here, we combine comparative genomic screens for suspicious gene models with community-based curation to further improve the quality of gene annotations in P. pacificus. We extend previous curations of one-to-one orthologs to larger gene families and to orphan genes. Cross-species comparisons of protein lengths, screens for atypical domain combinations, and screens for species-specific orphan genes yielded 4311 candidate genes that were subjected to community-based curation. Corrections for 2946 gene models were implemented in a new version of the P. pacificus gene annotations. The new set of gene annotations contains 28,896 genes and has a single-copy ortholog completeness level of 97.6%.
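
The cross-species protein length comparison mentioned above can be illustrated with a minimal sketch. It assumes one-to-one ortholog pairs are already available and flags pairs whose length ratio falls outside a chosen window. The gene identifiers, the `OrthologPair` type, the `flag_suspicious` function, and the ratio thresholds are all hypothetical; the abstract does not state the criteria actually used.

```python
from typing import NamedTuple


class OrthologPair(NamedTuple):
    ppa_gene: str    # hypothetical P. pacificus gene identifier
    ppa_length: int  # protein length in amino acids
    cel_gene: str    # hypothetical C. elegans ortholog identifier
    cel_length: int  # protein length in amino acids


def flag_suspicious(pairs, min_ratio=0.5, max_ratio=2.0):
    """Return (pair, ratio) for pairs whose P. pacificus / C. elegans
    protein length ratio lies outside [min_ratio, max_ratio].

    The thresholds are illustrative assumptions, not values from the paper.
    """
    flagged = []
    for pair in pairs:
        if pair.cel_length <= 0:
            continue  # skip entries without a usable reference length
        ratio = pair.ppa_length / pair.cel_length
        if ratio < min_ratio or ratio > max_ratio:
            flagged.append((pair, ratio))
    return flagged


# Made-up example data, for illustration only.
pairs = [
    OrthologPair("ppa-gene-1", 480, "cel-gene-1", 500),
    OrthologPair("ppa-gene-2", 1500, "cel-gene-2", 600),
    OrthologPair("ppa-gene-3", 120, "cel-gene-3", 400),
]

for pair, ratio in flag_suspicious(pairs):
    print(f"{pair.ppa_gene}: length ratio {ratio:.2f}")
```

A much longer protein may point to an artificially fused gene model and a much shorter one to a truncated or split model, so flagged genes are candidates for manual inspection rather than confirmed errors.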

Conclusions: Our work demonstrates the effectiveness of comparative genomic screens to identify suspicious gene models and the scalability of community-based approaches to improve the quality of thousands of gene models. Similar community-based approaches can help to improve the quality of gene annotations in other invertebrate species, including parasitic nematodes.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7552371
DOI: http://dx.doi.org/10.1186/s12864-020-07100-0
