Improving the Sequence Ontology terminology for genomic variant annotation.

J Biomed Semantics

Department of Biomedical Informatics, University of Utah, Salt Lake City, UT USA.

Published: August 2015

Background: The Genome Variant Format (GVF) uses the Sequence Ontology (SO) to enable detailed annotation of sequence variation. The annotation includes SO terms for the type of sequence alteration, the genomic features that are changed and the effect of the alteration. The SO maintains and updates the specification and provides the underlying ontologicial structure.

Methods: A requirements analysis was undertaken to gather terms missing in the SO release at the time, but needed to adequately describe the effects of sequence alteration on a set of variant genomic annotations. We have extended and remodeled the SO to include and define all terms that describe the effect of variation upon reference genomic features in the Ensembl variation databases.

Results: The new terminology was used to annotate the human reference genome with a set of variants from both COSMIC and dbSNP. A GVF file containing 170,853 sequence alterations was generated using the SO terminology to annotate the kinds of alteration, the effect of the alteration and the reference feature changed. There are four kinds of alteration and 24 kinds of effect seen in this dataset. (Ensembl Variation annotates 34 different SO consequence terms: http://www.ensembl.org/info/docs/variation/predicted_data.html).

Conclusions: We explain the updates to the Sequence Ontology to describe the effect of variation on existing reference features. We have provided a set of annotations using this terminology, and the well defined GVF specification. We have also provided a provisional exploration of this large annotation dataset.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4520272PMC
http://dx.doi.org/10.1186/s13326-015-0030-4DOI Listing

Publication Analysis

Top Keywords

sequence ontology
12
sequence alteration
8
genomic features
8
describe variation
8
ensembl variation
8
terminology annotate
8
kinds alteration
8
sequence
6
alteration
6
variation
5

Similar Publications

Active Ingredients and Potential Mechanism of Additive Sishen Decoction in Treating Rheumatoid Arthritis with Network Pharmacology and Molecular Dynamics Simulation and Experimental Verification.

Drug Des Devel Ther

January 2025

Shanxi Key Laboratory of Innovative Drug for the Treatment of Serious Diseases Basing on the Chronic Inflammation, College of Traditional Chinese Medicine and Food Engineering, Shanxi University of Chinese Medicine, Jinzhong, People's Republic of China.

Background: Rheumatoid arthritis (RA) is a chronic inflammatory autoimmune disease in which macrophages produce cytokines that enhance inflammation and contribute to the destruction of cartilage and bone. Additive Sishen decoction (ASSD) is a widely used traditional Chinese medicine for the treatment of RA; however, its active ingredients and the mechanism of its therapeutic effects remain unclear.

Methods: To predict the ingredients and key targets of ASSD, we constructed "drug-ingredient-target-disease" and protein-protein interaction networks.

View Article and Find Full Text PDF

Background: Myocardial ischemia/reperfusion (I/R) injury, which is associated with high morbidity and mortality, is a main cause of unexpected myocardial injury after acute myocardial infarction. However, the underlying mechanism remains unclear. Circular RNAs (circRNAs), which are formed from protein-coding genes, can sequester microRNAs or proteins, modulate transcription and interfere with splicing.

View Article and Find Full Text PDF

Despite numerous attempts to understand the molecular mechanisms behind the development of liver cancer, it continues to pose a significant worldwide health challenge. Transcriptome sequencing, a powerful tool in molecular biology, has played a pivotal role in uncovering the intricate gene expression profiles underlying hepatocellular carcinoma (HCC). In the present study, we identified a total of 808 differentially expressed genes (DEGs), with 584 exhibiting downregulation, and 224 showing upregulation following apigetrin treatment.

View Article and Find Full Text PDF

High expression of SERPINE1 and CTSL in keratinocytes in pressure injury caused by ischemia-reperfusion injury.

Tissue Cell

January 2025

Institute of Regenerative Medicine, Binzhou Medical University, Yantai, Shandong 264003, PR China; Department of Histology and Embryology, Binzhou Medical University, Yantai, Shandong 264003, PR China. Electronic address:

Introduction: Pressure Injury (PI) is a complex disease process which is influenced by multiple factors, among which ischemia-reperfusion (I/R) injury is closely related to the progression of PI. But its biomarkers are still unclearly. Understanding its physiological mechanisms and related molecular biomarkers is a key to developing effective prevention and therapeutic strategies.

View Article and Find Full Text PDF

Background: During mammalian spermatogenesis, the cytoskeleton system plays a significant role in morphological changes. Male infertility such as non-obstructive azoospermia (NOA) might be explained by studies of the cytoskeletal system during spermatogenesis.

Methods: The cytoskeleton, scaffold, and actin-binding genes were analyzed by microarray and bioinformatics (771 spermatogenic cellsgenes and 774 Sertoli cell genes).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!