Background: Caenorhabditis elegans gene-based phenotype information dates back to the 1970's, beginning with Sydney Brenner and the characterization of behavioral and morphological mutant alleles via classical genetics in order to understand nervous system function. Since then C. elegans has become an important genetic model system for the study of basic biological and biomedical principles, largely through the use of phenotype analysis. Because of the growth of C. elegans as a genetically tractable model organism and the development of large-scale analyses, there has been a significant increase of phenotype data that needs to be managed and made accessible to the research community. To do so, a standardized vocabulary is necessary to integrate phenotype data from diverse sources, permit integration with other data types and render the data in a computable form.

Results: We describe a hierarchically structured, controlled vocabulary of terms that can be used to standardize phenotype descriptions in C. elegans, namely the Worm Phenotype Ontology (WPO). The WPO is currently comprised of 1,880 phenotype terms, 74% of which have been used in the annotation of phenotypes associated with greater than 18,000 C. elegans genes. The scope of the WPO is not exclusively limited to C. elegans biology, rather it is devised to also incorporate phenotypes observed in related nematode species. We have enriched the value of the WPO by integrating it with other ontologies, thereby increasing the accessibility of worm phenotypes to non-nematode biologists. We are actively developing the WPO to continue to fulfill the evolving needs of the scientific community and hope to engage researchers in this crucial endeavor.

Conclusions: We provide a phenotype ontology (WPO) that will help to facilitate data retrieval, and cross-species comparisons within the nematode community. In the larger scientific community, the WPO will permit data integration, and interoperability across the different Model Organism Databases (MODs) and other biological databases. This standardized phenotype ontology will therefore allow for more complex data queries and enhance bioinformatic analyses.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3039574PMC
http://dx.doi.org/10.1186/1471-2105-12-32DOI Listing

Publication Analysis

Top Keywords

phenotype ontology
16
phenotype data
12
phenotype
10
worm phenotype
8
data
8
model organism
8
ontology wpo
8
scientific community
8
wpo will
8
elegans
7

Similar Publications

Quantitative natural history modeling of HPDL-related disease based on cross-sectional data reveals genotype-phenotype correlations.

Genet Med

December 2024

Movement Disorders Program, Department of Neurology and F.M. Kirby Neurobiology Center, Boston Children's Hospital, Harvard Medical School, Boston, Massachusetts, USA. Electronic address:

Objectives: Biallelic HPDL variants have been identified as the cause of a progressive childhood-onset movement disorder, with a broad clinical spectrum from severe neurodevelopmental disorder to juvenile-onset pure hereditary spastic paraplegia type 83. This study aims at delineating the geno- and phenotypic spectra of patients with HPDL-related disease, quantitatively modelling the natural history, and uncovering genotype-phenotype associations.

Methods: A cross-sectional analysis of 90 published and one novel case was performed, employing a Human Phenotype Ontology-based approach.

View Article and Find Full Text PDF

Background & Aims: GD2, a member of the ganglioside (GS) family (sialic acid-containing glycosphingolipids), is a potential biomarker of cancer stem cells (CSC) in several tumours. However, the possible role of GD2 and its biosynthetic enzyme, GD3 synthase (GD3S), in intrahepatic cholangiocarcinoma (iCCA) has not been explored.

Methods: The stem-like subset of two iCCA cell lines was enriched by sphere culture (SPH) and compared to monolayer parental cells (MON).

View Article and Find Full Text PDF

Rare diseases affect 1-in-10 people in the United States and despite increased genetic testing, up to half never receive a diagnosis. Even when using advanced genome sequencing platforms to discover variants, if there is no connection between the variants found in the patient's genome and their phenotypes in the literature, then the patient will remain undiagnosed. When a direct variant-phenotype connection is not known, putting a patient's information in the larger context of phenotype relationships and protein-protein interactions may provide an opportunity to find an indirect explanation.

View Article and Find Full Text PDF

Objectives: Little is known on the mechanisms necessary to maintain the physiological adult human skin integrity. This study aims to quantitatively describe anatomical changes in systemic sclerosis (SSc)-skin compared to controls and investigate the underlying mechanisms.

Methods: Skin morphology was histologically assessed in twenty-three SSc-patients, eighteen controls and fifteen patients with hypertrophic scars.

View Article and Find Full Text PDF

With the increasing utilization of exome and genome sequencing in clinical and research genetics, accurate and automated extraction of human phenotype ontology (HPO) terms from clinical texts has become imperative. Traditional methods for HPO term extraction, such as PhenoTagger, often face limitations in coverage and precision. In this study, we propose a novel approach that leverages large language models (LLMs) to generate synthetic sentences with clinical context, which were semantically encoded into vector embeddings.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!