AI Article Synopsis

  • The study focuses on improving quality assurance for large ontological systems like SNOMED CT using a new hybrid method that combines structural analysis with lexical patterns.
  • The researchers extracted structural data from SNOMED CT using a MapReduce algorithm and identified four lexical patterns that highlight potential missing hierarchical relationships or concepts.
  • The approach flagged 6801 non-lattice subgraphs matching the lexical patterns; all 59 subgraphs that domain experts reviewed in detail contained confirmed errors, most often missing "is-a" relations, showing that the method can both detect errors and suggest how to fix them.

Article Abstract

Objective: Quality assurance of large ontological systems such as SNOMED CT is an indispensable part of the terminology management lifecycle. We introduce a hybrid structural-lexical method for scalable and systematic discovery of missing hierarchical relations and concepts in SNOMED CT.

Materials and Methods: All non-lattice subgraphs (the structural part) in SNOMED CT are exhaustively extracted using a scalable MapReduce algorithm. Four lexical patterns (the lexical part) are identified among the extracted non-lattice subgraphs. Non-lattice subgraphs exhibiting such lexical patterns are often indicative of missing hierarchical relations or concepts. Each lexical pattern is associated with a specific type of potential error.
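To make the structural part concrete, here is a minimal in-memory sketch of what a lattice violation looks like in an is-a hierarchy. It is not the authors' MapReduce algorithm: it is a brute-force check over all concept pairs, and it assumes the working definition that a pair of concepts is a "non-lattice pair" when the two concepts share more than one maximal common descendant (in a lattice, that greatest lower bound would be unique). The toy hierarchy, the concept names, and the function names are all hypothetical.

```python
from itertools import combinations

# Hypothetical toy is-a hierarchy: each concept maps to its direct parents.
PARENTS = {
    "A": set(),
    "B": {"A"},
    "C": {"A"},
    "D": {"B", "C"},
    "E": {"B", "C"},
    "F": {"D"},
}


def ancestors(node, parents):
    """Return all strict ancestors of `node` by walking is-a edges upward."""
    seen = set()
    stack = list(parents[node])
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(parents[current])
    return seen


def non_lattice_pairs(parents):
    """Map each non-lattice pair to its set of maximal common descendants.

    Assumed definition: a pair is non-lattice when it has more than one
    maximal common lower bound. Lower bounds include the concept itself,
    so comparable pairs (one concept above the other) are never flagged.
    """
    anc = {node: ancestors(node, parents) for node in parents}

    # Invert the ancestor sets to get each concept's lower bounds.
    lower = {node: {node} for node in parents}
    for node, node_ancestors in anc.items():
        for ancestor in node_ancestors:
            lower[ancestor].add(node)

    flagged = {}
    for c1, c2 in combinations(sorted(parents), 2):
        common = lower[c1] & lower[c2]
        # A common lower bound is maximal if none of its ancestors is also common.
        maximal = {d for d in common if not (anc[d] & common)}
        if len(maximal) > 1:
            flagged[(c1, c2)] = maximal
    return flagged


if __name__ == "__main__":
    for pair, maximal in non_lattice_pairs(PARENTS).items():
        print(pair, "share maximal common descendants", sorted(maximal))
```

In this toy graph only the pair (B, C) is flagged, because D and E are both maximal common descendants of B and C. The pairwise loop is quadratic in the number of concepts, which is presumably why a distributed approach such as MapReduce is needed at the scale of SNOMED CT.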

Results: Applying the structural-lexical method to SNOMED CT (September 2015 US edition), we found 6801 non-lattice subgraphs that matched these lexical patterns, of which 2046 were amenable to visual inspection. We evaluated a random sample of 100 small subgraphs, of which 59 were reviewed in detail by domain experts. All the subgraphs reviewed contained errors confirmed by the experts. The most frequent type of error was missing is-a relations due to incomplete or inconsistent modeling of the concepts.
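The abstract does not spell out the four lexical patterns, so the following is only a hypothetical illustration of how a lexical cue can hint at a missing is-a relation. It assumes a simple word-containment heuristic: if every word of one concept name also appears in another, longer name, the longer one may be a specialization of the shorter one. The function names and example terms are invented for this sketch and are not taken from the study.

```python
def words(term):
    """Split a concept name into a set of lowercase words."""
    return set(term.lower().split())


def suggests_missing_is_a(general, specific):
    """Return True when all words of `general` occur in `specific`
    and `specific` has at least one extra word (proper subset test).

    This is a candidate signal for human review, not proof of an error.
    """
    return words(general) < words(specific)


if __name__ == "__main__":
    # Illustrative term pair, chosen only to show the heuristic firing.
    print(suggests_missing_is_a("Fracture of femur", "Open fracture of femur"))
```

A check like this would only be applied to concepts inside an already flagged non-lattice subgraph, and any hit would still need expert confirmation, as in the review step described above.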

Conclusions: Our hybrid structural-lexical method is innovative and proved effective not only in detecting errors in SNOMED CT, but also in suggesting remediation for these errors.


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6080685
DOI: http://dx.doi.org/10.1093/jamia/ocw175

Publication Analysis

Top Keywords

non-lattice subgraphs: 20
missing hierarchical: 12
hierarchical relations: 12
relations concepts: 12
structural-lexical method: 12
lexical patterns: 12
concepts snomed: 8
hybrid structural-lexical: 8
subgraphs reviewed: 8
subgraphs: 7
