Auditing concept categorizations in the UMLS.

Artif Intell Med

Department of Health Informatics, University of Medicine and Dentistry of NJ, Newark, NJ 07107, USA.

Published: May 2004

The Unified Medical Language System (UMLS) integrates about 880,000 concepts from 100 biomedical terminologies. Each concept is assigned at least one semantic type of the Semantic Network. During integration, some categorization errors and inconsistencies are unavoidably introduced. In this paper, we present an auditing technique for finding such errors and inconsistencies. The technique is based on an expert reviewing the pure intersections of meta-semantic types of a metaschema, a compact abstract view of the UMLS Semantic Network. We use a divide-and-conquer approach that handles small pure intersections differently from medium to large ones. This approach limits the review to a set of concepts in which we expect a high percentage of errors. We reviewed all concepts in 657 pure intersections containing one to 10 concepts. Various kinds of errors are identified, and an analysis of the results is presented. We also checked the pure intersections containing more than 10 concepts for semantic soundness; the semantically suspicious ones are presented and their concepts are reviewed.
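The grouping step described in the abstract can be illustrated with a minimal sketch. It assumes that a pure intersection is the set of concepts assigned to exactly one particular combination of two or more meta-semantic types; the abstract does not spell out this definition, so treat it as an assumption. The function names, the toy concept identifiers, and the type labels are hypothetical and are not taken from the paper or from any UMLS tooling.

```python
from collections import defaultdict

# The abstract reviews every concept in intersections of 1 to 10 concepts.
SMALL_MAX = 10


def pure_intersections(concept_types):
    """Group concepts by the exact set of meta-semantic types they carry.

    Assumption: only combinations of two or more types count as
    intersections; concepts with a single type are skipped.
    """
    groups = defaultdict(list)
    for concept, types in concept_types.items():
        key = frozenset(types)
        if len(key) >= 2:
            groups[key].append(concept)
    return dict(groups)


def split_by_size(groups, small_max=SMALL_MAX):
    """Divide-and-conquer split: small groups versus medium-to-large groups."""
    small = {k: v for k, v in groups.items() if len(v) <= small_max}
    large = {k: v for k, v in groups.items() if len(v) > small_max}
    return small, large


# Hypothetical toy data: concept id -> assigned meta-semantic types.
toy = {
    "C1": {"A", "B"},
    "C2": {"A", "B"},
    "C3": {"A"},
    "C4": {"B", "C"},
}

groups = pure_intersections(toy)
# A threshold of 1 is used here only so the toy data falls into both buckets.
small, large = split_by_size(groups, small_max=1)
```

In this sketch, every concept in a small group would go to the expert for review, while each larger group would first be checked as a whole for whether its combination of types makes sense.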


Source
http://dx.doi.org/10.1016/j.artmed.2004.02.002

