Gu H, Perl Y, Elhanan G, Min H, Zhang L,
Peng Y.
Auditing concept categorizations in the UMLS.
Artif Intell Med. 2004 May;31(1):29-44.
Abstract: The Unified Medical Language System (UMLS)
integrates about 880,000 concepts from 100 biomedical
terminologies. Each concept is categorized to at least
one semantic type of the Semantic Network. During the
integration, it is unavoidable that some categorization
errors and inconsistencies will be introduced. In this
paper, we present an auditing technique to find such
errors and inconsistencies. Our technique is based on an
expert reviewing the pure intersections of meta-semantic
types of a metaschema, a compact abstract view of the
UMLS Semantic Network. We use a divide-and-conquer
approach that handles small pure intersections
differently from medium to large ones. This limits the
review to concepts for which we expect a high percentage
of errors. We reviewed
all concepts in 657 pure intersections containing one to
10 concepts. Various kinds of errors were identified, and
an analysis of the results is presented in the paper.
We also checked the pure intersections containing more
than 10 concepts for semantic soundness; the
semantically suspicious pure intersections are presented
in the paper, and their concepts are reviewed.
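
The divide-and-conquer step described in the abstract can be pictured with a minimal sketch. It assumes that a pure intersection is the set of concepts assigned to exactly one particular combination of two or more meta-semantic types; that reading, the function names, and the toy data are illustrative assumptions, not the authors' implementation. The threshold of 10 concepts is taken from the abstract.

```python
from collections import defaultdict

def pure_intersections(assignments):
    """Group concepts by the exact set of meta-semantic types assigned to them.

    Assumption: a "pure intersection" holds the concepts assigned to exactly
    one combination of two or more meta-semantic types.
    """
    groups = defaultdict(set)
    for concept, types in assignments.items():
        key = frozenset(types)
        if len(key) >= 2:  # single-type concepts do not form an intersection
            groups[key].add(concept)
    return dict(groups)

def split_by_size(groups, threshold=10):
    """Separate small intersections (full concept review) from larger ones
    (checked first for semantic soundness as a whole)."""
    small = {k: v for k, v in groups.items() if len(v) <= threshold}
    large = {k: v for k, v in groups.items() if len(v) > threshold}
    return small, large

# Hypothetical toy data: concept identifier -> assigned meta-semantic types.
assignments = {
    "C1": {"Chemical", "Anatomy"},
    "C2": {"Chemical", "Anatomy"},
    "C3": {"Chemical"},
    "C4": {"Disorder", "Procedure"},
}

groups = pure_intersections(assignments)
small, large = split_by_size(groups, threshold=10)
```

In this sketch every concept in `small` would go to the expert reviewer, while each key in `large` would first be judged as a type combination before any of its concepts are examined.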