Zhang L, Hripcsak G, Perl Y, Halper M,
Geller J.
An expert study evaluating the UMLS
lexical metaschema.
Artif Intell Med. 2005 Jul;34(3):219-33.
Abstract: Objective: A metaschema is an abstraction
network of the UMLS’s Semantic Network (SN) obtained
from a connected partition of its collection of semantic
types. A lexical metaschema was previously derived based
on a lexical partition which partitioned the SN into
semantictype groups using identical word-usage among the
names of semantic types and the definitions of their
respective children. In this paper, a statistical
analysis methodology is presented to evaluate the
lexical metaschema based on a study involving a group of
established UMLS experts.
Design: In the study, each expert was asked to identify
subject areas of the SN based on his or her
understanding of the various semantic types. For this
purpose, the expert scans the SN
hierarchy top-down, identifying semantic types, which
are important and different enough
from their parent semantic types, as roots of their
groups. From the response of each expert,
an “expert metaschema” is constructed. The different
experts’ metaschemas can vary widely.
So, additional metaschemas are obtained from
aggregations of the experts’ responses. Of
special interest is the consensus metaschema which
represents an aggregation of a simple
majority of the experts’ responses. Statistical analysis
comparing the lexical metaschema
with the experts’ metaschemas and the consensus
metaschema is presented.
Results: The analysis results shows that 17 out of the
21 meta-semantic types in the lexical metaschema also
appear in the consensus metaschema (about 81%). There
are 107 (about 79%) semantic types covered by identical
meta-semantic types and refinements. The results show
the high similarity between the two metaschemas.
Furthermore, the statistical analysis shows that the
lexical metaschema did not grossly underperform compared
to the experts.
Conclusion: Our study shows that the lexical metaschema
provides a good approximation for
a partition of meaningful subject areas in the SN, when
compared to the consensus metaschema capturing the
aggregation of a simple majority of the human experts’
opinions. |