introduction
research
diagrams
oohvr
aboutus
search

Research Papers: The research papers are organized by the source terminologies and research techniques.

All Publications
Yan Chen, Yehoshua Perl, James Geller, and James J. Cimino. Analysis of a Study of the Users, Uses and Future Agenda of the UMLS. To appear on Journal of the American Medical Informatics Association. 2006 Dec.

Abstract: Objective: The UMLS constitutes the largest existing collection of medical terms. However, little has been published about the users and uses of the UMLS. This study sheds lights on these issues.
Design: We designed a questionnaire consisting of 26 questions and distributed it to the UMLS user mailing list. Participants were assured complete confidentiality of their replies. To further encourage list members to respond, we promised to provide them with early results prior to publication. Sector analysis of the responses, according to employment organizations is used to obtain insights into some responses.
Result: We received 70 responses. The study confirms two intended uses of the UMLS, access to source terminologies (75%) and mapping among them (44%). However, most access is just to a few sources, led by SNOMED, MeSH and ICD. Out of 119 reported purposes of use, terminology research (37), information retrieval (19), and terminology translation (14) lead. Four important observations are that the UMLS is widely used as a terminology (77%), even though it was not designed as one; many users (73%) want the NLM to mark concepts with multiple parents in an indented hierarchy and to derive a terminology from the UMLS (73%). Finally, auditing the UMLS is a top budget priority (35%) for users.
Conclusion: The study reports many uses of the UMLS in a variety of subjects from terminology research to decision support and phenotyping. The study confirms that the UMLS is used to access its source terminologies and to map among them. Two primary concerns of the existing user base are auditing the UMLS and the design of a UMLS-based derived terminology.

Yue Wang, Michael Halper, Hua Min, Yehoshua Perl, Yan Chen, Kent A. Spackman. Structural Methodologies for Auditing SNOMED. Pending after revision and Journal of Biomedical Informatics.

Abstract: SNOMED is one of the leading health care terminologies being used worldwide. As such, quality assurance is an important part of its maintenance cycle. Methodologies for auditing SNOMED based on structural aspects of its organization are presented. In particular, automated techniques for partitioning SNOMED into smaller groups of concepts based primarily on relationships patterns are defined. Two abstraction networks, the area taxonomy and p-area taxonomy, are derived from the partitions. The highlevel views afforded by these abstraction networks form the basis for systematic auditing. The networks tend to highlight errors that manifest themselves as irregularities at the abstract level. They also support group-based auditing, where sets of purportedly similar concepts are focused on for review. The auditing methodologies are demonstrated on one of SNOMED top-level hierarchies. Errors discovered during the auditing process are reported.

Min H, Perl Y, Chen Y, Halper M, Geller J, Wang Yue. Auditing as part of the Terminology Design Life Cycle. To appear on J Am Med Inform Assoc. 2006 Nov.

Abstract: Objective: To develop and test an auditing methodology for detecting errors in medical terminologies satisfying systematic inheritance. This methodology is based on various abstraction taxonomies that provide high-level views of a terminology and highlight potentially erroneous concepts.

Zhang L, Halper M, Perl Y, Geller J, Cimino JJ. Relationship structures and semantic type assignments of the UMLS Enriched Semantic Network. J Am Med Inform Assoc. 2005 Nov-Dec;12(6):657-66. Epub 2005 Jul 27.

Abstract: Objective: The Enriched Semantic Network (ESN) was introduced as an extension of the Unified Medical Language System (UMLS) Semantic Network (SN). Its multiple subsumption configuration and concomitant multiple inheritance make the ESN's relationship structures and semantic type assignments different from those of the SN. A technique for deriving the relationship structures of the ESN's semantic types and an automated technique for deriving the ESN's semantic type assignments from those of the SN are presented.

Zhang L, Hripcsak G, Perl Y, Halper M, Geller J. An expert study evaluating the UMLS lexical metaschema. Artif Intell Med. 2005 Jul;34(3):219-33.

Abstract:Objective: A metaschema is an abstraction network of the UMLS’s Semantic Network (SN) obtained from a connected partition of its collection of semantic types. A lexical metaschema was previously derived based on a lexical partition which partitioned the SN into semantictype groups using identical word-usage among the names of semantic types and the definitions of their respective children. In this paper, a statistical analysis methodology is presented to evaluate the lexical metaschema based on a study involving a group of established UMLS experts.
Design: In the study, each expert was asked to identify subject areas of the SN based on his or her understanding of the various semantic types. For this purpose, the expert scans the SN
hierarchy top-down, identifying semantic types, which are important and different enough
from their parent semantic types, as roots of their groups. From the response of each expert,
an “expert metaschema” is constructed. The different experts’ metaschemas can vary widely.
So, additional metaschemas are obtained from aggregations of the experts’ responses. Of
special interest is the consensus metaschema which represents an aggregation of a simple
majority of the experts’ responses. Statistical analysis comparing the lexical metaschema
with the experts’ metaschemas and the consensus metaschema is presented.
Results: The analysis results shows that 17 out of the 21 meta-semantic types in the lexical metaschema also appear in the consensus metaschema (about 81%). There are 107 (about 79%) semantic types covered by identical meta-semantic types and refinements. The results show the high similarity between the two metaschemas. Furthermore, the statistical analysis shows that the lexical metaschema did not grossly underperform compared to the experts.
Conclusion: Our study shows that the lexical metaschema provides a good approximation for
a partition of meaningful subject areas in the SN, when compared to the consensus metaschema capturing the aggregation of a simple majority of the human experts’ opinions.

Zhang L, Perl Y, Halper M, Geller J, Hripcsak G. A lexical metaschema for the UMLS semantic network. Artif Intell Med. 2005 Jan;33(1):41-59.

Abstract: Objective: A metaschema is a high-level abstraction network of the UMLS Semantic Network (SN) obtained from a partition of the SN collection of semantic types. Every metaschema has nodes, called meta-semantic types, each of which denotes a group of semantic types constituting a subject area of the SN. A new kind of metaschema, called the lexical metaschema, is derived from a lexical partition of the SN. The lexical metaschema is compared to previously derived metaschemas, e.g., the cohesive metaschema.

Gu H, Perl Y, Elhanan G, Min H, Zhang L, Peng Y. Auditing concept categorizations in the UMLS. Artif Intell Med. 2004 May;31(1):29-44.

Abstract: The Unified Medical Language System (UMLS) integrates about 880,000 concepts from 100 biomedical terminologies. Each concept is categorized to at least one semantic type of the Semantic Network. During the integration, it is unavoidable that some categorization errors and inconsistencies will be introduced. In this paper, we present an auditing technique to find such errors and inconsistencies. Our technique is based on an expert reviewing the pure intersections of meta-semantic types of a metaschema, a compact abstract view of the UMLS Semantic Network. We use a divide and conquer approach, handling differently small pure intersections and medium to large pure intersections. By using this approach, we limit the number of concepts reviewed, for which we expect a high percentage of errors. We reviewed all concepts in 657 pure intersections containing one to 10 concepts. Various kinds of errors are identified and the analysis of the results are presented in the paper. Also, we checked the pure intersections containing more than 10 concepts for their semantic soundness, where the semantically suspicious pure intersections are presented in the paper and their concepts are reviewed.

   H. Gu, M. Halper, J. Geller and E. Neuhold. Contextual Partitioning for Comprehension of OODB Schemas, Knowledge and Information Systems(KAIS), 6(3), May 2004, pp.315-344.

Abstract: Object-oriented databases (OODBs) have been utilized for complex modeling tasks within a variety of application domains. The OODB schema, typically expressed in a graphical notation, can serve as a useful presentation tool for the information contained in the underlying OODB. However, such a schema can be a large, complex network of classes and relationships. This may greatly hinder its effectiveness in helping users gain an understanding of the OODB contents and data organization. To facilitate this orientation process, a theoretical framework is presented that guides the refinement\/ of an existing schema subclass-of\/ relationship hierarchy ?the backbone of any OODB. The framework sets forth three rules which, when satisfied, lead to the establishment of a collection of contexts, each of which exhibits an internal subclass-of\/ tree structure. A formal proof of this result is presented. An algorithmic methodology, involving a humanomputer interaction, describes how the approach can be applied to a given OODB schema. An application of the methodology to an example OODB schema is included.

Zhang L, Perl Y, Halper M, Geller J, Cimino JJ. An enriched unified medical language system semantic network with a multiple subsumption hierarchy. J Am Med Inform Assoc. 2004 May-Jun;11(3):195-206. Epub 2004 Feb 5.

Abstract: Objective: The Unified Medical Language System (UMLS) Semantic Network (SN) two-tree structure is restrictive because it does not allow a semantic type to be a specialization of several other semantic types. In this paper, the SN is expanded into a multiple subsumption structure with a directed acyclic graph (DAG) IS-A hierarchy, allowing a semantic type to have multiple parents. New viable IS-A links are added as warranted.

Cimino JJ, Min H, Perl Y. Consistency across the hierarchies of the UMLS Semantic Network and Metathesaurus. J Biomed Inform. 2003 Dec;36(6):450-61.

Abstract: Objective: To develop and test a method for automatically detecting inconsistencies between the parent-child is-a relationships in the Metathesaurus and the ancestor-descendant relationships in the Semantic Network of the Unified Medical Language System (UMLS).

Zhang L, Perl Y, Halper M, Geller J. Designing metaschemas for the UMLS enriched semantic network. J Biomed Inform. 2003 Dec;36(6):433-49.

Abstract: Objective: The Unified Medical Language System (UMLS) designed by NLM combines many well established authoritative medical informatics terminologies in one knowledge representation system. Such a resource is very valuable to the healthcare community and industry. However, the UMLS is very large and complex and poses serious comprehension problems for users and maintenance personnel. We present a representation to support the comprehension and navigation of the UMLS.

Perl Y, Geller J. Research on structural issues of the UMLS--past, present, and future. J Biomed Inform. 2003 Dec;36(6):409-13.

Abstract: None.

James Geller, Huanying Gu, and Michael Halper. Semantic refinement and error correction in large terminological knowledge bases, Data & Knowledge Engineering, 45(1), 2003, pp. 1-32.

Abstract: Capturing the semantics of concepts in a terminology has been an important problem in AI. A two-level approach has been proposed where concepts are classified into high-level semantic types, with these types constituting a portion of the concepts' semantics. We present an algorithmic methodology for refining such two-level terminologic networks. A new network is produced consisting of "pure" semantic types and intersection types. Concepts are uniquely re-assigned to these new types. Overall, these types form a better conceptual abstraction, with each exhibiting uniform semantics. using them, it becomes easier to detect classification errors. The methodology is applied to the UMLS.

Li-min Liu, James Geller and Yehoshua Perl, Enhancing OODB semantics to support browsing in an OODB vocabulary representation, Concurrency and Computation: Practice and Experience, 15(9), August 2003, pp. 845--869.

Abstract: In previous work, we have modeled a vocabulary given as a semantic network by an OODB (Object-Oriented Database). The OODB schema thus obtained provides a compact abstract view of the vocabulary. This enables fast traversal of the vocabulary by a user. In the semantic network vocabulary, the IS-A relationships express the specialization hierarchy. In our OODB modeling of the vocabulary, the SUBCLASS relationship expresses the specialization hierarchy of the classes and supports the inheritance of their properties. A typical IS-A path in the vocabulary has a corresponding shorter SUBCLASS path in the OODB schema. In the current paper we expose several cases where the SUBCLASS hierarchy fails to fully correspond to the IS-A hierarchy of the vocabulary. In these cases there exist traversal paths in the semantic network for which there are no corresponding traversal paths in the OODB schema. The reason for this failure is the existence of some IS-A relationships between concepts of two classes, that are not connected by a SUBCLASS relationship. This
phenomenon weakens the accuracy of our modeling. To rectify the situation we introduce a new OODB semantic relationship IS-A?to represent the existence of IS-A relationships between concepts of a pair of classes which are not connected via a SUBCLASS relationship. The resulting schema contains both SUBCLASS relationships and IS-A?relationships which completely models the ISA hierarchy of the vocabulary. We define a mixed class level traversal path to contain either SUBCLASS or IS-A? relationships. Consequently, each traversal path in the semantic network has a corresponding mixed traversal path in the OODB schema. Hence the introduction of the semantic OODB IS-A?relationship improves the modeling of semantic network vocabularies by OODBs.

Y. Perl, Z. Chen, M. Halper, J. Geller, L. Zhang, and Y. Peng. The cohesive metaschema: A higher-level abstraction of the UMLS Semantic Network. Journal of Biomedical Informatics, (35)3, pages 194-212, June 2003.

Abstract: The Unified Medical Language System (UMLS) joins together a group of established medical terminologies in a unified knowledge representation framework. Two major resources of the UMLS are its Metathesaurus, containing a large number of concepts, and the Semantic Network (SN), containing semantic types and forming an abstraction of the Metathesaurus. However, the SN itself is large and complex and may still be difficult to view and comprehend. Our structural partitioning technique partitions the SN into structurally uniform sets of semantic types based on the distribution of the relationships within the SN. An enhancement of the structural partition results in cohesive, singly rooted sets of semantic types. Each such set is named after its root which represents the common nature of the group. These sets of semantic types are represented by higher-level components called metasemantic types. A network, called a metaschema, which consists of the meta-semantic types connected by hierarchical and semantic relationships is obtained and provides an abstract view supporting orientation to the SN. The metaschema is utilized to audit the UMLS classifications. We present a set of graphical views of the SN based on the metaschema to help in user orientation to the SN. A study compares the cohesive metaschema to metaschemas derived semantically by UMLS experts.

J. Geller, Y. Perl, M. Halper, Z. Chen and H. Gu, Evaluation and Application of a Semantic Network Partition, IEEE Transactions on Information Technology in Biomedicine, 6(2), pp. 109-115, 2002.

Abstract: Semantic networks are excellent knowledge representation structures.However, large semantic networks are hard to comprehend. To overcome this difficulty, several methods of partitioning have been developed which rely on different mixes of structural and semantic methods. However, little has appeared in the literature concerning the question whether a partition of a semantic network creates subnetworks that agree with human in-sight. We address this issue by presenting a comparison between the results of an algorithmic partitioning method and a partition created by a group of experts. Subsequently, we show how a network partition can be used to generate various partial views of a semantic network, which facilitate user orientation. Examles from the UMLS Semantic Network are used to demonstrate partial views.

Z. Chen, Y. Perl, M. Halper, J. Geller, H. Gu, Partitioning the UMLS Semantic Network, IEEE Transactions on Information Technology in Biomedicine, 6(2), pp. 102-108, 2002.

Abstract: The unified medical language system (UMLS) integrates many well-established biomedical terminologies. The UMLS semantic network (SN) can help orient users to the vast knowledge content of the UMLS Metathesaurus (META) via its abstract conceptual view. However, the SN itself is large and complex and may still be difficult to comprehend. Our technique partitions the SN into smaller meaningful units amenable to display on limited-sized computer screens. The basis for the partitioning is the distribution of the relationships within the SN. Three rules are applied to transform the original partition into a second more cohesive partition.

Li-Min Liu, Michael Halper, James Geller and Yehoshua Perl, Using OODB Modeling to Partition a Vocabulary into Structurally and Semantically Uniform Concept Groups, Transactions on Knowledge and Data Engineering, in press, 2001.

Abstract: Controlled Vocabularies (CVs) are networks of concepts that unify disparate terminologies and facilitate the process of information sharing within an application domain. We describe a general methodology for representing an existing CV as an object-oriented database (OODB), called an Object-Oriented Vocabulary Repository (OOVR).

H. Gu, Y. Perl,  M. Halper, J. Geller, F. Kuo and J.J. Cimino, Partitioning an Object-Oriented Terminology Schema, Methods in Medical Informatics , 40, pp. 204­212, (2001).

Abstract: Controlled medical terminologies are increasingly becoming strategic components of various healthcare enterprises. However, the typical medical terminology can be difficult to exploit due to its extensive size and high density. The schema of a medical terminology offered by an object-oriented representation is a valuable tool in providing an abstract view of the terminology, enhancing comprehensibility and making it more usable. However, schemas themselves can be large and unwieldy. We present a methodology for partitioning a medical terminology schema into manageably sized fragments that promote increased comprehension. Our methodology has a refinement process for the subclass hierarchy of the terminology schema. The methodology is carried out by a medical domain expert in conjunction with a computer. The expert is guided by a set of three modeling rules, which guarantee that the resulting partitioned schema consists of a forest of trees. This makes it easier to understand and consequently use the medical terminology. The application of our methodology to the schema of the Medical Entities Dictionary (MED) is presented.

H. Gu, Y. Perl,  J. Geller, M. Halper, L. Liu and J.J. Cimino, Representing the UMLS as an OODB: Modelling Issues and Advantages. Journal of the American Medical Informatics Association, 7(1), January 2000, pp.66-80

Abstract: Objective: The Unified Medical Language System (UMLS) designed by NLM combines many well established authoritative medical informatics terminologies in one knowledge representation system. Such a resource is very valuable to the healthcare community and industry. However, the UMLS is very large and complex and poses serious comprehension problems for users and maintenance personnel. We present a representation to support the comprehension and navigation of the UMLS.

H.Gu, M. Halper, J. Geller and Y. Perl,  Benefits of an Object-oriented Database representation for Controlled Medical Terminologies. Journal of the American Medical Informatics Association, 6(4), 283-303, July 1999.

Abstract: Objective: Controlled medical terminologies (CMTs) have been recognized as important tools in a variety of medical informatics applications ranging from patient-record systems to decision-support systems. CMTs are typically organized in semantic network structures consisting of tens to hundreds of thousands of concepts. This overwhelming size and complexity can be a serious barrier to their maintenance and wide-spread utilization. In this paper, we propose the use of object-oriented databases (OODBs) to address the problems posed by the extensive scope and high complexity of most CMTs for maintenance personnel and general user alike.

L. Liu, M. Halper, J. Geller and Y.Perl. Controlled Vocabularies in OODBs: Modeling issues and implementation. Distributed and Parallel Databases, 7(1), 37-65, January 1999.

Abstract: A major problem that arises in many large application domains is the discrepancy among terminologies of different information systems. The terms used by the information systems of one organization may not agree with the terms used by another organization even when they are in the same domain. Such a situation clearly impedes communication and the sharing of information, and decreases the efficiency of doing business. Problems of this nature can be overcome using a controlled vocabulary (CV), a system of concepts that consolidates and unifies the terminologies of a domain. However, CVs are large and complex and difficult to comprehend. This paper presents a methodology for representing a semantic network-based CV as an object-oriented database (OODB). We call such a representation an Object-Oriented Vocabulary Repository (OOVR). The methodology is based on a structural analysis and partitioning of the source CV. The representation of a CV as an OOVR offers both the level of support typical of database management systems and an abstract view which promotes comprehension of the CV structure and content. After discussing the theoretical aspects of the methodology, we apply it to the MED and InterMED, two existing CVs from the medical field. A program, called the OOVR Generator, for automatically carrying out our methodology is described. Both the MED-OOVR and the InterMED-OOVR have been created using the OOVR Generator, and each exists on top of ONTOS, a commercial OODBMS. The OOVR derived from the InterMED is presently available on the Web.

H. Gu, Y. Perl, J. Geller, M. Halper and M. Singh, A methodology for Partitioning a Vocabulary Hierarchy into Trees, Artificial Intelligence in Medicine, 15 77-98, 1999.

Abstract: Controlled medical vocabularies are useful in application areas such as medical information systems and decision-support system. However, such vocabularies are large and complex, and working with them can be daunting. It is important to provide a means for orienting vocabulary designers and users to the vocabulary's contents. We describe a methodology for partitioning a vocabulary based on an IS-A hierarchy into small meaningful pieces. The methodology uses our disciplined modeling framework to refine the IS-A hierarchy according to prescribed rules in a process carried out by a user in conjunction with the computer. The partitioning of the hierarchy implies a partitioning of the vocabulary. We demonstrate the methodology with respect to a complex sample of the MED, an existing medical vocabulary.

Li Zhang, Yehoshua Perl, Michael Halper, James Geller, James J. Cimino. Enriching the Structure of the UMLS Semantic Network. In the proceding of the 2002 AMIA Conference

Abstract: The Unified Medical Language System (UMLS) Semantic Network (SN) onsisting of a network of semantic typesas a two-tree structure, where each semantic type has at most one parent semantic type. This arrangement is restrictive because some semantic types are, by their definition, specializations of several parents. As a proposed enhancement to the SN, its semantic types have previously been partitioned into groups, each of which contains semantic types of some specific area. However, some groups of this proposed partition contain forest (i.e., multiple-tree) structures or even isolated semantic types. Both situations imply a disconnected internal structure. Connectivity is actually one way to assess the proposed emantic validity? principle for partitions. It is a desired, although not required, property. In this paper, we introduce a methodology for identifying issing? IS-A links and adding them to the SN. This process transforms the SN into a Directed Acyclic Graph (DAG) structure, with semantic types permitted to have multiple parents. A result of our methodology is the transformation of the proposed SN partition into groups satisfying the connectivity property.

Yi Peng, Michael Halper. Auditing the UMLS For Redundant Classifications. In the proceding of the 2002 AMIA Conference, pp 612-616

Abstract: The UMLS Semantic Network (SN) serves as a valuable abstraction for the underlying concept repository called the Metathesaurus (META). Specifically, the SN forms a classification layer for the META, with each of the META constituent concepts assigned to one or more semantic types in the SN. The rule in the design of the SN is to have concepts explicitly assigned to the lowest possible semantic types in the SN IS-A hierarchy. Implicit assignment to higher semantic types can be inferred via the IS-A relationships. However, in subsequent versions of the UMLS, unnecessary, simultaneous assignments to descendant and ancestor semantic types have been discovered (e.g., 8,622 in the UMLS 1998 version and 12,657 in the 2001 version). The assignment of concepts to such ancestor semantic types is called redundant classification. There is a need for an automated auditing tool that can identify all these redundant classifications. In this paper, an efficient algorithm for this auditing task is introduced. Details of its application to the current (2001) version of the UMLS are presented and the results are discussed.

Huanying (Helen) Hu, Hua Min, Yi Peng, Li Zhang, Yehoshua Perl. Using the Metaschema to Audit UMLS Classification Errors. In the proceding of the 2002 AMIA Conference, pp 310-314

Abstract: li.pdf is for Enriching the Structure of the UMLS Semantic Network; not for this paper. Need to find the ps file then convert it into pdf and link it.

H. Gu, J. Geller, L. Liu, and M. Halper. Using a similarity measurement to partition a vocabulary of medical concepts. In T. Bench-Capon, G. Soda, and A. M. Tjoa, editors, Proceedings of the 10th International Conference on Database and Expert Systems Applications: DEXA'99, Florence, Italy, August 1999. Published as: Lecture Notes in Computer Science, 1677, Springer-Verlag, New York, pages 712-723.

Abstract: Repeat, Auditing concept categorizations in the UMLS

H.Gu,Y. Perl, J. Geller, M. Halper, L. Liu, and J. J.Cimino. Modeling the UMLS using an OODB. In N. M. Lorenzi, editor, Proceedings of the 1999 American Medical Informatics Association (AMIA) Annual Symposium, pages 82-86, Washington, DC, November1999.

Abstract: Objective: The Unified Medical Language System (UMLS) designed by NLM combines many well established authoritative medical informatics terminologies in one knowledge representation system. Such a resource is very valuable to the healthcare community and industry. However, the UMLS is very large and complex and poses serious comprehension problems for users and maintenance personnel. We present a representation to support the comprehension and navigation of the UMLS.

H. Gu, L. Liu, M. Halper , J. Geller, and Y. Perl. Converting an Integrated Hospital Formulary into an Object-Oriented Database Representation. In C. G. Chute, editor, Proceedings of AMIA'98, Orlando, FL, pp 770-774, November, 1998.

Abstract: Controlled Medical Vocabularies (CMVs) have proven to be extremely useful in their support of the tasks of information sharing and integration, communication among various software applications, and decision support. Modeling a CMV as an Object-Oriented Database (OODB) provides additional benefits such as increased support for vocabulary comprehension and flexible access. In this paper, we describe the process of modeling and converting an existing integrated hospital formulary (i.e., set of pharmacological concepts) into an equivalent OODB representation, which, in general, we refer to as an Object-Oriented Healthcare Vocabulary Repository (OOHVR). The source for our example OOHVR is a formulary provided by the Connecticut Healthcare Research and Education Foundation (CHREF). Utilizing this source formulary together with the semantic hierarchy composed of major and minor drug classes defined as part of the National Drug Code (NDC) directory, we constructed a CMV that was eventually converted into its OOHVR form (the CHREF-OOHVR). The actual conversion step was carried out automatically by a program, called the OOHVR Generator, that we have developed. At present, the CHREF-OOHVR is running on top of ONTOS, a commercial OODB management system, and is accessible on the Web.

H. Gu, Y. Perl, J. Geller, M. Halper, J. J. Cimino, and M. Singh. Partitioning a vocabulary's IS-A hierarchy into trees. In D. R. Masys, editor, Proceedings of the 1997 American Medical Informatics Association (AMIA) Annual Fall Symposium, pages 630-634, Nashville, TN, October 1997.

Abstract: Controlled medical vocabularies are useful in application areas such as medical information-systems and decision-support. However, such vocabularies are large and complex, and working with them can be daunting. It is important to provide a means for orienting users to the vocabulary's contents. This paper introduces a methodology for partitioning a vocabulary into small, meaningful pieces. The partitioning is done with respect to the vocabulary's IS-A hierarchy. The methodology, based on a set of rules for refining the IS-A hierarchy, is a process carried out by a user in conjunction with the computer. The methodology is demonstrated on a complex portion of a vocabulary.

L. Liu, M. Halper, H.Gu, J. Geller, and Y. Perl. Modeling a vocabulary in an object-oriented database. In K. Barker and M. T. Ozsu, editors, CIKM-96, Proceedings of the Fifth International Conference on Information and Knowledge Management, pages 179-188, Rockville, MD, November 1996.

Abstract: Controlled vocabularies have been used as the means for unifying disparate terminologies found within an application field. This unification leads to better administration of information and enhanced communication among various parties. Semantic networks have been shown to be excellent vehicles for modeling controlled vocabularies. However, they often lack the necessary access flexibility and robustness required by external agents such
as intelligent information-locators and decision-support systems. In this paper, we describe the process of mapping an existing medical vocabulary based on a semantic network model into an Object-Oriented Database (OODB) system. We first consider two straightforward approaches to carrying out this task and describe their deficiencies. We then present a new approach which yields a very compact OODB schema for the representation of the vocabulary entire hierarchy and inter-connectivity. We refer to the resulting OODB as the Object-Oriented Healthcare Vocabulary Repository (OOHVR), which is currently up and running in the
context of ONTOS, a commercially available OODB system.

M. Halper, H.Gu, J. J. Cimino, J. Geller, and Y. Perl. Comprehending the structure of a medical vocabulary using object-oriented database modeling. OOPSLA'96 Workshop on Object-Oriented Technology for Health Care and Medical Information Systems, San Jose, CA, October 1996. Position Paper.

Abstract: The purpose of this paper is to demonstrate how the transformation of a medical vocabulary based on a Semantic Network (SN) model into a vocabulary based on an Object-Oriented Database (OODB) model helps in the maintenance of the vocabulary. We describe an OODB schema which captures the overall structure of the vocabulary in a compact form and uncovers some errors and inconsistencies made in the vocabulary's original modeling.

H. Gu, J. J. Cimino, M. Halper, J. Geller, and Y. Perl. Utilizing OODB schema modeling for vocabulary management. In J. J. Cimino, editor, Proceedings of the 1996 American Medical Informatics Association (AMIA) Annual Fall Symposium, pages 274-278, Washington, DC, October 1996.

Abstract: Comprehension of complex controlled vocabularies is often difficult. We present a method, facilitated by an object-oriented database, for depicting such a vocabulary (the Medical Entities Dictionary (MED) from the Columbia-Presbyterian Medical Center) in a schematic way which uses a sparse inheritance network of area classes. The resulting Object Oriented Health Vocabulary repository (OOHVR) allows visualization of the 43,000 MED concepts as 90 area classes. This view has provided valuable information to those responsible with maintaining the MED. As a result, the MED organization has been improved and some previously-unrecognized errors and inconsistencies have been removed. We believe that this schematic approach allows improved comprehension of the gestalt of large controlled medical vocabulary.

There are 33 papers in this category.

 

 

Home Site Map Search    
Copyright © 2006 SABOT Center. All rights reserved. For questions or comments, please contact the Webmaster. Last Update Date: Wed, 04/30/2008 11:20