Advertisement

Controlled Vocabularies in OODBs: Modeling Issues and Implementation

  • Li-Min Liu
  • Michael Halper
  • James Geller
  • Yehoshua Perl
Chapter

Abstract

A major problem that arises in many large application domains is the discrepancy among terminologies of different information systems. The terms used by the information systems of one organization may not agree with the terms used by another organization even when they are in the same domain. Such a situation clearly impedes communication and the sharing of information, and decreases the efficiency of doing business. Problems of this nature can be overcome using a controlled vocabulary (CV), a system of concepts that consolidates and unifies the terminologies of a domain. However, CVs are large and complex and difficult to comprehend. This paper presents a methodology for representing a semantic network-based CV as an object-oriented database (00DB). We call such a representation an Object-Oriented Vocabulary Repository (OOVR). The methodology is based on a structural analysis and partitioning of the source CV. The representation of a CV as an OOVR offers both the level of support typical of database management systems and an abstract view which promotes comprehension of the CV’s structure and content. After discussing the theoretical aspects of the methodology, we apply it to the MED and InterMED, two existing CVs from the medical field. A program, called the OOVR Generator, for automatically carrying out our methodology is described. Both the MED-OOVR and the InterMED-OOVR have been created using the OOVR Generator, and each exists on top of ONTOS, a commercial OODBMS. The OOVR derived from the InterMED is presently available on the Web.

Keywords

Controlled Vocabulary Object-Oriented Database Object-Oriented Modeling Terminology Ontology Ontology Modeling 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    American Society of Hospital Pharmacists, Bethesda, MD. American Hospital Formulary Service Drug Information,1997. Updated annually.Google Scholar
  2. 2.
    R. J. Brachman. On the epistemological status of semantic networks. In N. V. Findler, editor, Associative Networks: Representation and Use of Knowledge by Computers, pages 3–50. Academic Press, Inc., New York, NY, 1979.Google Scholar
  3. 3.
    S. K. Card. User limits and the VDT computer interface (excerpt). In R. M. Baecker and W. A. S. Buxton, editors, Readings in Human-Computer Interaction, pages 180–191. Morgan Kaufmann Publishers, Inc., Los Altos, CA, 1987.Google Scholar
  4. 4.
    M. J. Carey, D. J. DeWitt, J. E. Richardson, and E. J. Shekita. Storage management for objects in EXODUS. In W. Kim and F. H. Lochovsky, editors, Object-Oriented Concepts, Databases, and Applications, pages 341–369. ACM Press. New York, NY, 1989.Google Scholar
  5. 5.
    R. G. G. Cattell and D. K. Barry, editors. The Object Database Standard: ODMG 2.0. Morgan Kaufmann Publishers, Inc., San Francisco. CA, 1997.zbMATHGoogle Scholar
  6. 6.
    P. P.-S. Chen. The Entity-Relationship Model: Toward a unified view of data. ACM Trans. Database Syst., 1 (1): 9–36, 1976.CrossRefGoogle Scholar
  7. 7.
    J. J. Cimino. Personal communication, 1997.Google Scholar
  8. 8.
    J. J. Cimino, P. D. Clayton, G. Hripcsak, and S. B. Johnson. Knowledge-based approaches to the maintenance of a large controlled medical terminology. JAMIA, 1(1):35–50, 1994.Google Scholar
  9. 9.
    J. J. Cimino, G. Hripcsak, S. B. Johnson, and P. D. Clayton. Designing an introspective, multipurpose, controlled medical vocabulary. In Proc. Thirteenth Annual Symposium on Computer Applications in Medical Care, pages 513–517, Washington, DC, Nov. 1989.Google Scholar
  10. 10.
    College of American Pathologists, Skokie, IL. Systematized Nomenclature of Medicine, second edition, 1982.Google Scholar
  11. 11.
    O. Deux et al. The 02 system. Commun. ACM, 34 (10): 34–48, Oct. 1991.CrossRefGoogle Scholar
  12. 12.
    D. H. Fischer. Consistency rules and triggers for thesauri. Int. Classif., 18 (4): 212–225, 1991.Google Scholar
  13. 13.
    D. H. Fischer. Consistency rules and triggers for multilingual terminology. In Proc. TKE’93, Terminology and Knowledge Engineering, pages 333–342, 1993.Google Scholar
  14. 14.
    GemStone Systems, Inc. URL: http://www.gemstone.com.
  15. 15.
    C. A. Goble, A. J. Glowinski, W. A. Nolan, and A. L. Rector. A descriptive semantic formalism for medicine. In Proc. 9th ICDE, pages 624–631, Vienna, Austria, 1993.Google Scholar
  16. 16.
    H. Gu, J. Cimino, M. Halper, J. Geller, and Y. Perl. Utilizing OODB schema modeling for vocabulary management. In J. Cimino, editor, Proc. /996AMIA Annual Fall Symposium, pages 274–278, Washington, DC, Oct. 1996.Google Scholar
  17. 17.
    H. Gu, Y. Perl, J. Geller, M. Halper, J. Cimino, and M. Singh. Partitioning a vocabulary’s IS-A hierarchy into trees. In D. R. Masys, editor, Proc. 1997 AMIA Annual Fall Symposium, pages 630–634, Nashville, TN, Oct. 1997.Google Scholar
  18. 18.
    M. Halper, R. Galnares, J. Geller, and Y. Perl. An analogical, Web-based interface to an OODB medical vocabulary. In preparation.Google Scholar
  19. 19.
    M. Hammer and D. McLeod. Database description with SDM: A semantic database model. ACM Trans. Database Syst., 6 (3): 351–386, 1981.Google Scholar
  20. 20.
    R. Hull and R. King. Semantic database modeling: Survey, applications, and research issues. ACM Comput. Surv., 19 (3): 201–260, Sept. 1987.MathSciNetGoogle Scholar
  21. 21.
    N. Ide, J. L. Maitre, and J. Véronis. Outline of a model for lexical databases. Information Processing and Management, 29 (2): 159–186, 1993.CrossRefGoogle Scholar
  22. 22.
    P. D. Karp, K. Myers, and T. Gruber. The generic frame protocol. In Proc. IJCAI-95, pages 768–774, Montreal, Canada, 1995.Google Scholar
  23. 23.
    P. D. Karp and S. M. Paley. Knowledge representation in the large. In Proc. IJCAI-95, pages 751–758, Montreal, Canada, 1995.Google Scholar
  24. 24.
    M. Kifer, W. Kim, and Y. Sagiv. Querying object-oriented databases. In Proc. 1992 ACM SIGMOD Conference on Management of Data, San Diego, CA, June 1992.Google Scholar
  25. 25.
    W. Kim and F. H. Lochovsky, editors. Object-Oriented Concepts, Databases, and Applications. ACM Press, New York, NY, 1989.Google Scholar
  26. 26.
    C. Lamb, G. Landis, J. Orenstein, and D. Weinreb. The ObjectStore database system. Commun. ACM, 34 (10): 50–63, Oct. 1991.CrossRefGoogle Scholar
  27. 27.
    F. Lehmann. Semantic networks. In [28], pages 1–50.Google Scholar
  28. 28.
    F. Lehmann, editor. Semantic Networks in Artificial Intelligence. Pergamon Press, Tarrytown, NY, 1992.Google Scholar
  29. 29.
    D. B. Lenat and R. V. Guha. Building Large Knowledge-Based Systems: Representation and Inference in the Cyc Project. Addison-Wesley Publishing Co., Inc., Reading, MA, 1990.Google Scholar
  30. 30.
    L. Liu, M. Halper, H. Gu, J. Geller, and Y. Perl. Modeling a vocabulary in an object-oriented database. In K. Barker and M. T. Ozsu, editors, CIKM-96, Proc. 5th Int’l Conference on Information and Knowledge Management, pages 179–188, Rockville, MD, Nov. 1996.CrossRefGoogle Scholar
  31. 31.
    E. Mays, C. Apte, J. Griesmer, and J. Kastner. Experience with K-Rep: An object-centered knowledge representation language. In Proc. IEEE Al Application Conference, San Diego, CA, Mar. 1988.Google Scholar
  32. 32.
    G. A. Miller. The magical number seven, plus or minus two: Some limits on our capacity for processing information. Psych. Rev., 63, 1956.Google Scholar
  33. 33.
    G. A. Miller. WordNet: A lexical database for English. Commun. ACM, 38 (11): 39–41, 1995.CrossRefGoogle Scholar
  34. 34.
    W. Möhr and L. Rostek. TEDI: An object-oriented terminology editor. In Proc. TKE’93, Terminology and Knowledge Engineering, pages 363–374, 1993.Google Scholar
  35. 35.
    J. Mylopoulos, A. Borgida, M. Jarke, and M. Koubarakis. Telos: Representing knowledge about information systems. TOIS, 8 (4): 325–362, 1990.CrossRefGoogle Scholar
  36. 36.
    J. Mylopoulos, V. Chaudhri, D. Plexousakis, A. Shrufi, and T. Topaloglou. Building knowledge base management systems: A progress report. Technical Report DKBS-TR-94–4, Department of Computer Science, University of Toronto, 1994.Google Scholar
  37. 37.
    The National Drug Code Directory. URL: http://www.fda.gov/cder/ndc/index.htm.Google Scholar
  38. 38.
    N. F. Noy and C. D. Hafner. The state of the art in ontology design: A survey and comparative review. AI Magazine, 18 (3): 53–74, Fall 1997.Google Scholar
  39. 39.
    Welcome to ODI. URL: http://www.odi.com.
  40. 40.
    D. E. Oliver, E. H. Shortliffe, and InterMed Collaboratory. Collaborative model development for vocabulary and guidelines. In J. Cimino, editor, Proc. /996 AMIA Annual Fall Symposium, page 826, Washington, DC, Oct. 1996.Google Scholar
  41. 41.
    ONTOS Home Page. URL: http://www.ontos.com.
  42. 42.
    ONTOS, Inc. Lowell, MA. ONTOS DB 3.1 Reference Manual, 1995.Google Scholar
  43. 43.
    The OOVR Browser. URL: http://object.njit.edu:2000/Thewoohvr/JBI/INTERMED/InterTerms.html.
  44. 44.
    Object Technology by Ardent Software (02 System). URL: http://www.ardentsoftware.com/objectlindex.html.
  45. 45.
    S. C. Shapiro and W. J. Rapaport. The SNePS family. In [28], pages 243–275.Google Scholar
  46. 46.
    E. H. Shortliffe, G. O. Barnett, J. Cimino, R. A. Greenes, S. M. Huff, and V. L. Patel. Collaborative medical informatics research using the Internet and the World Wide Web. In J. Cimino, editor, Proc. 1996 AMIA Annual Fall Symposium, pages 125–129, Washington, DC, Oct. 1996.Google Scholar
  47. 47.
    V. Soloviev. An overview of three commercial object-oriented database management systems: ONTOS, ObjectStore, and 02. SIGMOD Record, 21 (1): 93–104, Mar. 1992.MathSciNetCrossRefGoogle Scholar
  48. 48.
    J. F. Sowa. Principles of Semantic Networks, Explorations in the Representation of Knowledge. Morgan Kaufmann Publishers, Inc., San Mateo, CA, 1991.Google Scholar
  49. 49.
    U. S. Department of Health and Human Services, National Institutes of Health, National Library of Medicine. Unified Medical Language System, 1996.Google Scholar
  50. 50.
    United States National Center for Health Statistics, Washington, DC. International Classification of Diseases: Ninth Revision, with Clinical Modifications, 1980.Google Scholar
  51. 51.
    P. Valduriez, S. Khoshafian, and G. Copeland. Implementation techniques of complex objects. In Proc. VLDB ‘86, pages 101–109, Kyoto, Japan, Aug. 1986.Google Scholar
  52. 52.
    Versant. URL: http://www.versant.com.
  53. 53.
    W. A. Woods. What’s in a link: Foundations for semantic networks. In R. J. Brachman and H. J. Levesque, editors, Readings in Knowledge Representation, pages 218–241. Morgan Kaufmann Publishers, Inc., San Mateo, CA. 1985.Google Scholar
  54. 54.
    S. B. Zdonik and D. Maier, editors. Readings in Object-Oriented Database Systems. Morgan Kaufmann Publishers, Inc., San Mateo, CA, 1990.Google Scholar
  55. 55.
    J. Zhang. Application of OODB and SGML techniques in text database: An electronic dictionary system. SIGMOD Record, 24 (1): 3–8, Mar. 1995.Google Scholar

Copyright information

© Springer Science+Business Media New York 1999

Authors and Affiliations

  • Li-Min Liu
    • 1
  • Michael Halper
    • 2
  • James Geller
    • 1
  • Yehoshua Perl
    • 1
  1. 1.CIS Dept.NJITNewarkUSA
  2. 2.Dept. of Mathematics and Computer ScienceKean UniversityUnionUSA

Personalised recommendations