Journal of Medical Systems, Volume 6, Issue 1, pp 77–88

Data base design for natural language medical data

  • Guy Story
  • Lynette Hirschman


The natural language of clinical reporting exhibits the properties of a sublanguage, having a grammar and vocabulary greatly restricted in comparison to the language as a whole. This paper presents a CODASYL-type (network) data base schema for natural language medical records based on the structures and relationships identified in such material through computerized language processing. While the schema described is special to the medical sublanguage, the methods of analysis and of schema design are applicable to natural language material in other subject areas.


Keywords (machine-generated, not supplied by the authors): Medical Record; Data Base; Natural Language; Subject Area; Schema Design





Copyright information

© Plenum Publishing Corporation 1982

Authors and Affiliations

  • Guy Story (1)
  • Lynette Hirschman (1)
  1. Linguistic String Project, New York University, New York
