A data model, knowledge base, and natural language processing for sharing a large statistical database
Most existing statistical databases are mere collections of statistical files gathered for specific purposes. Consequently, as they grow in size, users are faced with difficulties in identifying and finding the data they need.
In order to obtain data descriptions independent of specific purposes, this paper proposes an object-oriented data design, which distinguishes between data conceptually obtainable and data actually stored in a database, and specifies relationships among classifications and categories independent of particular data files.
This is followed by a discussion of the representation of knowledge about data and classifications on a knowledge base, giving clear definitions of hierarchies and relationships among statistical data concepts.
Finally, a natural language query system using the knowledge base is demonstrated, which proves the advantage of the proposed statistical data concepts.
Unable to display preview. Download preview PDF.
- [ANSI 75]ANSI/X3/SPARC, "Study Group on Data Base Management Systems: Interim Report," FDT (Bulletin of ACM-SIGMOD), 7(2), 1975.Google Scholar
- [Brackman 83]R.J.Brackman, "What IS-A is and isn't: An Analysis of Taxonomic Links in Semantic Networks," IEEE Computer, Oct. 1983, pp.30–36.Google Scholar
- [Chan 81]P.Chan and A.Shoshani, "SUBJECT: A Directory Driven System for Organizing and Accessing Large Statistical Databases," VLDB, 1981, pp.553–563.Google Scholar
- [Cubitt 83]R.E.Cubitt, "Meta Data: An Experience of its Uses and Management," SSDBM, 1983, pp.167–169.Google Scholar
- [Malmborg 86]E.Malmborg, "On the Semantics of Aggregated Data," SSDBM, 1986, pp.152–158.Google Scholar
- [NLA 86]National Land Agency, Knowledge Management of Land Information, (in Japanese), Publication Bureau of the Ministry of Finance, Japan, 1986.Google Scholar
- [Ozsoyoglu 83]Z.M.Ozsoyoglu and G.Ozsoyoglu, "An Extension of Relational Algebra for Summary Tables," SSDBM, 1983, pp.202–211.Google Scholar
- [Reiter 78]R.Reiter, "On Closed World Data Bases," in H.Gallaire and J.Minker (eds.), Logic and Data Bases, Plenum Press, 1978, pp.55–76.Google Scholar
- [Sato 86]H.Sato, T.Nakano, Y.Fukasawa and R.Hotaka, "Conceptual Schema for a Wide-Scope Statistical Database and Its Applications," SSDBM, 1986, pp.165–172.Google Scholar
- [Sato 88]H.Sato, Design and Development of Statistical Databases: An Application of Data Model and Knowledge Base, (in Japanese), Ohm Co., Japan, 1988, 246 pages.Google Scholar
- [Shoshani 82]A.Shoshani, "Statistical Databases: Characteristics, Problems and some Solutions," VLDB, 1982, pp.208–222.Google Scholar
- [Smith 77]J.M. Smith and D.C.P. Smith, "Database Abstractions: Aggregation and Generalization," TODS, 2(2), June 1977, pp.105–133.Google Scholar