Skip to main content

Evidential techniques in parallel Database Mining

  • Conference paper
  • First Online:
High-Performance Computing and Networking (HPCN-Europe 1995)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 919))

Included in the following conference series:

Abstract

Realisation of the fact that stored masses of data contain more information than what is obvious has led to a great interest in the field of Database Mining in the last couple of years. While hardware requirements for storage of these masses of data have advanced rapidly with the demand as have software methodologies for storage, manipulation and reporting of the data, little progress has been made in methods for automatically analysing the data and extracting knowledge stored implicitly within the data. This process of “reading between the lines” is called Database Mining (DM).

Clearly, the process of DM is a difficult one. This is due to the fact that methods required to achieve the goal of discovering knowledge are complex and data intensive. In this paper we explain how high performance computing can play a vital role in DM and discuss the implementation of a specific algorithm, STRIP (Strong Rule Induction in Parallel) [ANAN94b, ANAN95] developed by the authors for the discovery of Strong or “almost exact” rules from databases. STRIP is the first algorithm to be implemented within a parallel framework for Database Mining based on Evidence Theory (EDM) [ANAN94a] developed by the authors.

In an earlier paper we discussed the different levels of parallelism within STRIP and demonstrated them using a transputer network [ANAN95]. In this paper we discuss the implementation of STRIP on a cluster of Silicon Graphics Workstations connected using an ATM network.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. S.S. Anand, D.A. Bell, J.G. Hughes. A General Framework for Database Mining Based on Evidential Theory, Internal Report, Sch. of Inf. and Soft. Eng., Univ. of Ulster (Jordanstown), Nov. 94

    Google Scholar 

  2. S.S. Anand, D.A. Bell, J.G. Hughes. Discovery of Strong Rules from Databases in Parallel, Internal Report, Sch. of Inf. and Soft. Eng., Univ. of Ulster (Jordanstown), Oct. 94

    Google Scholar 

  3. S.S. Anand, D.A. Bell, J.G. Hughes. An Empirical Performance Study of the Ingres Search Accelerator for a Large Property Management Database System, Proc. of the 20th VLDB Conference, Sept. 1994.

    Google Scholar 

  4. S.S. Anand, C.M. Shapcott, D.A. Bell, J.G. Hughes. Data Mining in Parallel, World Occam and Transputer User Group Conference, April, 1995.

    Google Scholar 

  5. A. Geist, A. Beguelin, J. Dingarra, W. Jiang, R. Manchek, V. Sunderam. PVM3 User's Guide and Reference Manual.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Bob Hertzberger Giuseppe Serazzi

Rights and permissions

Reprints and permissions

Copyright information

© 1995 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Anand, S.S., Bell, D.A., Hughes, J.G., Shapcott, C.M. (1995). Evidential techniques in parallel Database Mining. In: Hertzberger, B., Serazzi, G. (eds) High-Performance Computing and Networking. HPCN-Europe 1995. Lecture Notes in Computer Science, vol 919. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0046629

Download citation

  • DOI: https://doi.org/10.1007/BFb0046629

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-59393-5

  • Online ISBN: 978-3-540-49242-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics