Multipass Algorithms for Mining Association Rules in Text Databases

Holt, John D.; Chung, Soon M.

doi:10.1007/PL00011664

Multipass Algorithms for Mining Association Rules in Text Databases

Regular Paper
Published: May 2001

Volume 3, pages 168–183, (2001)
Cite this article

Knowledge and Information Systems Aims and scope Submit manuscript

John D. Holt¹ &
Soon M. Chung¹

146 Accesses
33 Citations
Explore all metrics

Abstract.

In this paper, we propose two new algorithms for mining association rules between words in text databases. The characteristics of text databases are quite different from those of retail transaction databases, and existing mining algorithms cannot handle text databases efficiently because of the large number of itemsets (i.e., words) that need to be counted. Two well-known mining algorithms, Apriori algorithm and Direct Hashing and Pruning (DHP) algorithm, are evaluated in the context of mining text databases, and are compared with the new proposed algorithms named Multipass-Apriori (M-Apriori) and Multipass-DHP (M-DHP). It has been shown that the proposed algorithms have better performance for large text databases.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Wright State University, Dayton, Ohio, USA, , , , , , US
John D. Holt & Soon M. Chung

Authors

John D. Holt
View author publications
You can also search for this author in PubMed Google Scholar
Soon M. Chung
View author publications
You can also search for this author in PubMed Google Scholar

Additional information

Received 12 November 1999 / Revised 27 September 2000 / Accepted in revised form 25 October 2000

Rights and permissions

Reprints and permissions

About this article

Cite this article

Holt, J., Chung, S. Multipass Algorithms for Mining Association Rules in Text Databases. Knowledge and Information Systems 3, 168–183 (2001). https://doi.org/10.1007/PL00011664

Download citation

Issue Date: May 2001
DOI: https://doi.org/10.1007/PL00011664

Keywords: Association rules; Data mining; Performance analysis; Text database

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Multipass Algorithms for Mining Association Rules in Text Databases

Abstract.

Access this article

Similar content being viewed by others

A Fast Association Rule Mining Algorithm for Corpus

An improved apriori algorithm based on support weight matrix for data mining in transaction database

SS-FIM: Single Scan for Frequent Itemsets Mining in Transactional Databases

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Navigation

Multipass Algorithms for Mining Association Rules in Text Databases

Abstract.

Access this article

Similar content being viewed by others

A Fast Association Rule Mining Algorithm for Corpus

An improved apriori algorithm based on support weight matrix for data mining in transaction database

SS-FIM: Single Scan for Frequent Itemsets Mining in Transactional Databases

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation