Skip to main content

Query Languages Supporting Descriptive Rule Mining: A Comparative Study

  • Chapter
Database Support for Data Mining Applications

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2682))

Abstract

Recently, inductive databases (IDBs) have been proposed to tackle the problem of knowledge discovery from huge databases. With an IDB, the user/analyst performs a set of very different operations on data using a query language, powerful enough to support all the required manipulations, such as data preprocessing, pattern discovery and pattern post-processing. We provide a comparison between three query languages (MSQL, DMQL and MINE RULE) that have been proposed for descriptive rule mining and discuss their common features and differences. These query languages look like extensions of SQL. We present them using a set of examples, taken from the real practice of rule mining. In the paper we discuss also OLE DB for Data Mining and Predictive Model Markup Language, two recent proposals that like the first three query languages respectively provide native support to data mining primitives and provide a description in a standard language of statistical and data mining models.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bastide, Y., Pasquier, N., Taouil, R., Stumme, G., Lakhal, L.: Mining minimal non-redundant association rules using frequent closed itemsets. In: Palamidessi, C., Moniz Pereira, L., Lloyd, J.W., Dahl, V., Furbach, U., Kerber, M., Lau, K.-K., Sagiv, Y., Stuckey, P.J. (eds.) CL 2000. LNCS (LNAI), vol. 1861, pp. 972–986. Springer, Heidelberg (2000)

    Chapter  Google Scholar 

  2. Boulicaut, J.-F., Bykowski, A.: Frequent closures as a concise representation for binary data mining. In: Terano, T., Chen, A.L.P. (eds.) PAKDD 2000. LNCS, vol. 1805, pp. 62–73. Springer, Heidelberg (2000)

    Chapter  Google Scholar 

  3. Boulicaut, J.-F., Bykowski, A., Rigotti, C.: Free-sets: a condensed representation of boolean data for the approximation of frequency queries. Data Mining and Knowledge Discovery 7(1), 5–22 (2003)

    Article  MathSciNet  Google Scholar 

  4. Giacometti, A., Laurent, D., Diop, C.T.: Condensed representations for sets of mining queries. In: Proc. KDID 2002, Helsinki (FIN) (2002), An extended version appears in this volume

    Google Scholar 

  5. Imielinski, T., Mannila, H.: A Database Perspective on Knowledge Discovery. Communications of the ACM 3(4), 58–64 (1996)

    Article  Google Scholar 

  6. Imielinski, T., Virmani, A., Abdulghani, A.: DataMine: Application Programming Interface and Query Language for Database Mining. In: Proc. KDD 1996, Portland (USA), pp. 256–261. AAAI Press, Menlo Park (1996)

    Google Scholar 

  7. Imielinski, T., Virmani, A.: MSQL: A Query Language for Database Mining. Data Mining and Knowledge Discovery 3(4), 373–408 (1999)

    Article  Google Scholar 

  8. Jeudy, B., Boulicaut, J.-F.: Optimization of association rule mining queries. Intelligent Data Analysis 6(4), 341–357 (2002)

    MATH  Google Scholar 

  9. Jeudy, B., Boulicaut, J.-F.: Using condensed representations for interactive association rule mining. In: Elomaa, T., Mannila, H., Toivonen, H. (eds.) PKDD 2002. LNCS (LNAI), vol. 2431, pp. 225–236. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  10. Han, J., Fu, Y., Wang, W., Koperski, K., Zaiane, O.: DMQL: A Data Mining Query Language for Relational Databases. In: Proc. of SIGMOD Workshop DMKD 1996, Montreal (Canada), pp. 27–34 (1996)

    Google Scholar 

  11. Han, J., Kamber, M.: Data Mining – Concepts and Techniques. Morgan Kaufmann Publishers, San Francisco (2001)

    MATH  Google Scholar 

  12. Meo, R., Psaila, G., Ceri, S.: A New SQL-like Operator for Mining Association Rules. In: Proc. VLDB 1996, Bombay (India), pp. 122–133. Morgan Kaufmann, San Francisco (1996)

    Google Scholar 

  13. Meo, R., Psaila, G., Ceri, S.: An Extension to SQL for Mining Association Rules. Data Mining and Knowledge Discovery 2(2), 195–224 (1998)

    Article  Google Scholar 

  14. Virmani, A.: Second Generation Data Mining. PhD Thesis, Rutgers University (1998)

    Google Scholar 

  15. Zaki, M.J.: Generating non-redundant association rules. In: Proc. SIGKDD 2000, Boston (USA), pp. 34–43. ACM Press, New York (2000)

    Google Scholar 

  16. Netz, A., Chaudhuri, S., Fayyad, U., Bernhardt, J.: Integrating Data Mining with SQL Databases: OLE DB for Data Mining. In: Proc ICDE 2001, Heidelberg (Germany), pp. 379–387. IEEE Computer Society, Los Alamitos (2001)

    Google Scholar 

  17. OLEDB for Data Mining specifications, Available at: http://www.microsoft.com/data/oledb/dm

  18. Predictive Model Mark-up Language, Available at: http://www.dmg.org/pmml-v2-0.htm

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Botta, M., Boulicaut, JF., Masson, C., Meo, R. (2004). Query Languages Supporting Descriptive Rule Mining: A Comparative Study. In: Meo, R., Lanzi, P.L., Klemettinen, M. (eds) Database Support for Data Mining Applications. Lecture Notes in Computer Science(), vol 2682. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-44497-8_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-44497-8_2

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-22479-2

  • Online ISBN: 978-3-540-44497-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics