Database Normalization as a By-product of Minimum Message Length Inference

  • David L. Dowe
  • Nayyar Abbas Zaidi
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6464)

Abstract

Database normalization is a central part of database design in which we re-organise the data stored so as to progressively ensure that as few anomalies occur as possible upon insertions, deletions and/or modifications. Successive normalizations of a database to higher normal forms continue to reduce the potential for such anomalies. We show here that database normalization follows as a consequence (or special case, or by-product) of the Minimum Message Length (MML) principle of machine learning and inductive inference. In other words, someone (previously) oblivious to database normalization but well-versed in MML could examine a database and - using MML considerations alone - normalise it, and even discover the notion of attribute inheritance.

Keywords

Minimum Message Length MML Database Normalization Machine Learning Data Mining Intelligent Databases 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • David L. Dowe
    • 1
  • Nayyar Abbas Zaidi
    • 1
  1. 1.Clayton School of I.T.Monash UniversityClaytonAustralia

Personalised recommendations