A Bayesian Approach to Classify Conference Papers

  • Kok-Chin Khor
  • Choo-Yee Ting
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4293)


This article presents a methodological approach for classifying educational conference papers using a Bayesian Network (BN). A total of 400 conference papers were collected and categorized into four major topics (Intelligent Tutoring System, Cognition, e-Learning, and Teacher Education). The collected papers were divided using an 80-20 split: 80% of the papers were used for keyword extraction and BN parameter learning, whereas the remaining 20% were used to evaluate predictive accuracy. A feature selection algorithm was applied to automatically extract keywords for each topic. The extracted keywords were then used to construct the BN. The prior probabilities were subsequently learned using the Expectation Maximization (EM) algorithm. The network was validated by human experts and evaluated experimentally to analyze its predictive accuracy. The results demonstrate that the proposed BN outperformed both a Naïve Bayesian Classifier and a BN learned from the training data.
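The evaluation protocol described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it covers only the 80-20 split, a keyword selection step, and a Naïve Bayesian baseline of the kind the proposed BN is compared against. The abstract does not name the feature selection algorithm, so `select_keywords` is a hypothetical stand-in based on document-frequency differences, and the toy documents are invented solely to make the example runnable.

```python
import math
import random
from collections import Counter, defaultdict

def split_80_20(docs, seed=0):
    """Stratified split: 80% of each topic for training, 20% for testing."""
    by_topic = defaultdict(list)
    for text, topic in docs:
        by_topic[topic].append((text, topic))
    rng = random.Random(seed)
    train, test = [], []
    for topic in sorted(by_topic):
        items = by_topic[topic]
        rng.shuffle(items)
        cut = int(round(0.8 * len(items)))
        train += items[:cut]
        test += items[cut:]
    return train, test

def select_keywords(train, per_topic=3):
    """Hypothetical selector (an assumption, not the paper's algorithm):
    keep words whose document frequency inside a topic most exceeds
    their document frequency in the other topics."""
    topics = sorted({t for _, t in train})
    df = {t: Counter() for t in topics}
    n = Counter()
    for text, t in train:
        n[t] += 1
        df[t].update(set(text.lower().split()))
    keywords = set()
    for t in topics:
        others = [u for u in topics if u != t]
        n_other = sum(n[u] for u in others)

        def score(w):
            inside = df[t][w] / n[t]
            outside = sum(df[u][w] for u in others) / max(n_other, 1)
            return inside - outside

        ranked = sorted(df[t], key=lambda w: (-score(w), w))
        keywords.update(ranked[:per_topic])
    return sorted(keywords)

def train_naive_bayes(train, keywords):
    """Bernoulli naive Bayes over keyword presence, with Laplace smoothing."""
    topics = sorted({t for _, t in train})
    n = Counter(t for _, t in train)
    present = {t: Counter() for t in topics}
    for text, t in train:
        words = set(text.lower().split())
        present[t].update(w for w in keywords if w in words)
    prior = {t: n[t] / len(train) for t in topics}
    likelihood = {t: {w: (present[t][w] + 1) / (n[t] + 2) for w in keywords}
                  for t in topics}
    return prior, likelihood

def classify(text, prior, likelihood):
    """Return the topic with the highest posterior log-probability."""
    words = set(text.lower().split())
    best, best_lp = None, -math.inf
    for t in sorted(prior):
        lp = math.log(prior[t])
        for w, p in likelihood[t].items():
            lp += math.log(p if w in words else 1 - p)
        if lp > best_lp:
            best, best_lp = t, lp
    return best

# Invented toy corpus: 5 short "papers" per topic (illustration only).
TOY = {
    "Intelligent Tutoring System": "tutor student model hint",
    "Cognition": "memory cognitive load attention",
    "e-Learning": "online course platform web",
    "Teacher Education": "teacher training classroom practice",
}
docs = [(f"{words} paper{i}", topic)
        for topic, words in TOY.items() for i in range(5)]

train, test = split_80_20(docs)
keywords = select_keywords(train)
prior, likelihood = train_naive_bayes(train, keywords)
accuracy = sum(classify(x, prior, likelihood) == y for x, y in test) / len(test)
print(f"train={len(train)} test={len(test)} accuracy={accuracy:.2f}")
```

The split is stratified per topic so that each of the four topics contributes to both the training and the test portion. A full BN would additionally model dependencies among keywords and topics, which this baseline deliberately ignores.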


Bayesian Network, Predictive Accuracy, Conference Paper, Human Expert, Intelligent Tutor System
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.





Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Kok-Chin Khor (1)
  • Choo-Yee Ting (1)
  1. Faculty of Information Technology, Multimedia University, Cyberjaya, Malaysia
