Using a Normalized Score Multi-Label KNN to Classify Multi-label Herbal Formulae

  • Verayuth Lertnattee
  • Sinthop Chomya
  • Chanisara Lueviphan
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8284)

Abstract

The popularity of herbal medicines has greatly increased in worldwide countries over recent years. Herbal formula is a form of traditional medicine where herbs are combined to heal patient to heal faster and more efficiency. Herbal formulae can be divided into categories. Some formulae can be classified as more than one category. The categories are usually based on indications of herbs in formulae. To support experts for classifying a formula to one or more therapeutic categories, the normalized score multi-label k-nearest neighbors (NSML k-NN) algorithm, is proposed for multi-label herbal formulae classification. The k-NN classifiers with several term weight schemes are explored. The normalized scores are calculated. The values of k, strategies to assign categories are investigated to adjust the decision for multi-label herbal formulae. The experiment is done using a mixed data set of herbal formulae collected from the Natural List of Essential Medicine and the list of common household remedies for traditional medicine. Moreover, a set of well-known commercial products are used for evaluating the effectiveness of the proposed method. From the results, the NSML k-NN is an efficient method to classify multi-label herbal formulae.

Keywords

Multi-label document text classification text categorization herbal formula k-NN classifier 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Lovell-Smith, H.D.: In defence of ayurvedic medicine. The New Zealand Medical Journal 119, 1–3 (2006)Google Scholar
  2. 2.
    Aziz, Z., Peng, T.N.: Herbal medicines: prevalence and predictors of use among malaysian adults. Complementary Therapies in Medicine 44, 44–50 (2009)CrossRefGoogle Scholar
  3. 3.
    Roiger, R., Geatz, M.: Data Mining: A Tutorial Based Primer. Addison-Wesley, Boston (2002)Google Scholar
  4. 4.
    Sebastiani, F.: Machine learning in automated text categorization. ACM Computing Surveys 34, 1–47 (2002)CrossRefMathSciNetGoogle Scholar
  5. 5.
    Nigam, K., McCallum, A.K., Thrun, S., Mitchell, T.M.: Text classification from labeled and unlabeled documents using em. Machine Learning 39, 103–134 (2000)CrossRefMATHGoogle Scholar
  6. 6.
    Duwairi, R., Al-Zubaidi, R.: A hierarchical k-nn classifier for textual data. The International Arab Journal of Information Technology 8, 251–259 (2011)Google Scholar
  7. 7.
    Lertnattee, V., Theeramunkong, T.: Effect of term distributions on centroid-based text categorization. Information Sciences 158, 89–115 (2004)CrossRefGoogle Scholar
  8. 8.
    Joachims, T.: Learning to Classify Text using Support Vector Machines. Kluwer Academic Publishers, Dordrecht (2002)CrossRefGoogle Scholar
  9. 9.
    Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Information Processing and Management 24, 513–523 (1988)CrossRefGoogle Scholar
  10. 10.
    Singhal, A., Salton, G., Buckley, C.: Length normalization in degraded text collections. Technical Report TR95-1507 (1995)Google Scholar
  11. 11.
    Tsoumakas, G., Katakis, I.: Multi-label classification: An overview. International Journal Data Warehousing and Mining 3, 1–13 (2007)CrossRefGoogle Scholar
  12. 12.
    Read, J., Pfahringer, B., Holmes, G., Frank, E.: Classifier chains for multi-label classification. Machine Learning 85, 333–359 (2011)CrossRefMathSciNetGoogle Scholar
  13. 13.
    Fujino, A., Isozaki, H., Suzuki, J.: Multi-label text categorization with model combination based on f1-score maximization. In: Proceeding of The 3rd International Joint Conference on Natural Language Processing, pp. 823–828 (2008)Google Scholar
  14. 14.
    Hua, L.: Research on multi-classification and multi-label in text categorization. In: Proceeding of International Conference on Intelligent Human-Machine Systems and Cybernetics, pp. 86–89 (2009)Google Scholar
  15. 15.
    Zhang, M.L., Zhou, Z.H.: ML-KNN: A lazy learning approach to multi-label learning. Pattern Recognition 40, 2038–2048 (2007)CrossRefMATHGoogle Scholar
  16. 16.
    Younes, Z., Abdallah, F., Denœux, T.: An Evidence-Theoretic k-Nearest Neighbor Rule for Multi-label Classification, pp. 297–308 (2009)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2013

Authors and Affiliations

  • Verayuth Lertnattee
    • 1
  • Sinthop Chomya
    • 1
  • Chanisara Lueviphan
    • 1
  1. 1.Faculty of PharmacySilpakorn UniversityMuangThailand

Personalised recommendations