Skip to main content

Sentiment Classification: An Approach for Indian Language Tweets Using Decision Tree

  • Conference paper
  • First Online:
Mining Intelligence and Knowledge Exploration (MIKE 2015)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9468))

Abstract

This paper describes the system we used for Shared Task on Sentiment Analysis in Indian Languages (SAIL) Tweets, at MIKE-2015. Twitter is one of the most popular platform which allows users to share their opinion in the form of tweets. Since it restricts the users with 140 characters, the tweets are actually very short to carry opinions and sentiments to analyze. We take the help of a twitter training dataset in Indian Language (Hindi) and apply data mining approaches for analyzing the sentiments. We used a state-of-the-art Data Mining tool Weka to automatically classify the sentiment of Hindi tweets into positive, negative or neutral.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Bautin, M., Vijayarenu, L., Skiena, S.: International sentiment analysis for news and blogs. In: ICWSM (2008)

    Google Scholar 

  2. Das, A., Bandyopadhyay, S.: Sentiwordnet for indian languages, pp. 56–63. Asian Federation for Natural Language Processing, China (2010)

    Google Scholar 

  3. Das, D., Bandyopadhyay, S.: Word to sentence level emotion tagging for bengali blogs. In: Proceedings of the ACL-IJCNLP 2009 Conference Short Papers, pp. 149–152. Association for Computational Linguistics (2009)

    Google Scholar 

  4. Das, D., Bandyopadhyay, S.: Labeling emotion in bengali blog corpus-a fine grained tagging at sentence level. In: Proceedings of the 8th Workshop on Asian Language Resources, p. 47 (2010)

    Google Scholar 

  5. Das, S., Chen, M.: Yahoo! for amazon: extracting market sentiment from stock message boards. In: Proceedings of the Asia Pacific Finance Association Annual Conference (APFA), vol. 35, p. 43, Bangkok, Thailand (2001)

    Google Scholar 

  6. Ghosh, A., Li, G., Veale, T., Rosso, P., Shutova, E., Reyes, A., Barnden, J.: Semeval-2015 task 11: sentiment analysis of figurative language in twitter. In: International Workshop on Semantic Evaluation (SemEval-2015) (2015)

    Google Scholar 

  7. Liu, B.: Sentiment analysis and subjectivity. In: Handbook of Natural Language Processing, vol. 2, pp. 627–666 (2010)

    Google Scholar 

  8. Pang, B., Lee, L.: Seeing stars: exploiting class relationships for sentiment categorization with respect to rating scales. In: Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, pp. 115–124. Association for Computational Linguistics (2005)

    Google Scholar 

  9. Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of the ACL-2002 Conference on Empirical Methods in Natural Language Processing, vo. 10, pp. 79–86. Association for Computational Linguistics (2002)

    Google Scholar 

  10. Patra, B.G., Das, D., Das, A., Prasath, R.: Shared task on sentiment analysis in indian languages (sail) tweets - an overview. In: Proceeding of the Mining Intelligence and Knowledge Exploration (MIKE-2015) (2015)

    Google Scholar 

  11. Quinlan, J.R.: C4. 5 Programs for Machine Learning. Elsevier, New York (2014)

    Google Scholar 

  12. Sharma, S., Agrawal, J., Sharma, S.: Classification through machine learning technique: C4. 5 algorithm based on various entropies. Int. J. Comput. Appl. 82(16), 28–32 (2013)

    Google Scholar 

  13. Silva, N.F., Hruschka, E.R., Hruschka Jr, E.R.: Biocom usp: tweet sentiment analysis with adaptive boosting ensemble. In: SemEval 2014, p. 123 (2014)

    Google Scholar 

  14. Snyder, B., Barzilay, R.: Multiple aspect ranking using the good grief algorithm. In: HLT-NAACL, pp. 300–307 (2007)

    Google Scholar 

  15. Tong, R.M.: An operational system for detecting and tracking opinions in on-line discussion. In: Working Notes of the ACM SIGIR 2001 Workshop on Operational Text Classification, vol. 1, p. 6 (2001)

    Google Scholar 

  16. Turney, P.D.: Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews. In: Proceedings of the 40th Annual Meeting on Association For Computational Linguistics, pp. 417–424. Association for Computational Linguistics (2002)

    Google Scholar 

  17. Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques. Morgan Kaufmann, San Francisco (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Dinesh Kumar Prabhakar .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Prasad, S.S., Kumar, J., Prabhakar, D.K., Pal, S. (2015). Sentiment Classification: An Approach for Indian Language Tweets Using Decision Tree. In: Prasath, R., Vuppala, A., Kathirvalavakumar, T. (eds) Mining Intelligence and Knowledge Exploration. MIKE 2015. Lecture Notes in Computer Science(), vol 9468. Springer, Cham. https://doi.org/10.1007/978-3-319-26832-3_62

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-26832-3_62

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-26831-6

  • Online ISBN: 978-3-319-26832-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics