Skip to main content
Log in

An efficient framework for real-time tweet classification

  • Original Research
  • Published:
International Journal of Information Technology Aims and scope Submit manuscript

Abstract

Increasing popularity of social networking sites like facebook, twitter, google+ etc. is contributing in fast proliferation of big data. Amongst social Networking sites, twitter is one of the most common source of big data where people from across the world share their views on various topics and subjects. With daily Active user count of 100-million+ users twitter is becoming a rich information source for finding trends and current happenings around the world. Twitter does provide a limited “trends” feature. To make twitter trends more interesting and informative, in this paper we propose a framework that can analyze twitter data and classify tweets on some specific subject to generate trends. We illustrate the use of framework by analyzing the tweets on “Politics” domain as a subject. In order to classify tweets we propose a tweet classification algorithm that efficiently classify the tweets.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

References

  1. Twitter-Wikipedia. https://en.wikipedia.org/wiki/Twitter. Accessed Jan 27 2017

  2. TweetStats. http://www.tweetstats.com/. Accessed Jan 27 2017

  3. Xefer. https://xefer.com/twitter/. Accessed Jan 27 2017

  4. TweetPsych.http://www.tweetpsych.com/. Accessed Jan 27 2017

  5. Twitalyzer. https://twitter.com/twitalyzer. Accessed Jan 27 2017

  6. Sentiment140 http://www.sentiment140.com/. Accessed Jan 27 2017

  7. Tweet Archivist http://www.tweetarchivist.com/. Accessed Jan 27 2017

  8. Twitonomy. http://www.twitonomy.com/. Accessed Jan 27 2017

  9. Twitter Counter. https://twittercounter.com/. Accessed Jan 27 2017

  10. Blei DM, Ng AY, Jordan MI (2003) Latent dirichlet allocation. J Mach Learn Res 3:993–1022

    MATH  Google Scholar 

  11. Mcauliffe JD, Blei DM (2008) Supervised topic models. In: Advances in neural information processing systems, pp. 121–128

  12. Lacoste-Julien S, Sha F, ordan MI (2009) DiscLDA: discriminative learning for dimensionality reduction and classification. In: Advances in neural information processing systems, pp. 897–904

  13. Ramage D, Hall D, Nallapati R, Manning CD (2009) Labeled LDA: a supervised topic model for credit attribution in multi-labeled corpora. In: Proceedings of the 2009 conference on empirical methods in natural language processing: vol 1, Association for Computational Linguistics, pp. 248–256

  14. Zhao WX, Jiang J, Weng J, He J, Lim EP, Yan H, Li X (2011) Comparing twitter and traditional media using topic models. In: European conference on information retrieval, Springer Berlin Heidelberg, pp. 338–349

  15. Lau JH, Collier N, Baldwin T (2012) On-line trend analysis with topic models:\# twitter Trends Detection Topic Model Online. In: COLING, pp. 1519–1534

  16. Sriram B, Fuhry D, Demir E, Ferhatosmanoglu H, Demirbas M (2010) Short text classification in twitter to improve information filtering. In: Proceedings of the 33rd international ACM SIGIR conference on research and development in information retrieval, ACM, pp. 841-842

  17. Banerjee S, Ramanathan K, Gupta A (2007) Clustering short texts using wikipedia. In: Proceedings of the 30th annual international ACM SIGIR conference on research and development in information retrieval, ACM, pp. 787–788

  18. Lin J, Snow R, Morgan W (2011) Smoothing techniques for adaptive online language models: topic tracking in tweet streams. In: Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining, ACM, pp. 422–429

  19. Allan J (ed) (2012) Topic detection and tracking: event-based information organization, vol 12. Springer Science and Business Media, Berlin

    MATH  Google Scholar 

  20. Varga A, Basave AEC, Rowe M, Ciravegna F, He Y (2014) Linked knowledge sources for topic classification of microposts: a semantic graph-based approach. Web Semantics Sci Serv Agents World Wide Web 26:36–57

    Article  Google Scholar 

  21. Kinsella S, Passant A, Breslin JG (2011) Topic classification in social media using metadata from hyperlinked objects. In: European conference on information retrieval, Springer Berlin Heidelberg, pp. 201–206

  22. Lee K, Palsetia D, Narayanan R, Patwary MMA, Agrawal A, Choudhary A (2011) Twitter trending topic classification. In: 2011 IEEE 11th international conference on data mining workshops, IEEE, pp. 251-258

  23. Yerva SR, Miklós Z, Aberer K (2011) What have fruits to do with technology?: the case of orange, blackberry and apple. In: Proceedings of the international conference on web intelligence, mining and semantics, ACM, p. 48

  24. Yamin M, Ammar A. Al Makrami (2015) Cloud Computing in SMEs: case of Saudi Arabia. BVICAM’s Int J Inf Technol 7(1):853–860

    Google Scholar 

  25. Suneel KS, Guruprasad HS (2016) An approach for server consolidation in a priority based cloud architecture. BVICAM’s Int J Inf Technol 8(1):934–939

    Google Scholar 

  26. Beigh BM (2015) Framework for choosing best intrusion detection system. BIJIT-BVICAM’s Int J Inf Technol 4852:821

    Google Scholar 

  27. Kishore N, Seema S (2016) Secured data migration from enterprise to cloud storage–analytical survey. BVICAM’s Int J Inf Technol 8(1):965–968

    Google Scholar 

  28. Rana V (2013) Innovative use of cloud computing in smart phone technology. BVICAM’s Int J Inf Technol 5(2):640

    MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Imran Khan.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Khan, I., Naqvi, S.K., Alam, M. et al. An efficient framework for real-time tweet classification. Int. j. inf. tecnol. 9, 215–221 (2017). https://doi.org/10.1007/s41870-017-0015-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s41870-017-0015-x

Keywords

Navigation