TweetSemMiner: A Meta-Topic Identification Model for Twitter Using Semantic Analysis

  • Héctor D. Menéndez
  • Carlos Delgado-Calle
  • David Camacho
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8669)


The information contained in Social Networks has become increasingly important over the last few years. Inside this field, Twitter is one of the main current information sources, produced by the comments and contents that their users interchange. This information is usually noisy, however, there are some hidden patterns that can be extracted such as trends, opinions, sentiments, etc. These patterns are useful to generate users communities, which can be focused, for example, on marketing campaigns. Nevertheless, the identification process is usually blind, difficulting this information extaction. Based on this idea, this work pretends to extract relevant data from Twitter. In order to achieve this goal, we have desgined a system, called TweetSemMiner, to classify user comments (or tweets) using general topics (or meta-topics). There are several works devoted to analize social networks, however, only Topic Detection techniques have been applied in this context. This paper provides a new approach to the problem of classification using semantic analysis. The system has been developed focused on the detection of a single meta-topic and uses techniques such as Latent Semantic Analysis (LSA) combined with semantic queries in DBpedia, in order to obtain some results which can be used to analyze the effectiveness of the model. We have tested the model using real users, whose comments were subsequently evaluated to check the effectiveness of this approach.


Twitter tweets meta-topic Topic Detection DBpedia LSA semantic analysis 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Berry, M.: Survey of Text Mining: Clustering, Classification, and Retrieval. Springer (September 2003)Google Scholar
  2. 2.
    Dumais, S.T.: Latent semantic analysis. Annual Review of Information Science and Technology 38(1), 188–230 (2004)CrossRefGoogle Scholar
  3. 3.
    Jung, J.J.: Contextual synchronization for efficient social collaborations in enterprise computing: A case study on tweetpulse. Concurrent Engineering: R&A 21(3), 209–216 (2013)CrossRefGoogle Scholar
  4. 4.
    Lehmann, J., Isele, R., Jakob, M., Jentzsch, A., Kontokostas, D., Mendes, P.N., Hellmann, S., Morsey, M., van Kleef, P., Auer, S., Bizer, C.: DBpedia - a large-scale, multilingual knowledge base extracted from wikipedia. Semantic Web Journal 1, 1–29 (2012)CrossRefGoogle Scholar
  5. 5.
    Lohmann, S., Heim, P., Stegemann, T., Ziegler, J.: The relfinder user interface: Interactive exploration of relationships between objects of interest. In: Proceedings of the 14th International Conference on Intelligent User Interfaces (IUI 2010), pp. 421–422. ACM, New York (2010)Google Scholar
  6. 6.
    Makice, K.: Twitter API: Up and Running: Learn How to Build Applications with the Twitter API, 1st edn. O’Reilly Media, Inc. (April 2009)Google Scholar
  7. 7.
    Miller, G.A.: WordNet: A Lexical Database for English. Commun. ACM 38(11), 39–41 (1995)CrossRefGoogle Scholar
  8. 8.
    Milne, D., Witten, I.H.: An open-source toolkit for mining wikipedia. Artificial Intelligence 194, 222–239 (2013)MathSciNetCrossRefGoogle Scholar
  9. 9.
    Bello, G., Menéndez, H., Okazaki, S., Camacho, D.: Extracting collective trends from twitter using social-based data mining. In: Bǎdicǎ, C., Nguyen, N.T., Brezovan, M. (eds.) ICCCI 2013. LNCS, vol. 8083, pp. 622–630. Springer, Heidelberg (2013)CrossRefGoogle Scholar

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  • Héctor D. Menéndez
    • 1
  • Carlos Delgado-Calle
    • 1
  • David Camacho
    • 1
  1. 1.Departamento de Ingeniería Informática. Escuela Politécnica SuperiorUniversidad Autónoma de MadridMadridSpain

Personalised recommendations