Automatic Classification and PLS-PM Modeling for Profiling Reputation of Corporate Entities on Twitter

  • Jean-Valère Cossu
  • Eric Sanjuan
  • Juan-Manuel Torres-Moreno
  • Marc El-Bèze
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9103)

Abstract

In this paper, we address the task of detecting the reputation alert in social media updates, that is, deciding whether a new-coming content has strong and immediate implications for the reputation of a given entity. This content is also submitted to a standard typology of reputation dimensions that consists in a broad classification of the aspects of an under public audience company. Reputation manager needs a real-time database and method to report what is happening right now to his brand. However, typical Natural Language Processing (NLP) approaches to these tasks require external resources and show non-relational modeling. We propose a fast supervised approach for extracting textual features, which we use to train simple statistical reputation classifiers. These classifiers outputs are used in a Partial Least Squares Path Modeling (PLS-PM) system to model the reputation. Experiments on the RepLab 2013 and 2014 collections show that our approaches perform as well as the state-of-the-art more complex methods.

Notes

Acknowledgment

This work is funded by the project ImagiWeb ANR-2012-CORD-002-01.

References

  1. 1.
    Amigó, E., Carrillo de Albornoz, J., Chugur, I., Corujo, A., Gonzalo, J., Martín, T., Meij, E., de Rijke, M., Spina, D.: Overview of RepLab 2013: evaluating online reputation monitoring systems. In: Forner, P., Müller, H., Paredes, R., Rosso, P., Stein, B. (eds.) CLEF 2013. LNCS, vol. 8138, pp. 333–352. Springer, Heidelberg (2013) Google Scholar
  2. 2.
    Amigó, E., Carrillo-de-Albornoz, J., Chugur, I., Corujo, A., Gonzalo, J., Meij, E., de Rijke, M., Spina, D.: Overview of RepLab 2014: author profiling and reputation dimensions for online reputation management. In: Kanoulas, E., Lupu, M., Clough, P., Sanderson, M., Hall, M., Hanbury, A., Toms, E. (eds.) CLEF 2014. LNCS, vol. 8685, pp. 307–322. Springer, Heidelberg (2014) Google Scholar
  3. 3.
    Wold, S., Eriksson, L., Trygg, J., Kettaneh, N.: The pls method-partial least squares projections to latent structures-and its applications in industrial rdp (research, development, and production). Unea University (2004)Google Scholar
  4. 4.
    Ranganath, R., Jurafsky, D., McFarland, D.: It’s not you, it’s me: detecting flirting and its misperception in speed-dates. In: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, vol. 1, pp. 334–342. Association for Computational Linguistics (2009)Google Scholar
  5. 5.
    O’Connor, B., Krieger, M., Ahn, D.: Tweetmotif: exploratory search and topic summarization for twitter. In: ICWSM (2010)Google Scholar
  6. 6.
    Berrocal, J.L.A., Figuerola, C.G., Rodríguez, Á.Z.: Reina at replab2013 topic detection task: community detection. In: CLEF 2013 Eval. Labs and Workshop Online Working Notes (2013)Google Scholar
  7. 7.
    Sánchez-Sánchez, C., Jiménez-Salazar, H., Luna-Ramirez, W.: Uamclyr at replab2013: monitoring task. In: CLEF 2013 Eval. Labs and Workshop Online Working Notes (2013)Google Scholar
  8. 8.
    Cossu, J.V., Bigot, B., Bonnefoy, L., Senay, G.: Towards the improvement of topic priority assignment using various topic detection methods for e-reputation monitoring on twitter. In: Métais, E., Roche, M., Teisseire, M. (eds.) NLDB 2014. LNCS, vol. 8455, pp. 154–159. Springer, Heidelberg (2014) CrossRefGoogle Scholar
  9. 9.
    McDonald, G., Deveaud, R., McCreadie, R., Gollins, T., Macdonald, C., Ounis, I.: University of glasgow terrier team/project abacá at replab 2014: reputation dimensions task (2014)Google Scholar
  10. 10.
    Rahimi, A., Sahlgren, M., Kerren, A., Paradis, C.: The stavicta group report for replab 2014 reputation dimensions task. In: CLEF 2014 Evaluation Labs and Workshop-Working Notes Papers (2014)Google Scholar
  11. 11.
    Qureshi, M.A., ORiordan, C., Pasi, G.: Exploiting wikipedia for entity name disambiguation in tweets. In: Métais, E., Roche, M., Teisseire, M. (eds.) NLDB 2014. LNCS, vol. 8455, pp. 184–195. Springer, Heidelberg (2014) CrossRefGoogle Scholar
  12. 12.
    Vilares, D., Hermo, M., Alonso, M.A., Gómez-Rodrıguez, C., Vilares, J.: Lys at clef replab 2014: Creating the state of the art in author influence ranking and reputation classification on twitter. In: CLEF, pp. 1468–1478 (2014)Google Scholar
  13. 13.
    Crammer, K., Singer, Y.: On the algorithmic implementation of multiclass kernel-based vector machines. J. Mach. Learn. Res. 2, 265–292 (2002)MATHGoogle Scholar
  14. 14.
    Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inf. Process. Manag. 24(5), 513–523 (1988)CrossRefGoogle Scholar
  15. 15.
    Robertson, S.: Understanding inverse document frequency: on theoretical arguments for idf. J. Documentation 60(5), 503–520 (2004)CrossRefGoogle Scholar
  16. 16.
    Torres-Moreno, J., El-Beze, M., Bellot, P.: Bechet, opinion detection as a topic classification problem in in textual information access. chap. 9 (2013)Google Scholar
  17. 17.
    Henseler, J.: On the convergence of the partial least squares path modeling algorithm. Comput. Statistics 25(1), 107–120 (2010)MATHCrossRefMathSciNetGoogle Scholar
  18. 18.
    Amigó, E., Gonzalo, J., Verdejo, F.: A general evaluation measure for document organization tasks. In: Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 643–652. ACM (2013)Google Scholar
  19. 19.
    Cossu, J., Bigot, B., Bonnefoy, L., Morchid, M., Bost, X., Senay, G., Dufour, R., Bouvier, V., Torres-Moreno, J., El-Beze, M.: Lia@ replab 2013. In: CLEF 2013 Eval. Labs and Workshop Online Working Notes (2013)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2015

Authors and Affiliations

  • Jean-Valère Cossu
    • 1
  • Eric Sanjuan
    • 1
  • Juan-Manuel Torres-Moreno
    • 1
  • Marc El-Bèze
    • 1
  1. 1.LIA/Université d’Avignon et des Pays de VaucluseAvignon cedex 9France

Personalised recommendations