Automatic Classification and PLS-PM Modeling for Profiling Reputation of Corporate Entities on Twitter

Cossu, Jean-Valère; Sanjuan, Eric; Torres-Moreno, Juan-Manuel; El-Bèze, Marc

doi:10.1007/978-3-319-19581-0_25

Jean-Valère Cossu¹⁸,
Eric Sanjuan¹⁸,
Juan-Manuel Torres-Moreno¹⁸ &
…
Marc El-Bèze¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9103))

Included in the following conference series:

International Conference on Applications of Natural Language to Information Systems

1824 Accesses
1 Citations

Abstract

In this paper, we address the task of detecting the reputation alert in social media updates, that is, deciding whether a new-coming content has strong and immediate implications for the reputation of a given entity. This content is also submitted to a standard typology of reputation dimensions that consists in a broad classification of the aspects of an under public audience company. Reputation manager needs a real-time database and method to report what is happening right now to his brand. However, typical Natural Language Processing (NLP) approaches to these tasks require external resources and show non-relational modeling. We propose a fast supervised approach for extracting textual features, which we use to train simple statistical reputation classifiers. These classifiers outputs are used in a Partial Least Squares Path Modeling (PLS-PM) system to model the reputation. Experiments on the RepLab 2013 and 2014 collections show that our approaches perform as well as the state-of-the-art more complex methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
Replab provides a framework to evaluate Online Reputation Management systems on Twitter http://www.limosine-project.eu/events/replab2013.
2.
See [13] http://www.cs.cornell.edu/people/tj/svm_light/svm_multiclass.html.
3.
Reputation Institute’s Reptrak framework http://www.reputationinstitute.com/about-reputation-institute/the-reptrak-framework.
4.
We remove links, stop-words and punctuation marks.
5.
http://cran.r-project.org/web/packages/plspm/.

References

Amigó, E., Carrillo de Albornoz, J., Chugur, I., Corujo, A., Gonzalo, J., Martín, T., Meij, E., de Rijke, M., Spina, D.: Overview of RepLab 2013: evaluating online reputation monitoring systems. In: Forner, P., Müller, H., Paredes, R., Rosso, P., Stein, B. (eds.) CLEF 2013. LNCS, vol. 8138, pp. 333–352. Springer, Heidelberg (2013)
Google Scholar
Amigó, E., Carrillo-de-Albornoz, J., Chugur, I., Corujo, A., Gonzalo, J., Meij, E., de Rijke, M., Spina, D.: Overview of RepLab 2014: author profiling and reputation dimensions for online reputation management. In: Kanoulas, E., Lupu, M., Clough, P., Sanderson, M., Hall, M., Hanbury, A., Toms, E. (eds.) CLEF 2014. LNCS, vol. 8685, pp. 307–322. Springer, Heidelberg (2014)
Google Scholar
Wold, S., Eriksson, L., Trygg, J., Kettaneh, N.: The pls method-partial least squares projections to latent structures-and its applications in industrial rdp (research, development, and production). Unea University (2004)
Google Scholar
Ranganath, R., Jurafsky, D., McFarland, D.: It’s not you, it’s me: detecting flirting and its misperception in speed-dates. In: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, vol. 1, pp. 334–342. Association for Computational Linguistics (2009)
Google Scholar
O’Connor, B., Krieger, M., Ahn, D.: Tweetmotif: exploratory search and topic summarization for twitter. In: ICWSM (2010)
Google Scholar
Berrocal, J.L.A., Figuerola, C.G., Rodríguez, Á.Z.: Reina at replab2013 topic detection task: community detection. In: CLEF 2013 Eval. Labs and Workshop Online Working Notes (2013)
Google Scholar
Sánchez-Sánchez, C., Jiménez-Salazar, H., Luna-Ramirez, W.: Uamclyr at replab2013: monitoring task. In: CLEF 2013 Eval. Labs and Workshop Online Working Notes (2013)
Google Scholar
Cossu, J.V., Bigot, B., Bonnefoy, L., Senay, G.: Towards the improvement of topic priority assignment using various topic detection methods for e-reputation monitoring on twitter. In: Métais, E., Roche, M., Teisseire, M. (eds.) NLDB 2014. LNCS, vol. 8455, pp. 154–159. Springer, Heidelberg (2014)
Chapter Google Scholar
McDonald, G., Deveaud, R., McCreadie, R., Gollins, T., Macdonald, C., Ounis, I.: University of glasgow terrier team/project abacá at replab 2014: reputation dimensions task (2014)
Google Scholar
Rahimi, A., Sahlgren, M., Kerren, A., Paradis, C.: The stavicta group report for replab 2014 reputation dimensions task. In: CLEF 2014 Evaluation Labs and Workshop-Working Notes Papers (2014)
Google Scholar
Qureshi, M.A., ORiordan, C., Pasi, G.: Exploiting wikipedia for entity name disambiguation in tweets. In: Métais, E., Roche, M., Teisseire, M. (eds.) NLDB 2014. LNCS, vol. 8455, pp. 184–195. Springer, Heidelberg (2014)
Chapter Google Scholar
Vilares, D., Hermo, M., Alonso, M.A., Gómez-Rodrıguez, C., Vilares, J.: Lys at clef replab 2014: Creating the state of the art in author influence ranking and reputation classification on twitter. In: CLEF, pp. 1468–1478 (2014)
Google Scholar
Crammer, K., Singer, Y.: On the algorithmic implementation of multiclass kernel-based vector machines. J. Mach. Learn. Res. 2, 265–292 (2002)
MATH Google Scholar
Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inf. Process. Manag. 24(5), 513–523 (1988)
Article Google Scholar
Robertson, S.: Understanding inverse document frequency: on theoretical arguments for idf. J. Documentation 60(5), 503–520 (2004)
Article Google Scholar
Torres-Moreno, J., El-Beze, M., Bellot, P.: Bechet, opinion detection as a topic classification problem in in textual information access. chap. 9 (2013)
Google Scholar
Henseler, J.: On the convergence of the partial least squares path modeling algorithm. Comput. Statistics 25(1), 107–120 (2010)
Article MATH MathSciNet Google Scholar
Amigó, E., Gonzalo, J., Verdejo, F.: A general evaluation measure for document organization tasks. In: Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 643–652. ACM (2013)
Google Scholar
Cossu, J., Bigot, B., Bonnefoy, L., Morchid, M., Bost, X., Senay, G., Dufour, R., Bouvier, V., Torres-Moreno, J., El-Beze, M.: Lia@ replab 2013. In: CLEF 2013 Eval. Labs and Workshop Online Working Notes (2013)
Google Scholar

Download references

Acknowledgment

This work is funded by the project ImagiWeb ANR-2012-CORD-002-01.

Author information

Authors and Affiliations

LIA/Université d’Avignon et des Pays de Vaucluse, 39 chemin des Meinajaries, Agroparc BP 91228, 84911, Avignon cedex 9, France
Jean-Valère Cossu, Eric Sanjuan, Juan-Manuel Torres-Moreno & Marc El-Bèze

Authors

Jean-Valère Cossu
View author publications
You can also search for this author in PubMed Google Scholar
Eric Sanjuan
View author publications
You can also search for this author in PubMed Google Scholar
Juan-Manuel Torres-Moreno
View author publications
You can also search for this author in PubMed Google Scholar
Marc El-Bèze
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jean-Valère Cossu .

Editor information

Editors and Affiliations

Technische Universität Darmstadt, Darmstadt, Germany
Chris Biemann
Universität Passau, Passau, Germany
Siegfried Handschuh
Universität Passau, Passau, Germany
André Freitas
University of Salford, Salford, United Kingdom
Farid Meziane
Conservatoire National des Arts et Métiers, Paris, France
Elisabeth Métais

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Cossu, JV., Sanjuan, E., Torres-Moreno, JM., El-Bèze, M. (2015). Automatic Classification and PLS-PM Modeling for Profiling Reputation of Corporate Entities on Twitter. In: Biemann, C., Handschuh, S., Freitas, A., Meziane, F., Métais, E. (eds) Natural Language Processing and Information Systems. NLDB 2015. Lecture Notes in Computer Science(), vol 9103. Springer, Cham. https://doi.org/10.1007/978-3-319-19581-0_25

Download citation

DOI: https://doi.org/10.1007/978-3-319-19581-0_25
Published: 04 June 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-19580-3
Online ISBN: 978-3-319-19581-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics