Abstract
Topological and dynamic features of complex networks have proven in recent years to be suitable for capturing text characteristics, with various applications in natural language processing. In this article we show that texts with positive and negative opinions can be distinguished from each other when represented as complex networks. The distinction was possible with the use of several metrics, including degree, clustering coefficient, shortest paths, global efficiency, closeness and accessibility. The multidimensional dataset was projected onto a 2-dimensional space using principal component analysis. The distinction was quantified using machine learning algorithms, which achieved a recall of 84.4% in the automatic discrimination of negative opinions, even without any attempt to optimize the pattern recognition process.
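As a rough illustration of how a text might be turned into network features of the kind listed above, the following minimal sketch builds an undirected word-adjacency network and computes three of the named metrics (mean degree, mean clustering coefficient, mean shortest path length). The abstract does not describe the preprocessing or the exact network construction, so the whitespace tokenisation, the undirected unweighted edges, and all function names here are assumptions made for illustration only. Global efficiency, closeness and accessibility are left out, as are the principal component analysis and classification steps that would consume these feature vectors. The sketch assumes a non-empty text.

```python
from collections import deque


def build_network(text):
    """Assumed construction: link each word to the word that follows it."""
    words = text.lower().split()
    graph = {w: set() for w in words}
    for a, b in zip(words, words[1:]):
        if a != b:  # skip self-loops from repeated words
            graph[a].add(b)
            graph[b].add(a)
    return graph


def mean_degree(graph):
    """Average number of neighbours per node."""
    return sum(len(nbrs) for nbrs in graph.values()) / len(graph)


def mean_clustering(graph):
    """Average local clustering coefficient (0 for nodes with < 2 neighbours)."""
    total = 0.0
    for nbrs in graph.values():
        k = len(nbrs)
        if k < 2:
            continue
        # Count each linked pair of neighbours once.
        links = sum(1 for u in nbrs for v in nbrs if u < v and v in graph[u])
        total += 2.0 * links / (k * (k - 1))
    return total / len(graph)


def mean_shortest_path(graph):
    """Mean BFS distance over all reachable ordered pairs of distinct nodes."""
    total, pairs = 0, 0
    for source in graph:
        dist = {source: 0}
        queue = deque([source])
        while queue:
            u = queue.popleft()
            for v in graph[u]:
                if v not in dist:
                    dist[v] = dist[u] + 1
                    queue.append(v)
        total += sum(dist.values())
        pairs += len(dist) - 1
    return total / pairs if pairs else 0.0


def features(text):
    """Feature vector for one text: [degree, clustering, shortest path]."""
    g = build_network(text)
    return [mean_degree(g), mean_clustering(g), mean_shortest_path(g)]
```

Calling `features` on each opinion text would give one row per text; stacking those rows yields the multidimensional dataset that the abstract says was projected with principal component analysis and passed to the classifiers.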
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Amancio, D.R., Fabbri, R., Oliveira, O.N., Nunes, M.G.V., da Fontoura Costa, L. (2011). Opinion Discrimination Using Complex Network Features. In: da F. Costa, L., Evsukoff, A., Mangioni, G., Menezes, R. (eds) Complex Networks. Communications in Computer and Information Science, vol 116. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25501-4_16
DOI: https://doi.org/10.1007/978-3-642-25501-4_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25500-7
Online ISBN: 978-3-642-25501-4
eBook Packages: Computer Science, Computer Science (R0)