Sentiment Quantification of User-Generated Content

Sebastiani, Fabrizio

doi:10.1007/978-1-4939-7131-2_110170

Fabrizio Sebastiani³

54 Accesses
2 Citations

Synonyms

Estimating prevalence of sentiment classes in user-generated content

Glossary

Prevalence of c in set \( \mathcal{S} \):: Percentage of items in \( \mathcal{S} \) that belong to class c and also known as the “relative frequency” of c or the “prior probability” (or simply “prior”) of c
Quantification:: Estimation of the prevalence of each class c ∈ \( \mathcal{C} \) in a set \( \mathcal{S} \) of unlabeled items (or estimation of the distribution of \( \mathcal{S} \) across the classes in \( \mathcal{C} \)), synonym of “supervised prevalence estimation” and “class prior estimation,” and also previously referred to as “counting.”
Sentiment classification:: A classification task whereby items (e.g., tweets, product reviews, comments, answers to open-ended questions) are classified based on the sentiment they convey (or opinion they express) about a certain entity or topic. It may take the form of binary classification (when the available classes are \( \mathcal{C} \) = {Positive, Negative...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 2,500.00; Price excludes VAT (USA)

Hardcover Book: USD 549.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Barranquero J, Diez J, del Coz JJ (2015) Quantification-oriented learning based on reliable classifiers. Pattern Recogn 48(2):591–604
Article MATH Google Scholar
Bella A, Ferri C, Hernandez-Orallo J, Ramirez-Quintana MJ (2010) Quantification via probability estimators. In: Proceedings of the 11th IEEE international conference on data mining (ICDM 2010), Sydney, pp 737–742
Google Scholar
Cover TM, Thomas JA (1991) Elements of information theory. Wiley, New York
Book MATH Google Scholar
Csiszar I, Shields PC (2004) Information theory and statistics: a tutorial. Found Trends Commun Inf Theory 1(4):417–528
Article MATH Google Scholar
Da San Martino G, Gao W, Sebastiani F (2016) Ordinal text quantification. In: Proceedings of the 39th ACM conference on research and development in information retrieval (SIGIR 2016), Pisa, pp 937–940. https://doi.org/10.1145/2911451.2914749
Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc B 39(1):1–38
MathSciNet MATH Google Scholar
Dodds PS, Harris KD, Kloumann IM, Bliss CA, Danforth CM (2011) Temporal patterns of happiness and information in a global social network: hedonometrics and Twitter. PLoS One 6(12). https://doi.org/10.1371/journal.pone.0026752
du Plessis MC, Sugiyama M (2012) Semi-supervised learning of class balance under class-prior change by distribution matching. In: Proceedings of the 29th international conference on machine learning (ICML 2012), Edinburgh
Google Scholar
Esuli A, Sebastiani F (2010a) Machines that learn how to code open-ended survey data. Int J Mark Res 52(6):775–800
Article Google Scholar
Esuli A, Sebastiani F (2010b) Sentiment quantification. IEEE Intell Syst 25(4):72–75
Article Google Scholar
Esuli A, Sebastiani F (2015) Optimizing text quantifiers for multivariate loss functions. ACM Trans Knowl Discov Data 9(4), Article 27
Google Scholar
Feldman R (2013) Techniques and applications for sentiment analysis. Commun ACM 56(4):82–89
Article Google Scholar
Forman, G (2005) Counting positives accurately despite inaccurate classification. In: Proceedings of the 16th European conference on machine learning (ECML), Porto, pp 564–575
Google Scholar
Forman G (2008) Quantifying counts and costs via classification. Data Min Knowl Disc 17(2):164–206
Article MathSciNet Google Scholar
Gao W, Sebastiani F (2015) Tweet sentiment: from classification to quantification. In: Proceedings of the 7th international conference on advances in social network analysis and mining (ASONAM 2015), Paris, pp 97–104
Google Scholar
Gao W, Sebastiani F (2016) From classification to quantification in tweet sentiment analysis. Soc Netw Anal Min 6(19):1–22
Google Scholar
Gonzalez-Castro V, Alaiz-Rodriguez R, Alegre E (2013) Class distribution estimation based on the Hellinger distance. Inf Sci 218:146–164
Article Google Scholar
Hopkins DJ, King G (2010) A method of automated nonparametric content analysis for social science. Am J Polit Sci 54(1):229–247
Article Google Scholar
Joachims T. (2005) A support vector method for multivariate performance measures. In: Proceedings of the 22nd international conference on machine learning (ICML 2005), Bonn, pp 377–384
Google Scholar
Kar P, Li S, Narasimhan H, Chawla S, Sebastiani F (2016) Online optimization methods for the quantification problem. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining (KDD 2016), San Francisco, pp 1625–1634. https://doi.org/10.1145/2939672. 2939832
Google Scholar
King G, Lu Y (2008) Verbal autopsy methods with multiple causes of death. Stat Sci 23(1):78–91
Article MathSciNet MATH Google Scholar
Liu B (2012) Sentiment analysis and opinion mining. Morgan and Claypool Publishers, San Rafael
Google Scholar
Mandel B, Culotta A, Boulahanis J, Stark D, Lewis B, Rodrigue J (2012) A demographic analysis of online sentiment during hurricane Irene. In: Proceedings of the NAACL/HLT workshop on language in social media, Montreal, pp 27–36
Google Scholar
Nakov P, Ritter A, Rosenthal S, Sebastiani F, Stoyanov V (2016) SemEval-2016 Task 4: sentiment analysis in Twitter. In: Proceedings of the 10th international workshop on semantic evaluation (SemEval 2016), San Diego, pp 1–18
Google Scholar
O’Connor B, Balasubramanyan R, Routledge BR, Smith NA (2010) From tweets to polls: linking text sentiment to public opinion time series. In: Proceedings of the 4th AAAI conference on weblogs and social media (ICWSM 2010), Washington, DC
Google Scholar
Pang B, Lee L (2008) Opinion mining and sentiment analysis. Found Trends Inf Retr 2(1/2):1–135
Article Google Scholar
Saerens M, Latinne P, Decaestecker C (2002) Adjusting the outputs of a classifier to new a priori probabilities: a simple procedure. Neural Comput 14(1):21–41
Article MATH Google Scholar
Tang L, Gao H, Liu H (2010) Network quantification despite biased labels. In: Proceedings of the 8th workshop on mining and learning with graphs (MLG 2010), Washington, DC, pp 147–154
Google Scholar
Tang D, Wei F, Yang N, Zhou M, Liu T, Qin B (2014) Learning sentiment-specific word embedding for Twitter sentiment classification. In: Proceedings of the 52nd annual meeting of the Association for Computational Linguistics (ACL 2014), Baltimore, pp 1555–1565
Google Scholar
Vapnik V (1998) Statistical learning theory. Wiley, New York
MATH Google Scholar

Download references

Author information

Authors and Affiliations

Istituto di Scienza e Tecnologie dell’Informazione, Consiglio Nazionale delle Ricerche, Pisa, Italy
Fabrizio Sebastiani

Authors

Fabrizio Sebastiani
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Fabrizio Sebastiani .

Editor information

Editors and Affiliations

Department of Computer Science, University of Calgary, Calgary, AB, Canada
Reda Alhajj
Department of Computer Science, University of Calgary, Calgary, AB, Canada
Jon Rokne

Section Editor information

Department of Computer Science, University of Bari "Aldo Moro", Bari, Italy
Giovanni Semeraro
Bari, Italy
Cataldo Musto

Rights and permissions

Reprints and permissions

Copyright information

About this entry

Cite this entry

Sebastiani, F. (2018). Sentiment Quantification of User-Generated Content. In: Alhajj, R., Rokne, J. (eds) Encyclopedia of Social Network Analysis and Mining. Springer, New York, NY. https://doi.org/10.1007/978-1-4939-7131-2_110170

Download citation

DOI: https://doi.org/10.1007/978-1-4939-7131-2_110170
Published: 12 June 2018
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4939-7130-5
Online ISBN: 978-1-4939-7131-2
eBook Packages: Computer ScienceReference Module Computer Science and Engineering

Publish with us

Policies and ethics