Glossary
- Prevalence of c in set \( \mathcal{S} \):
-
Percentage of items in \( \mathcal{S} \) that belong to class c and also known as the “relative frequency” of c or the “prior probability” (or simply “prior”) of c
- Quantification:
-
Estimation of the prevalence of each class c ∈ \( \mathcal{C} \) in a set \( \mathcal{S} \) of unlabeled items (or estimation of the distribution of \( \mathcal{S} \) across the classes in \( \mathcal{C} \)), synonym of “supervised prevalence estimation” and “class prior estimation,” and also previously referred to as “counting.”
- Sentiment classification:
-
A classification task whereby items (e.g., tweets, product reviews, comments, answers to open-ended questions) are classified based on the sentiment they convey (or opinion they express) about a certain entity or topic. It may take the form of binary classification (when the available classes are \( \mathcal{C} \) = {Positive, Negative...
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Barranquero J, Diez J, del Coz JJ (2015) Quantification-oriented learning based on reliable classifiers. Pattern Recogn 48(2):591–604
Bella A, Ferri C, Hernandez-Orallo J, Ramirez-Quintana MJ (2010) Quantification via probability estimators. In: Proceedings of the 11th IEEE international conference on data mining (ICDM 2010), Sydney, pp 737–742
Cover TM, Thomas JA (1991) Elements of information theory. Wiley, New York
Csiszar I, Shields PC (2004) Information theory and statistics: a tutorial. Found Trends Commun Inf Theory 1(4):417–528
Da San Martino G, Gao W, Sebastiani F (2016) Ordinal text quantification. In: Proceedings of the 39th ACM conference on research and development in information retrieval (SIGIR 2016), Pisa, pp 937–940. https://doi.org/10.1145/2911451.2914749
Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc B 39(1):1–38
Dodds PS, Harris KD, Kloumann IM, Bliss CA, Danforth CM (2011) Temporal patterns of happiness and information in a global social network: hedonometrics and Twitter. PLoS One 6(12). https://doi.org/10.1371/journal.pone.0026752
du Plessis MC, Sugiyama M (2012) Semi-supervised learning of class balance under class-prior change by distribution matching. In: Proceedings of the 29th international conference on machine learning (ICML 2012), Edinburgh
Esuli A, Sebastiani F (2010a) Machines that learn how to code open-ended survey data. Int J Mark Res 52(6):775–800
Esuli A, Sebastiani F (2010b) Sentiment quantification. IEEE Intell Syst 25(4):72–75
Esuli A, Sebastiani F (2015) Optimizing text quantifiers for multivariate loss functions. ACM Trans Knowl Discov Data 9(4), Article 27
Feldman R (2013) Techniques and applications for sentiment analysis. Commun ACM 56(4):82–89
Forman, G (2005) Counting positives accurately despite inaccurate classification. In: Proceedings of the 16th European conference on machine learning (ECML), Porto, pp 564–575
Forman G (2008) Quantifying counts and costs via classification. Data Min Knowl Disc 17(2):164–206
Gao W, Sebastiani F (2015) Tweet sentiment: from classification to quantification. In: Proceedings of the 7th international conference on advances in social network analysis and mining (ASONAM 2015), Paris, pp 97–104
Gao W, Sebastiani F (2016) From classification to quantification in tweet sentiment analysis. Soc Netw Anal Min 6(19):1–22
Gonzalez-Castro V, Alaiz-Rodriguez R, Alegre E (2013) Class distribution estimation based on the Hellinger distance. Inf Sci 218:146–164
Hopkins DJ, King G (2010) A method of automated nonparametric content analysis for social science. Am J Polit Sci 54(1):229–247
Joachims T. (2005) A support vector method for multivariate performance measures. In: Proceedings of the 22nd international conference on machine learning (ICML 2005), Bonn, pp 377–384
Kar P, Li S, Narasimhan H, Chawla S, Sebastiani F (2016) Online optimization methods for the quantification problem. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining (KDD 2016), San Francisco, pp 1625–1634. https://doi.org/10.1145/2939672. 2939832
King G, Lu Y (2008) Verbal autopsy methods with multiple causes of death. Stat Sci 23(1):78–91
Liu B (2012) Sentiment analysis and opinion mining. Morgan and Claypool Publishers, San Rafael
Mandel B, Culotta A, Boulahanis J, Stark D, Lewis B, Rodrigue J (2012) A demographic analysis of online sentiment during hurricane Irene. In: Proceedings of the NAACL/HLT workshop on language in social media, Montreal, pp 27–36
Nakov P, Ritter A, Rosenthal S, Sebastiani F, Stoyanov V (2016) SemEval-2016 Task 4: sentiment analysis in Twitter. In: Proceedings of the 10th international workshop on semantic evaluation (SemEval 2016), San Diego, pp 1–18
O’Connor B, Balasubramanyan R, Routledge BR, Smith NA (2010) From tweets to polls: linking text sentiment to public opinion time series. In: Proceedings of the 4th AAAI conference on weblogs and social media (ICWSM 2010), Washington, DC
Pang B, Lee L (2008) Opinion mining and sentiment analysis. Found Trends Inf Retr 2(1/2):1–135
Saerens M, Latinne P, Decaestecker C (2002) Adjusting the outputs of a classifier to new a priori probabilities: a simple procedure. Neural Comput 14(1):21–41
Tang L, Gao H, Liu H (2010) Network quantification despite biased labels. In: Proceedings of the 8th workshop on mining and learning with graphs (MLG 2010), Washington, DC, pp 147–154
Tang D, Wei F, Yang N, Zhou M, Liu T, Qin B (2014) Learning sentiment-specific word embedding for Twitter sentiment classification. In: Proceedings of the 52nd annual meeting of the Association for Computational Linguistics (ACL 2014), Baltimore, pp 1555–1565
Vapnik V (1998) Statistical learning theory. Wiley, New York
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Section Editor information
Rights and permissions
Copyright information
© 2018 Springer Science+Business Media LLC, part of Springer Nature
About this entry
Cite this entry
Sebastiani, F. (2018). Sentiment Quantification of User-Generated Content. In: Alhajj, R., Rokne, J. (eds) Encyclopedia of Social Network Analysis and Mining. Springer, New York, NY. https://doi.org/10.1007/978-1-4939-7131-2_110170
Download citation
DOI: https://doi.org/10.1007/978-1-4939-7131-2_110170
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4939-7130-5
Online ISBN: 978-1-4939-7131-2
eBook Packages: Computer ScienceReference Module Computer Science and Engineering