Abstract
This chapter concentrates on aspect-based sentiment analysis, a form of opinion mining where algorithms detect sentiments expressed about features of products, services, etc. We especially focus on novel approaches for aspect phrase extraction and classification trained on feature-rich datasets. Here, we present two new datasets, which we gathered from the linguistically rich domain of physician reviews, as other investigations have mainly concentrated on commercial reviews and social media reviews so far. To give readers a better understanding of the underlying datasets, we describe the annotation process and inter-annotator agreement in detail. In our research, we automatically assess implicit mentions or indications of specific aspects. To do this, we propose and utilize neural network models that perform the here-defined aspect phrase extraction and classification task, achieving F1-score values of about 80% and accuracy values of more than 90%. As we apply our models to a comparatively complex domain, we obtain promising results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
http://ratemds.com, last visit was on 2020-05-19.
- 2.
http://jameda.de, last visit was on 2020-05-19.
- 3.
http://pincetas.lt, last visit was on 2020-05-19.
- 4.
http://docfinder.at, last visit was on 2020-05-19.
- 5.
http://medicosearch.ch, last visit was on 2020-05-19.
- 6.
The aspect classes were translated from German: Freundlichkeit, Kompetenz, Zeit genommen, AufkƤrung.
- 7.
The aspect classes were originally in German: Behandlung, Alternativheilmethoden, VertrauensverhƤltnis, Kinderfreundlichkeit, Betreuung/Engagement, Gesamt/Empfehlung.
- 8.
The neural network was inspired by [37].
- 9.
The scheme was as follows: sentence: 1, classes predicted: [1]/ sentence: 2, classes predicted: [2, 5]/ ...; sentence: 7, classes predicted: [1]/ etc.
- 10.
SpatialDropout was used only for the first dataset (fkza) and the biLSTM-CRF with FastText embeddings, while we relied on a regular dropout layer for the other cases. While SpatialDropout performed slightly better in this case, the overall effect was marginal and we thus trusted the normal Dropout, which is more suitable for sequence processing in general. The difference between Dropout and SpatialDropout is that the latter drops whole feature maps instead of single elements [72].
References
Akbik, A., Blythe, D., Vollgraf, R.: Contextual string embeddings for sequence labeling. In: Proceedings of the 27th International Conference on Computational Linguistics, pp. 1638ā1649. ACL, Santa Fe, NM, USA (2018). https://www.aclweb.org/anthology/C18-1139
Apotheke-Adhoc: Von Jameda zur Konkurrenz geschickt. [sent by Jameda to the competitors]. https://www.apotheke-adhoc.de/nachrichten/detail/apothekenpraxis/von-jameda-zur-konkurrenz-geschickt-bewertungsportale/ (2018). Accessed 28 Oct 2019
Baccianella, S., Esuli, A., Sebastiani, F.: SentiWordNet 3.0: an enhanced lexical resource for sentiment analysis and opinion mining. In: Proceedings of the 7th LREC, vol.Ā 10, pp. 2200ā2204. ELRA (2010)
Beltagy, I., Lo, K., Cohan, A.: SCIBERT: a pretrained language model for scientific text. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, pp. 3615ā3620. ACL (2019)
Bidmon, S., Elshiewy, O., Terlutter, R., Boztug, Y.: What patients value in physicians: Analyzing drivers of patient satisfaction using physician-rating website data. J. Med. Internet Res. 22(2), e13830 (2020). https://doi.org/10.2196/13830
Bidmon, S., Elshiewy, O., Terlutter, R., Boztug Y.: What patients really value in physicians and what they take for granted: an analysis of large-scale data from a physician-rating website. J. Med. Internet Res. 22(2), e13830 (2019). https://doi.org/10.2196/13830
Bird, S., Klein, E., Loper, E.: Natural Language Processing with Python, 1st edn. OāReilly Media, Sebastopol (2009)
Blair-Goldensohn, S., Hannan, K., McDonald, R., Neylon, T., Reis, G.A., Reynar, J.: Building a sentiment summarizer for local service reviews. In: Proceedings of the WWW Workshop on NLP Challenges in the Information Explosion Era, vol.Ā 14, pp. 339ā348. ACM (2008)
Bojanowski, P., Grave, E., Joulin, A., Mikolov, T.: Enriching word vectors with subword information. Trans. ACL 5, 135ā146 (2017)
BƤumer, F.S., Grote, N., Kersting, J., Geierhos, M.: Privacy matters: detecting nocuous patient data exposure in online physician reviews. In: Proceedings of the 23rd International Conference on Information and Software Technologies, vol. 756, pp. 77ā89. Springer (2017). https://doi.org/10.1007/978-3-319-67642-5_7
BƤumer, F.S., Kersting, J., KurÅ”elis, V., Geierhos, M.: Rate your physician: findings from a Lithuanian physician rating website. In: Proceedings of the 24th International Conference on Information and Software Technologies, Communications in Computer and Information Science, vol. 920, pp. 43ā58. Springer (2018). https://doi.org/10.1007/978-3-319-99972-2_4
Chinsha, T.C., Shibily, J.: A syntactic approach for aspect based opinion mining. In: Proceedings of the 9th IEEE International Conference on Semantic Computing, pp. 24ā31. IEEE (2015). https://doi.org/10.1109/icosc.2015.7050774
Cieliebak, M., Deriu, J.M., Egger, D., Uzdilli, F.: A Twitter corpus and benchmark resources for German sentiment analysis. In: Proceedings of the 5th International Workshop on Natural Language Processing for Social Media, pp. 45ā51. ACL (2017). https://doi.org/10.18653/v1/W17-1106
Cohen, J.: A coefficient of agreement for nominal scales. Educ. Psychol. Meas. 20(1), 37ā46 (1960)
Cordes, M.: Wie bewerten die anderen? Eine Ć¼bergreifende Analyse von Arztbewertungsportalen in Europa. [What do the others think? An overarching analysis of doctor rating portals in Europe]. Masterās thesis, Paderborn University (2018)
DeĀ Clercq, O., Lefever, E., Jacobs, G., Carpels, T., Hoste, V.: Towards an integrated pipeline for aspect-based sentiment analysis in various domains. In: Proceedings of the 8th ACL Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, pp. 136ā142. ACL (2017). https://doi.org/10.18653/v1/w17-5218
deepset: deepset ā open sourcing German BERT (2019). https://deepset.ai/german-bert. Accessed 28 Nov 2019
Deng, L., Wiebe, J.: MPQA 3.0: an entity/event-level sentiment corpus. In: Proceedings of the 2015 Conference of the North American Chapter of the ACL: Human Language Technologies, pp. 1323ā1328. ACL (2015)
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint (2018)
Dong, L., Wei, F., Tan, C., Tang, D., Zhou, M., Xu, K.: Adaptive recursive neural network for target-dependent twitter sentiment classification. In: Proceedings of the 52nd Annual Meeting of the ACL, pp. 49ā54. ACL (2014)
Ellimoottil, C., Leichtle, S.W., Wright, C.J., Fakhro, A., Arrington, A.K., Chirichella, T.J., Ward, W.H.: Online physician reviews: the good, the bad and the ugly. Bull. Am. Coll. Surg. 98(9), 34ā39 (2013)
Emmert, M., Meier, F., Pisch, F., Sander, U.: Physician choice making and characteristics associated with using physician-rating websites: cross-sectional study. J. Med. Internet Res. 15(8), e187 (2013)
Emmert, M., Sander, U., Esslinger, A.S., Maryschok, M., Schƶffski, O.: Public reporting in Germany: the content of physician rating websites. Methods Inf. Med. 51(2), 112ā120 (2012)
Emmert, M., Sander, U., Pisch, F.: Eight questions about physician-rating websites: a systematic review. J. Med. Internet Res. 15(2), e24 (2013). https://doi.org/10.2196/jmir.2360
ExplosionAI: Spacy (2019). https://spacy.io/. Accessed 06 Nov 2019
ExplosionAI: GitHub - explosion/spacy-transformers/ ā spaCy pipelines for pre-trained BERT, XLNet and GPT-2 (2020). https://github.com/explosion/spacy-transformers. Accessed 20 May 2020
Gao, G.G., McCullough, J.S., Agarwal, R., Jha, A.K.: A changing landscape of physician quality reporting: Analysis of patientsā online ratings of their physicians over a 5-year period. J. Med. Internet Res. 14(1), e38 (2012). https://doi.org/10.2196/jmir.2003
Garcia-Pablos, A., Cuadros, M., Rigau, G.: W2VLDA: almost unsupervised system for aspect based sentiment analysis. Expert Syst. Appl. 91, 127ā137 (2018). https://doi.org/10.1016/j.eswa.2017.08.049
Geierhos, M., BƤumer, F., Schulze, S., StuĆ, V.: āI grade what I get but write what I think.ā inconsistency analysis in patientsā reviews. In: ECIS 2015 Completed Research Papers. AIS (2015). https://doi.org/10.18151/7217324
Hao, H., Zhang, K.: The voice of Chinese health consumers: A text mining approach to web-based physician reviews. J. Med. Internet Res. 18(5), e108 (2016). https://doi.org/10.2196/jmir.4430
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735ā1780 (1997). https://doi.org/10.1162/neco.1997.9.8.1735
Hu, M., Liu, B.: Mining and summarizing customer reviews. In: Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 168ā177. ACM (2004)
Hu, M., Liu, B.: Mining opinion features in customer reviews. In: Proceedings of the 19th National Conference on Artificial Intelligence, pp. 755ā760. AAAI (2004)
Kersting, J., BƤumer, F., Geierhos, M.: In reviews we trust: But should we? experiences with physician review websites. In: Proceedings of the 4th International Conference on Internet of Things, Big Data and Security, pp. 147ā155. SCITEPRESS (2019). https://doi.org/10.5220/0007745401470155
Kersting, J., Geierhos, M.: Aspect phrase extraction in sentiment analysis with deep learning. In: Proceedings of the 12th International Conference on Agents and Artificial Intelligence: Special Session on Natural Language Processing in Artificial Intelligence, pp. 391ā400. SCITEPRESS (2020)
Kersting, J., Geierhos, M.: Neural learning for aspect phrase extraction and classification in sentiment analysis. In: Proceedings of the 33rd International Florida Artificial Intelligence Research Symposium (FLAIRS) Conference. AAAI (2020)
Kim, Y.: Convolutional neural networks for sentence classification. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, pp. 1746ā1751. ACL (2014)
Kitaev, N., Klein, D.: Constituency parsing with a self-attentive encoder. In: Proceedings of the 56th Annual Meeting of the ACL, vol.Ā 1, pp. 2676ā2686. ACL (2018)
Kordzadeh, N.: Investigating bias in the online physician reviews published on healthcare organizationsā websites. Decis. Support Syst. 118, 70ā82 (2019). https://doi.org/10.1016/j.dss.2018.12.007
Krippendorff, K.: Computing Krippendorffās alpha-reliability. Technical report 1-25-2011, University of Pennsylvania (2011). https://repository.upenn.edu/asc_papers/43
Lagu, T., Norton, C.M., Russo, L.M., Priya, A., Goff, S.L., Lindenauer, P.K.: Reporting of patient experience data on health systemsā websites and commercial physician-rating websites: mixed-methods analysis. J. Med. Internet Res. 21(3), e12007 (2019). https://doi.org/10.2196/12007
Landis, J.R., Koch, G.G.: The measurement of observer agreement for categorical data. Biometrics 33(1), 159ā174 (1977). https://doi.org/10.2307/2529310
LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278ā2324 (1998)
Liu, B.: Sentiment analysis and opinion mining. Synth. Lect. Hum. Lang. Technol. 5(1), 1ā167 (2012)
Liu, B., Zhang, L.: A survey of opinion mining and sentiment analysis. In: Aggarwal, C.C., Zhai, C.X. (eds.) Mining Text Data, pp. 415ā463. Springer, Berlin (2012)
Liu, J., Hou, S., Evans, R., Xia, C., Xia, W., Ma, J.: What do patients complain about online: a systematic review and taxonomy framework based on patient centeredness. JMIR 21(8), e14634 (2019). https://doi.org/10.2196/14634
Liu, Y., Ott, M., Goyal, N., Du, J., Joshi, M., Chen, D., Levy, O., Lewis, M., Zettlemoyer, L., Stoyanov, V.: RoBERTa: a robustly optimized BERT pretraining approach. CoRR p, o. S. (2019)
Lƶpez, A., Detz, A., Ratanawongsa, N., Sarkar, U.: What patients say about their doctors online: a qualitative content analysis. J. Gen. Intern. Med. 27(6), 685ā692 (2012). https://doi.org/10.1007/s11606-011-1958-4
McAuley, J., Leskovec, J., Jurafsky, D.: Learning attitudes and attributes from multi-aspect reviews. In: Proceedings of the 12th IEEE International Conference on Data Mining, pp. 1020ā1025. IEEE (2012). http://arxiv.org/pdf/1210.3926v2
Medhat, W., Hassan, A., Korashy, H.: Sentiment analysis algorithms and applications: a survey. Ain Shams Eng. J. 5(4), 1093ā1113 (2014). https://doi.org/10.1016/j.asej.2014.04.011
Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. CoRR pp. 1ā12 (2013)
Mitchell, M., Aguilar, J., Wilson, T., VanĀ Durme, B.: Open domain targeted sentiment. In: Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pp. 1643ā1654. ACL (2013)
Mukherjee, A., Liu, B.: Aspect extraction through semi-supervised modeling. In: Proceedings of the 50th Annual Meeting of the ACL, vol.Ā 1, pp. 339ā348. ACL (2012)
Murphy, G.P., Radadia, K.D., Breyer, B.N.: Online physician reviews: is there a place for them. Risk Manag. Healthc. Policy 12, 85ā89 (2020)
Nguyen, T.H., Shirai, K.: Phrasernn: Phrase recursive neural network for aspect-based sentiment analysis. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 2509ā2514. ACL (2015)
Pang, B., Lee, L.: Opinion mining and sentiment analysis. Found. Trends Inf. Retr. 2(1ā2), 1ā135 (2008). https://doi.org/10.1561/1500000001
Paul, M.J., Wallace, B.C., Dredze, M.: What affects patient (dis) satisfaction? analyzing online doctor ratings with a joint topic-sentiment model. In: Proceedings of the Workshops at the 27th AAAI Conference on Artificial Intelligence. AAAI (2013)
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., Duchesnay, E.: Scikit-learn: machine learning in python. J. Mach. Learn. Res. 12, 2825ā2830 (2011)
Pontiki, M., Galanis, D., Papageorgiou, H., Androutsopoulos, I., Manandhar, S., AL-Smadi, M., Al-Ayyoub, M., Zhao, Y., Qin, B., DeĀ Clercq, O., Hoste, V., Apidianaki, M., Tannier, X., Loukachevitch, N., Kotelnikov, E., Bel, N., JimĆ©nez-Zafra, S.M., EryiÄit, G.: SemEval-2016 task 5: aspect based sentiment analysis. In: Proceedings of the 10th International Workshop on Semantic Evaluation, pp. 19ā30. ACL (2016). http://www.aclweb.org/anthology/S16-1002
Pontiki, M., Galanis, D., Papageorgiou, H., Manandhar, S., Androutsopoulos, I.: SemEval-2015 task 12: aspect based sentiment analysis. In: Proceedings of the 9th International Workshop on Semantic Evaluation, pp. 486ā495. ACL (2015). http://aclweb.org/anthology/S/S15/S15-2082.pdf
Pontiki, M., Galanis, D., Papageorgiou, H., Manandhar, S., Androutsopoulos, I.: SemEval 2016 task 5: aspect based sentiment analysis (ABSA-16) annotation guidelines (2016)
Pontiki, M., Galanis, D., Pavlopoulos, J., Papageorgiou, H., Androutsopoulos, I., Manandhar, S.: SemEval-2014 task 4: aspect based sentiment analysis. In: Proceedings of the 8th International Workshop on Semantic Evaluation, pp. 27ā35. ACL (2014)
Qiu, G., Liu, B., Bu, J., Chen, C.: Opinion word expansion and target extraction through double propagation. Comput. Linguist. 37(1), 9ā27 (2011). https://doi.org/10.1162/coli_a_00034
Remus, R., Quasthoff, U., Heyer, G.: SentiWS - A publicly available German-language resource for sentiment analysis. In: Calzolari, N., Choukri, K., Maegaard, B., Mariani, J., Odijk, J., Piperidis, S., Rosner, M., Tapias, D. (eds.) Proceedings of the International Conference on Language Resources and Evaluation, pp. 1168ā1171. ELRA (2010). http://www.lrec-conf.org/proceedings/lrec2010/summaries/490.html
Ruppenhofer, J., Klinger, R., StruĆ, J.M., Sonntag, J., Wiegand, M.: IGGSA shared tasks on German sentiment analysis (GESTALT). In: Proceedings of the 12th KONVENS, pp. 164ā173 (2014). http://nbn-resolving.de/urn:nbn:de:gbv:hil2-opus-3196
Ruppenhofer, J., StruĆ, J.M., Wiegand, M.: Overview of the IGGSA 2016 shared task on source and target extraction from political speeches. In: Proceedings of the IGGSA 2016 Shared Task on Source and Target Extraction from Political Speeches, pp. 1ā9. Ruhr UniversitƤt Bochum, Bochumer Linguistische Arbeitsberichte (2016)
Saeidi, M., Bouchard, G., Liakata, M., Riedel, S.: SentiHood: Targeted aspect based sentiment analysis dataset for urban neighbourhoods. In: Proceedings of the 26th International Conference on Computational Linguistics: Technical Papers, pp. 1546ā1556. COLING/ACL (2016)
Sharma, R.D., Tripathi, S., Sahu, S.K., Mittal, S., Anand, A.: Predicting online doctor ratings from user reviews using convolutional neural networks. Int. J. Mach. Learn. Comput. 6(2), 149ā154 (2016). https://doi.org/10.18178/ijmlc.2016.6.2.590
Shen, T., Zhou, T., Long, G., Jiang, J., Pan, S., Zhang, C.: Disan: Directional self-attention network for RNN/CNN-free language understanding. In: Proceedings of the 32nd AAAI Conference on Artificial Intelligence. AAAI (2018)
Terlutter, R., Bidmon, S., Rƶttl, J.: Who uses physician-rating websites? Differences in sociodemographic variables, psychographic variables, and health status of users and nonusers of physician-rating websites. J. Med. Internet Res. 16(3), e97 (2014). https://doi.org/10.2196/jmir.3145
Toh, Z., Su, J.: Nlangp at SemEval-2016 task 5: Improving aspect based sentiment analysis using neural network features. In: Proceedings of the 10th International Workshop on Semantic Evaluation, pp. 282ā288. ACL (2016). https://doi.org/10.18653/v1/s16-1045
Tompson, J., Goroshin, R., Jain, A., LeCun, Y., Bregler, C.: Efficient object localization using convolutional networks. In: Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition, pp. 648ā656. IEEE (2015). https://doi.org/10.1109/cvpr.2015.7298664
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, Å., Polosukhin, I.: Attention is all you need. In: Proceedings of the 31st Conference on Neural Information Processing Systems, pp. 5998ā6008. Curran Associates (2017)
Vinodhini, G., Chandrasekaran, R.: Sentiment analysis and opinion mining: a survey. Int. J. Adv. Res. Comput. Sci. Softw. Eng. 2(6), 282ā292 (2012)
Wallace, B.C., Paul, M.J., Sarkar, U., Trikalinos, T.A., Dredze, M.: A large-scale quantitative analysis of latent factors and sentiment in online doctor reviews. J. Am. Med. Inform. Assoc. 21(6), 1098ā1103 (2014). https://doi.org/10.1136/amiajnl-2014-002711
Wojatzki, M., Ruppert, E., Holschneider, S., Zesch, T., Biemann, C.: GermEval 2017: shared task on aspect-based sentiment in social media customer feedback. In: Proceedings of the GermEval 2017 ā Shared Task on Aspect-based Sentiment in Social Media Customer Feedback. Springer (2017)
Young, T., Hazarika, D., Poria, S., Cambria, E.: Recent trends in deep learning based natural language processing. IEEE Comput. Intell. Mag. 13(3), 55ā75 (2018)
Zeithaml, V.: How consumer evaluation processes differ between goods and services. Mark. Serv. 9(1), 186ā190 (1981)
Zeithaml, V.A., Parasuraman, A., Berry, L.L., Berry, L.L.: Delivering Quality Service: Balancing Customer Perceptions and Expectations. Free Press (1990). https://books.google.de/books?id=RWPMYP7-sN8C
Zhang, L., Wang, S., Liu, B.: Deep learning for sentiment analysis: a survey. Wiley Interdiscip. Rev.: Data Min. Knowl. Discov. 8(4), 1ā25 (2018). https://doi.org/10.1002/widm.1253
Zhao, W.X., Jiang, J., Yan, H., Li, X.: Jointly modeling aspects and opinions with a maxent-lda hybrid. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 56ā65. ACL (2010)
Zhou, J., Huang, J.X., Chen, Q., Hu, Q.V., Wang, T., He, L.: Deep learning for aspect-level sentiment classification: survey, vision, and challenges. IEEE Access 7, 78454ā78483 (2019). https://doi.org/10.1109/access.2019.2920075
Acknowledgements
This study is an invited, extended work based on [35]. Another related study is [36], which was written and submitted during the same period as [35]. This work was partially supported by the German Research Foundation (DFG) within the Collaborative Research Centre On-The-Fly Computing (SFB 901). We thank Rieke Roxanne MĆ¼lfarth, Frederik Simon BƤumer and Marvin Cordes for their support with the data collection.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
Ā© 2021 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Switzerland AG
About this chapter
Cite this chapter
Kersting, J., Geierhos, M. (2021). Towards Aspect Extraction and Classification for Opinion Mining with Deep Sequence Networks. In: Loukanova, R. (eds) Natural Language Processing in Artificial IntelligenceāNLPinAI 2020. Studies in Computational Intelligence, vol 939. Springer, Cham. https://doi.org/10.1007/978-3-030-63787-3_6
Download citation
DOI: https://doi.org/10.1007/978-3-030-63787-3_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-63786-6
Online ISBN: 978-3-030-63787-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)