Abstract
User reviews of mobile applications convey useful feedback from users, e.g. feature requests, bug descriptions, etc. The increasing number of reviews that users submit daily makes it difficult for developers to manually analyze and classify them into proper review categories. Moreover, several review messages may contain more than one information. In this paper, we propose to use multi-label active learning as a convenient solution to the problem of mobile app user reviews classification. An unlabeled and structured dataset was built from the initially unstructured large set of review messages. Moreover, in order to reduce the effort needed to assign labels to each instance in the large constructed dataset, we opted for an Active Learning approach. Experimental results have shown that, by actively querying an oracle for labels during training a binary relevance-based classifier (with logistic regression as a base classifier), we obtained a classifier that outperformed well-known classifiers in terms of performance without the need to label the whole dataset.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
For sake of simplicity, we will use the short name “app” to refer to a mobile application throughout this paper.
- 2.
- 3.
- 4.
- 5.
- 6.
References
Guzman, E., El-Haliby, M., Bruegge, B.: Ensemble methods for app review classification: an approach for software evolution (N). In: 30th IEEE/ACM International Conference on Automated Software Engineering, pp. 771–776 (2015)
Zhang, L., Huang, X., Jiang, J., Hu, Y.: CSLabel: an approach for labelling mobile app reviews. J. Comput. Sci. Technol. 32(6), 1076–1089 (2017)
Palomba, F., et al.: Crowdsourcing user reviews to support the evolution of mobile apps. J. Syst. Softw. 137, 143–162 (2018)
Bird, S., Klein, E., Loper, E.: Natural Language Processing with Python: Analyzing Text with the Natural Language Toolkit. O’Reilly Media Inc., Sebastopol (2009)
Herrera, F., Charte, F., Rivera, A.J., del Jesus, M.J.: Multilabel Classification - Problem Analysis, Metrics and Techniques. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-41111-8
Settles, B.: Active learning literature survey. University of Wisconsin-Madison, Computer Sciences Technical report 1648 (2009)
Deocadez, R., Harrison, R., Rodríguez, D.: Automatically classifying requirements from app stores: a preliminary study. In: IEEE 25th International Requirements Engineering Conference Workshops, pp. 367–371 (2017)
Maalej, W., Kurtanovic, Z., Nabil, H., Stanik, C.: On the automatic classification of app reviews. Requirements Eng. 21(3), 311–331 (2016)
Vasa, R., Hoon, L., Mouzakis, K., Noguchi, A.: A preliminary analysis of mobile app user reviews. In: Proceedings of the 24th Australian Computer-Human Interaction Conference, pp. 241–244 (2012)
Yang, Y., Lee, S., Chung, Y., Wu, T., Chen, S., Lin, H.: libact: pool-based active learning in python, vol. 6. CoRR (2017)
Hung, C.-W., Lin, H.-T.: Multi-label active learning with auxiliary learner. In: Asian Conference on Machine Learning, pp. 315–332 (2011)
Tai, F., Lin, H.-T.: Multilabel classification with principal label space transformation. Neural Comput. 24(9), 2508–2542 (2012)
Zhang, M.-L., Zhou, Z.: A review on multi-label learning algorithms. IEEE Trans. Knowl. Data Eng. 26(8), 1819–1837 (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Messaoud, M.B., Jenhani, I., Jemaa, N.B., Mkaouer, M.W. (2019). A Multi-label Active Learning Approach for Mobile App User Review Classification. In: Douligeris, C., Karagiannis, D., Apostolou, D. (eds) Knowledge Science, Engineering and Management. KSEM 2019. Lecture Notes in Computer Science(), vol 11775. Springer, Cham. https://doi.org/10.1007/978-3-030-29551-6_71
Download citation
DOI: https://doi.org/10.1007/978-3-030-29551-6_71
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-29550-9
Online ISBN: 978-3-030-29551-6
eBook Packages: Computer ScienceComputer Science (R0)