Abstract
Recently, there has been an enormous increase in the number of reviews of popular products. Therefore, opinion analysis has become a tedious task for customers when making decisions. As a result, opinion retrieval systems have emerged as an effective tool to analyze and represent customers’ feelings toward offered services. Conventional opinion retrieval systems retrieve and rank products according to both relevance and the overall polarity scores of the opinions. However, customer reviews are usually more detailed, including multiple features with different polarities. Consequently, feature-based opinion retrieval is necessary to extract and analyze each feature separately. Customers’ opinions are usually written with a short and unclear structure and contain many implicit linguistic features that cannot be identified by retrieval systems. As a result, the recall results are negatively affected. Few studies have focused on implicit features, as most examined explicit features. Also, implicit features extraction is a challenging task in some languages like Arabic due to difficulties with morphology. This paper proposes an enhanced retrieval approach based on feature-based opinion mining to enhance retrieval performance. In addition to explicit feature extraction, a metaheuristic optimization method with several similarity measures is utilized to identify implicit features and measure its effect on the retrieval results. The experimental results on Arabic and English datasets revealed the effectiveness of the proposed approach, whereby more features were extracted compared to the explicit feature results. Furthermore, the ranking results were improved by identifying both implicit and explicit features compared to the results obtained by the conditional random field method and association rule mining.
Similar content being viewed by others
References
Abd-Elhamid, L., Elzanfaly, D., Eldin, A.S. (2017). Proceedings of 2016 11th International Conference on Computer Engineering and Systems, ICCES 2016 (pp. 260–265). https://doi.org/10.1109/ICCES.2016.7822011.
Al-Obaidi, A., & Samawi, V. (2016). Proceedings of the World Congress on Engineering and Computer Science I(November). http://www.iaeng.org/publication/WCECS2016/WCECS2016_pp470-475.pdf.
Asghar, M.Z., Khan, A., Zahra, S.R., Ahmad, S., Kundi, F.M. (2017). Cluster Computing. 1–19. https://doi.org/10.1007/s10586-017-1096-9.
Atkinson, J., Salas, G., Figueroa, A. (2015). Information Sciences 299, 20. https://doi.org/10.1016/J.INS.2014.12.021. https://www.sciencedirect.com/science/article/pii/S0020025514011608.
Borase, P.S.D., & Mahale, P.P.P. (2019). International journal of scientific research in computer science. Engineering and Information Technology, 5(1), 45. https://doi.org/10.32628/CSEIT183812.
Boudad, N., Faizi, R., Oulad, R., Thami, H., Chiheb, R. (2017). Ain Shams Engineering Journal. https://doi.org/10.1016/j.asej.2017.04.007.
Chen, J., Sun, L., Peng, Y., Huang, Y. (2015). In Proceedings of the IEEE international conference on data science and advanced analytic, DSAA 2015. Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/DSAA.2015.7344860.
Cilibrasi, R.L., & Vitanyi, P.M. (2007). IEEE Transactions on Knowledge and Data Engineering 19(3), 370. https://doi.org/10.1109/TKDE.2007.48. https://ieeexplore.ieee.org/document/4072748/.
Civicioglu, P., & Besdok, E. (2013). . Artificial Intelligence Review, 39(4), 315. https://doi.org/10.1007/s10462-011-9276-0.
David, D.D., & Olson, L. (2008). Advanced Data Mining Techniques, 1st edn. Berlin: Springer.
Eirinaki, M., Pisal, S., Singh, J. (2012). Journal of Computer and System Sciences 78(4), 1175. https://doi.org/10.1016/J.JCSS.2011.10.007. https://www.sciencedirect.com/science/article/pii/S0022000011001139.
ElSahar, H., & El-Beltagy, S.R. (2015). Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 9042, 23. https://doi.org/10.1007/978-3-319-18117-2_2.
Eldin, S.S., Mohammed, A., Hefny, H., Ahmed, A.S.E. (2019). Journal of King Saud University - Computer and Information Sciences. https://doi.org/10.1016/J.JKSUCI.2019.01.010. https://www.sciencedirect.com/science/article/pii/S1319157818308504.
Gabrilovich, E., & Markovitch, S. (2007). Computing semantic relatedness using Wikipedia-based explicit semantic analysis. https://dl.acm.org/citation.cfm?id=1625535.
Gaillat, T., Stearns, B., Sridhar, G., McDermott, R., Zarrouk, M., Davis, B. (2018). In Proceedings of the First Workshop on Economics and Natural Language Processing (pp. 55–61). https://aclweb.org/anthology/papers/W/W18/W18-3108/.
Gandomi, A.H., Yang, X.S., & Alavi, A.H. (2013). . Engineering with Computers, 29(1), 17. https://doi.org/10.1007/s00366-011-0241-y.
Ganeshbhai, S.Y., & Shah, B.K. (2015). In 2015 IEEE International Advance Computing Conference (IACC) (pp. 919–923): IEEE. https://doi.org/10.1109/IADCC.2015.7154839. http://ieeexplore.ieee.org/document/7154839/.
Ghose, A. (2007). Proceedings of the ninth international conference on Electronic commerce: ACM. https://doi.org/10.1145/1282100.1282158.
Gomaa, W.H. (2013). . International Journal of Computer Applications, 68(13), 13.
Gracia, J., Trillo, R., Espinoza, M., Mena, E. (2006). ICWE ’06: Proceedings of the 6th international conference on Web engineering (pp. 241–248). https://doi.org/10.1145/1145581.1145630.
Hai, Z., Chang, K., & Kim, J. J. (2011). Implicit feature identification via co-occurrence association rule mining. In International Conference on Intelligent Text Processing and Computational Linguistics (pp. 393–404). Berlin, Heidelberg: Springer.
He, B., Macdonald, C., Ounis, I. (2008). Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval (pp. 727–728).
Hu, M., & Liu, B. (2004). Proceedings of the 2004 ACM SIGKDD international conference on Knowledge discovery and data mining - KDD ’04 (pp. 168). https://doi.org/10.1145/1014052.1014073. http://portal.acm.org/citation.cfm?doid=1014052.1014073.
Islam, A., & Inkpen, D. (2008). ACM Transactions on Knowledge Discovery from Data 2(2), 1. https://doi.org/10.1145/1376815.1376819. http://portal.acm.org/citation.cfm?doid=1376815.1376819.
Islam, A., Milios, E.E., Keselj, V. (2012). COLING, International Conference on Computational Linguistics (Posters) (pp. 495).
Jain, A., Jain, A., Chauhan, N., Singh, V., Thakur, N. (2017). . Int. J. Comput. Appl, 164(6), 28.
Jiang, W., Pan, H., Ye, Q. (2014). . Open Cybernetics and Systemics Journal, 8(1), 924. https://doi.org/10.2174/1874110X01408010924.
Karve, S., Shende, V., Hople, S. (2019). In Data Analytics and Learning (pp. 143–154): Springer.
Lazhar, F., & Yamina, T.G. (2016). International Journal of Data Mining, Modelling and Management 8(1), 75. https://doi.org/10.1504/IJDMMM.2016.075966. http://www.inderscience.com/link.php?id=75966.
Lazhar, F. (2018). . International Journal of Business Information Systems, 30(1), 13. https://doi.org/10.1504/ijbis.2019.097042.
Lazhar, F. (2019). . International Journal of Business Information Systems, 30(1), 13.
Li, Y., McLean, D., Bandar, Z., O’Shea, J., Crockett, K. (2006). IEEE Transactions on Knowledge and Data Engineering 18(8), 1138. https://doi.org/10.1109/TKDE.2006.130. http://ieeexplore.ieee.org/document/1644735/.
Li, S.Y.Z., Zhang M., Ma S., Zhou B. (2009). . In Lee, G.G., et al. (Eds.) Information Retrieval Technology. AIRS 2009. Lecture Notes in Computer Science. https://doi.org/10.1007/978-3-642-04769-5_10, Vol. 5839. Berlin: Springer.
Liu, L., Du, W., Wang, H., Song, W. (2017). 2017 IEEE 2nd International Conference on Big Data Analysis. ICBDA 2017, 28(6), 915. https://doi.org/10.1109/ICBDA.2017.8078772.
Maharani, W., Widyantoro, D.H., Khodra, M.L. (2015). Procedia Computer Science 59, 244. https://doi.org/10.1016/J.PROCS.2015.07.545. https://www.sciencedirect.com/science/article/pii/S1877050915020748.
Manning, C.D., Raghavan, P., Schutze, H. (2008). Introduction to information retrieval: Cambridge University Press. https://nlp.stanford.edu/IR-book/information-retrieval-book.html.
Mishne, G. (2006). Online Proceedings of TREC, 1–4.
Mishne, G. (2007). ICWSM (pp. 1–4).
Mogotsi, I., Manning, C.D., Raghavan, P., Schütze, H. (2010). Introduction to information retrieval.
Mohammad, S.M., Salameh, M., Kiritchenko, S. (2016). Tenth international conference on language resources and evaluation, LREC 2016 (pp. 33–37).
Na, L.J.S.H., Lee, Y., Nam, S.H. (2009). In Advances in Information Retrieval. ECIR 2009. Lecture Notes in Computer Science (Vol. 5478). Berlin: Springer. https://doi.org/10.1007/978-3-642-00958-7_76.
Ounis, I., Rijke, M.D., Macdonald, C., Mishne, G., Soboroff, I. (2006). .. In TREC, (Vol. 7 pp. 31–43).
Padmapriya, A., & Maheswaran, S. (2012). . IOSR Journal of Computer Engineering (IOSRJCE), 1(2), 13.
Paltoglou, G., & Giachanou, A. (2014). .. In Professional Search in the Modern World (pp. 193–214): Springer.
Pang, B., Lee, L., & Vaithyanathan, S. (2002). In Proceedings of the ACL-02 conference on Empirical methods in natural language processing (Vol. 10, pp. 79–86): (Association for Computational Linguistics).
Pang, B., Lee, L., & et al. (2008). . Foundations and Trends®;, in Information Retrieval, 2(1–2), 1.
Pereira, L., Rodrigues, D., Almeida, T., Ramos, C., Souza, A., Yang, X.S., Papa, J. (2014). In Cuckoo Search and Firefly Algorithm (pp. 141–154): Springer.
Popescu, A.M., & Etzioni, O. (2005). Human Language Technology and Empirical Methods in Natural Language Processing Vancouver (pp. 339–346). British Columbia. https://doi.org/10.3115/1220575.1220618.
Popescu, A.M., & Etzioni, O. (2007). In Natural Language Processing and Text Mining (pp. 9–28). London: Springer. https://doi.org/10.1007/978-1-84628-754-1_2.
Poria, S., Cambria, E., Ku, L.W., Gui, C., Gelbukh, A. (2014). Second workshop on natural language processing for social media (SocialNLP) (pp. 28–37).
Qiu, C.C.G., Zhang F., Bu J. (2009). . In Lee, G.G., et al. (Eds.) Information Retrieval Technology. AIRS 2009. Lecture Notes in Computer Science. https://doi.org/10.1007/978-3-642-04769-5_28, Vol. 5839. Berlin: Springer.
Radinsky, K., Agichtein, E., Gabrilovich, E., Markovitch, S. (2011). In Proceedings of the 20th international conference on World wide web - WWW ’11 (pp. 337). New York: ACM Press. https://doi.org/10.1145/1963405.1963455. http://portal.acm.org/citation.cfm?doid=1963405.1963455.
Rubtsova, Y., & Koshelnikov, S. (2015). In International Conference on Knowledge Engineering and the Semantic Web (pp. 158–167). Cham: Springer. https://doi.org/10.1007/978-3-319-24543-0_12.
Schouten, K., & Frasincar, F. (2014). In International Conference on Web Engineering (pp. 130–144): Springer.
Sebastiani, F. (2002). . ACM Computing Surveys (CSUR), 34(1), 1.
Shoukry, A., & Rafea, A. (2012). In The Fourth Workshop on Computational Approaches to Arabic Script-based Languages. https://www.semanticscholar.org/paper/Preprocessing-Egyptian-Dialect-Tweets-for-Sentiment-Shoukry-Rafea/ace965fc64f1471b8f9d5c7cf1881f17ab615a25.
Singh, S.M., & Mishra, N. (2017). Proceedings on 2016 2nd International Conference on Next Generation Computing Technologies, NGCT 2016 (pp. 540). https://doi.org/10.1109/NGCT.2016.7877474.
Song, H., Chu, J., Hu, Y., Liu, X. (2013). Proceedings - 2013 4th World Congress on Software Engineering, WCSE 2013 (pp. 331–335). https://doi.org/10.1109/WCSE.2013.62.
Street, S.M., & Liu, B. (2010). Proceedings of the 23rd international conference on computational linguistics: Posters. Association for Computational Linguistics, 17, 1462. http://dl.acm.org/citation.cfm?id=1944733.
Su, Q., Xiang, K., Wang, H., Sun, B., On, S.Y.I.C.U. (2006). 21st International Conference, ICCPOL 2006: Singapore. https://doi.org/10.1007/11940098_3.
Sun, L., Li, S., Li, J., Lv, J. (2014). DSAA 2014 - Proceedings Of the 2014 IEEE International Conference on Data Science and Advanced Analytics (pp 420–424). https://doi.org/10.1109/DSAA.2014.7058106.
Tubishat, M., Idris, N., Abushariah, M.A. (2018). Information Processing and Management 54(4), 545. https://doi.org/10.1016/j.ipm.2018.03.008.
Vijaymeena, M., & Kavitha, K. (2016). . Machine Learning and Applications: An International Journal, 3(2), 19.
Wang, W., Xu, H., Wan, W. (2013). . Expert Systems with Applications, 40(9), 3518. https://doi.org/10.1016/j.eswa.2012.12.060.
Wang, J., Zhou, B., Zhou, S. (2016). Computational Intelligence and Neuroscience 2016, 1. https://doi.org/10.1155/2016/2959370. http://www.hindawi.com/journals/cin/2016/2959370/.
Weeds, J., Weir, D., McCarthy, D. (2004). In Proceedings of the 20th international conference on Computational Linguistics. Association for Computational Linguistics, pp. 1015–1021. https://aclweb.org/anthology/papers/C/C04/C04-1146/.
Wei, H., Xin, C., Haibo, W. (2010). 2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery (Fskd) (pp. 2489). https://doi.org/10.1109/FSKD.2010.5569507. http://ieeexplore.ieee.org/document/5569507/.
Wilson, T., Hoffmann, P., Somasundaran, S., Kessler, J. (2005). proceedings of HLT/EMNLP 2005 interactive demonstrations. https://doi.org/10.3115/1225733.1225751.
Xu, H., Zhang, F., & Wang, W. (2015). . Knowledge-Based Systems, 76, 166. https://doi.org/10.1016/j.knosys.2014.12.012.
Yang, X.S., & Deb, S. (2009). In World Congress on Nature & Biologically Inspired Computing (NaBIC) (pp. 210–214): IEEE.
Yu, C., & Meng, W. (2007). Proceedings of the sixteenth ACM conference on Conference on information and knowledge management (pp. 831–840): ACM. https://doi.org/10.1145/1321440.1321555.
Zeng, L., & Li, F. (2013). Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 8202 LNAI (pp. 190). https://doi.org/10.1007/978-3-642-41491-6_18.
Zhang, W., Jia, L., Yu, C., Meng, W. (2008). .. In Proceeding of the 17th ACM conference on Information and knowledge mining - CIKM ’08. https://doi.org/10.1145/1458082.1458309. http://portal.acm.org/citation.cfm?doid=1458082.1458309 (p. 1415). New York: ACM Press.
Zhang, Y., & Zhu, W. (2013). In Proceedings of the 22nd International Conference on World Wide Web (pp. 103–104): ACM.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Eldin, S.S., Mohammed, A., Eldin, A.S. et al. An enhanced opinion retrieval approach via implicit feature identification. J Intell Inf Syst 57, 101–126 (2021). https://doi.org/10.1007/s10844-020-00622-9
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10844-020-00622-9