Abstract
The Internet has more than Bronto bytes of available information. However, not all the information matches the appropriate meaning of the search. In information retrieval (IR), the documents are organized based on the terms and inverse document frequencies. The major issues in the information retrieval systems are vocabulary mismatch, lexical similarity, and performance threshold. This paper analyses the query optimization problem for personalized web searching and it proposes an intelligent method for developing an intellectual, personalized searching scheme by machine-learning algorithms. Here, the optimized features are selected by Deep Belief Network (DBN). For accelerating the search process, inverse filtering (IF) is used because text matching is time-consuming. The similarity between the query and the document is estimated using a Genetic Algorithm (GA)-cosine similarity. Furthermore, a user interest prediction scheme by PLS-ANN (partial least squares-artificial neural network) hybrid model is executed and interconnected with a web personalized search engine. To enhance the personalized search query, the feedback module is significant in this search engine. The PLS-ANN is first time developed for web user IR, which has been not yet to be implemented for the feedback module. The proposed approach is implemented in the Python platform and the performances are evaluated using precision, recall, f-measure, and accuracy. Then, the proposed scheme performance is compared with previous machine learning methods and several classical methods. The implementation results proved that the PLS-ANN achieved better performance than the existing algorithms.
Similar content being viewed by others
Data availability
Data sharing is not applicable to this article as no new data were created or analyzed in this study.
References
Ahamed BB, Ramkumar T (2016) An intelligent web search framework for performing efficient retrieval of data. Comput Electr Eng 56:289–299
Alsmadi MK (2018) Query-sensitive similarity measure for content-based image retrieval using meta-heuristic algorithm. J King Saud Univ Comput Inf Sci 30(3):373–381
Archana AB, Kumar J (2015) Location based semantic information retrieval from web documents using web crawler. Appl Theor Comput Commun Technol (iCATccT) IEEE 370–375.
Chen B, Tsutsui S, Ding Y, Ma F (2017) Understanding the topic of evolution in a scientific domain: an exploratory study for the field of information retrieval. J Informetr 11(4):1175–1189
Gysel C, de Rijke M, Kanoulas E (2018) Neural vector spaces for unsupervised information retrieval. ACM Trans Inf Syst 36(4):1–25
Hassan MM, Alam MGR, Uddin MZ, Huda S, Almogren A, Fortino G (2019) Human emotion recognition using deep belief network architecture. Inf Fus 51:10–18
Ibrahim O, Landa-Silva D (2015) Term frequency with average term occurrences for textual information retrieval. Soft Comput 20(8):3045–3061
Ioannakis G, Koutsoudis A, Pratikakis I, Chamzas C (2018) RETRIEVAL an online performance evaluation tool for information retrieval methods. IEEE Trans Multimed 20(1):119–127
Joby PP (2020) Expedient information retrieval system for web pages using the natural language modeling. J Artif Intell 2(02):100–110
John P, Arockiasamy S, Thangiah P (2018) A personalised user preference and feature based semantic information retrieval system in semantic web search. Int J Grid Util Comput 9(3):256
Khennak I, Drias H (2017) Bat-inspired algorithm based query expansion for medical web information retrieval. J Med Syst 41(2):34
Liang Q, Mendel JM (2000) Interval type-2 fuzzy logic systems: theory and design. IEEE Trans Fuzzy Syst 8(5):535–550
Liu Y, Gao T, Song B, Huang C (2017) Personalized fuzzy text search using interest prediction and word vectorization. arXiv preprint arXiv: 1710.00310.
Mala V, Lobiyal DK (2016) Semantic and keyword based web techniques in information retrieval. In: Computing, communication and automation (ICCCA), IEEE, pp 23–26
Mohammadzadeh A, Kayacan E (2019) A non-singleton type-2 fuzzy neural network with adaptive secondary membership for high dimensional applications. Neurocomputing 338:63–71
Mohammadzadeh A, Kaynak O (2019) A novel general type-2 fuzzy controller for fractional-order multi-agent systems under unknown time-varying topology. J Franklin Inst 356(10):5151–5171
Nagappan VK, Elango P (2015) Agent based weighted page ranking algorithm for Web content information retrieval. In: Computing and communications technologies (ICCCT), IEEE, pp 31–36
Negm E, AbdelRahman S, Bahgat R (2017) PREFCA: a portal retrieval engine based on formal concept analysis. Inf Process Manag 53(1):203–222
O’Brien H, Dickinson R, Askin N (2017) A scoping review of individual differences in information seeking behaviour and retrieval research between 2000 and 2015. Libr Inf Sci Res 39(3):244–254
Posada JD, Barda AJ, Shi L, Xue D, Ruiz V, Kuan P-H, Ryan ND, Tsui FR (2017) Predictive modelling for classification of positive valence system symptom severity from initial psychiatric evaluation records. J Biomed Inform 75:S94–S104
Rakholia RM, Saini JR (2017) Information retrieval for Gujarati language using cosine similarity based vector space model. In: Proceedings of the 5th international conference on frontiers in intelligent computing: theory and applications. Springer, Singapore, pp 1–9
Russell-Rose T, Chamberlain J, Azzopardi L (2018) Information retrieval in the workplace: a comparison of professional search practices. Inf Process Manag 54(6):1042–1057
Selvalakshmi B, Subramaniam M (2018) Intelligent ontology-based semantic information retrieval using feature selection and classification. Clust Comput 21:1–11
Sharma S, Kumar A, Rana V (2017) Ontology based informational retrieval system on the semantic web: semantic Web Mining. In: Next generation computing and information systems (NGCIS), IEEE, pp 35–37
Silvello G, Bordea G, Ferro N, Buitelaar P, Bogers T (2016) Semantic representation and enrichment of information retrieval experimental data. Int J Digit Libr 18(2):145–172
Singh J, Sharan A (2016) A new fuzzy logic-based query expansion model for efficient information retrieval using relevance feedback approach. Neural Comput Appl 28(9):2557–2580
Soille P, Burger A, De Marchi D, Kempeneers P, Rodriguez D, Syrris V, Vasilev V (2018) A versatile data-intensive computing platform for information retrieval from big geospatial data. Future Gener Comput Syst 81:30–40
Song K, Li L, Li S, Tedesco L, Duan H, Li Z, Shi K, Du J, Zhao Y, Shao T (2013) Using partial least squares-artificial neural network for inversion of inland water chlorophyll-a. IEEE Trans Geosci Remote Sens 52(2):1502–1517
Tang Y, Wang H, Guo K, Xiao Y, Chi T (2018) Relevant feedback based accurate and intelligent retrieval on capturing user intention for personalized websites. IEEE Access 6:24239–24248
Vicente-Lopez E, de Campos L, Fernandez-Luna J, Huete J, Tagua-Jimenez A, Tur-Vigil C (2014) An automatic methodology to evaluate personalized information retrieval systems. User Model User Adap Interact 25(1):1–37
Zhou D, Wu X, Zhao W, Lawless S, Liu J (2017) Query expansion with enriched user profiles for personalized search utilizing folksonomy data. IEEE Trans Knowl Data Eng 29(7):1536–1548
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Kinikar, M., Saleena, B. An intelligent personalized web user information retrieval using partial least squares and artificial neural networks. J Ambient Intell Human Comput 14, 6449–6461 (2023). https://doi.org/10.1007/s12652-021-03518-w
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12652-021-03518-w