Skip to main content
Log in

From Semantic to Cognitive Information Search: The Fundamental Principles and Models of Deep Semantic Search

  • INFORMATION SEARCH
  • Published:
Automatic Documentation and Mathematical Linguistics Aims and scope

Abstract

The features of human-machine documentary search focused on information support of cognitive processes are considered. The concepts of meaning and semantic information search are analyzed. The concept of deep semantic search is introduced, considered as an interactive process with search mechanisms on knowledge graph, similar to the mechanics of consciousness/cognition operations. The concept of cognitive information search is introduced, which is considered as the construction of a path of cognition—an interactive iterative and significantly dependent on the previous result formation of a target fact on a chaotic set of found facts. The result of such a search will be (1) the selection of fragments of documents that meet the real information need (and not copies of documents that meet the expressed need, as in traditional documentary retrieval system), and (2) an interactively generated semantic graph—a conceptual image of solving the user’s problem. Mathematical models of deep semantic search on knowledge graphs were given.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Notes

  1. Unlike the construction of a logical conclusion—new knowledge (and, accordingly, an increase in information), the purpose of information search is the selection of information (data, facts, etc.) to form a set of alternative, possibly hypothetical solutions, i.e., expanding the space choice (and, accordingly, a local increase in entropy).

  2. Context in [8] is defined as “the environment of the event, which provides the opportunity (resources) for its adequate interpretation.”

  3. Examples of semantic fields: the field of time, the field of chemistry, the field of names, the field of verbs of motion, the field of prepositions, the field of suffixes, etc., each of which is characterized by a certain differential feature and function.

  4. In general, we can assume that the field is given on the space to which the points M belong if a rule is given according to which a certain value K is assigned to each point from M.

  5. Here it is necessary to distinguish between search and find. Search is an organized purposeful process that involves the preliminary formation of a direct or indirect image of the desired (potentially useful), which is used to compare with the image (features) of an object stored in a certain environment. Find is a non-directed (not organized specifically for this) process, when a potentially useful object enters the field of view of the subject and can be selected by them.

  6. The secondary nature of information search is determined by the fact that it is in the energy-material sphere that those objects and processes exist and are transformed, which will act, among other things, as a need and an object of activity in the life of a subject (it is impossible to live without food, and quite possible—without information about the properties of this food).

  7. We can say that the cognitive state of the subject, formed by goals, possible methods, etc., is the meaning: it arises with thoughts (Russian: smysl—s myslyami) about the goal, ways to achieve it, etc., and it reflects their essence.

  8. The corresponding review of approaches and solutions is quite fully presented in [13].

  9. This paper does not consider the so-called question-answer systems, which have the ability to extract all possible information from an array as consequences, deduced through logic. Such systems are based on the thesis [14]: “The question, through its subject, sets the area of alternatives, and then prefaces the existing list with instructions, in accordance with which it is supposed to construct a direct answer.” However, this path is not trivial, as the choice of instruction is determined not only by the essence of the subject of the question, but also by the intention of the user, and the content of the answer essentially depends on the subject area and the nature of the problem solved by the user. This predetermines that the area of effective application of such systems is extremely limited and can in no way be extended to scientific research.

  10. Such broad possibilities, implemented differently in different systems, lead to the fact that a request expressed in different ways (for example, by rearranging words) and sent to different information resources will produce different results. This is easy to check, for example, by searching Yandex and Google on the topic viburnum for lowering cholesterol. Moreover, since the user cannot know the details of the request processing technology (factors affecting the selection and formation of the issuance, in particular, the mechanics of selection and ranking), they cannot objectively assess the information completeness of the result and determine the ways of the request development.

  11. The adjective semantic here emphasizes that meaning becomes an element of processing, since concepts are used in interrelation.

  12. Cognitiveness is considered here as a metaphor, as an image of a super-task and a prototype that coordinates information objects and synchronizes processes in the overall system cognition–information search, but not as an intelligent system for deriving new knowledge based on the documents found. The term cognitive information search is introduced as a designation of the IRS class, which includes, in addition to deep search, the mechanisms of semantic and contextual search.

  13. Already at the moment when the user receives the issue document (i.e., perceives and understands its meaning), its content is already being used to synthesize a possible solution to a pragmatic problem, i.e., switch to cognitive functions, interrupting the process of information activity.

  14. This is fundamentally different from the ideology of the classical search, built on a query-response scheme, which assumes that the output is formed according to a semantically complete query expression.

  15. Technological functions can also be intelligent: adaptive resource selection, flexible semantic matching criterion, adaptive interfaces, etc.

  16. Including language as a means of communication and knowledge.

  17. It should be noted that superposition punched cards, invented long ago (Taylor H. Selective device: patent 1,165,465. – US, 1915), in combination with long-forgotten KWIC (key word in context index) permutation indicators can be considered as a prototype of such deep search tools. However, it is clear that such a combination of objects on paper is technologically inconvenient and therefore ineffective.

  18. According to the terminology of R.S. Taylor [9].

  19. Maksimov N.V., Golitsyna O.L., Monankov K.V., and Gavrilkina A.S., Documentary information-analytical system xIRBIS (version 6.0): computer program, RF Certificate of state registration no. 2020661683, 2020.

  20. The term seek here refers to non-semantic search engines, i.e., practically by the condition of full or partial coincidence.

  21. In the search image, a fact is a triplet—a pair of entities connected by a typed relation.

  22. Visualization in our case is considered as a search carried out by ranking an array. At the same time, a reduction in search efforts is achieved by reducing the space of perception, as well as ordering and formatting graph elements in acordance with the semantics of the document and the pragmatics of the task.

  23. Examples and illustrations of the operation of deep semantic search mechanisms will be given in the article “Onto-graph mechanisms of deep semantic search.”

  24. A search unit (result) in traditional documentary information search systems is a document inferred by the system in response to a preliminary formulated and preferably semantically complete query. It is important to note that in this case, it is fundamentally impossible to achieve exhaustive completeness and accuracy due to the existence of a system of uncertainties that are fundamentally inherent in information systems, as well as the need to be based on a conceptual and lexical community that ensures a mutual understanding of the user and the system.

  25. A model is understood to be a description that allows one to reproduce the object in one or another specified form and volume.

REFERENCES

  1. Mikhailov, A.M., Chernyi, A.I., and Gilyarevskii, R.S., Osnovy informatiki (Foundations of Informatics), Moscow: Nauka, 1968.

  2. Todd, P.M., Hills, T.T., and Robbins, T.W., Search, goals, and the brain, Cognitive Search: Evolution, Algorithms, and the Brain, Massachusetts: MIT Press, 2012., pp. 125–156.

    Book  Google Scholar 

  3. Taylor, H., Fernandes, B., and Wraight, S., The evolution of complementary cognition: humans cooperatively adapt and evolve through a system of collective cognitive search, Cambridge Archaeol. J., 2022, vol. 32, no. 1, pp. 61–77.  https://doi.org/10.1017/S0959774321000329

    Article  Google Scholar 

  4. Gualtieri, M., The Forrester wave: Cognitive search and knowledge discovery solution. https://www.forrester. com/blogs/17-06-12-cognitive_search_is_the_ai_version_ of_enterprise_search. Cited April 2, 2022.

  5. Zlatev, J., Meaning = Life (+ Culture): An outline of a unified biocultural theory of meaning, Evol. Commun., 2000, vol. 4, no. 2, pp. 253–296.  https://doi.org/10.1075/eoc.4.2.07zla

    Article  Google Scholar 

  6. Kravchenko, A.V., Methodological foundations of cognitive analysis of meaning, Kognitivnyi analiz slova (Cognitive Analysis of Word), Irkutsk: Irkutskaya Gos. Ekon. Akad., 2000.

    Google Scholar 

  7. Filosofskii entsiklopedicheskii slovar’. Sovetskaya entsiklopediya (Philosophical Encyclopedic Dictionary: Soviet Encyclopedia), Il’ichev, L.F., Fedoseev, P.N., Kovalev, S.M., and Panov, V.G., Moscow: Sovetskaya Entsiklopediya, 1983.

  8. Goodwin, C. and Duranti, A., Rethinking context: An introduction, Rethinking Context: Language as an Interactive Phenomenon, Duranti, A. and Goodwin, Ch., Eds., Studies in the Social and Cultural Foundations of Language, vol. 11, Cambridge: Cambridge Univ. Press, 1992, pp. 1–41.

  9. Taylor, R.S., Question-negotiation and information seeking in libraries, College Res. Libr., 1968, vol. 29, no. 3, pp. 178–194.

    Article  Google Scholar 

  10. Salton, G., Dynamic Information and Library Processing, Englewood Cliffs, N.J.: Prentice-Hall, 1975.

    MATH  Google Scholar 

  11. Bast, H., Buchhold, B., and Haussmann, E., Semantic search on text and knowledge bases, Found. Trends Inf. Retrieval, 2016, vol. 10, nos. 2–3, pp. 119–271.  https://doi.org/10.1561/1500000032

    Article  Google Scholar 

  12. Leake, D.B. and Scherle, R., Towards context-based search engine selection, Proc. 6th Int. Conf. on Intelligent User Interfaces, New York: Association for Computing Machinery, 2001, pp. 109–112.  https://doi.org/10.1145/359784.360301

  13. Bennett, P.N., Collins-Thompson, K., Kelly, D., White, R.W., and Zhang, Y., Overview of the special issue on contextual search and recommendation, ACM Trans. Inf. Syst., 2015, vol. 33, no. 1, p. 1e.  https://doi.org/10.1145/2691351

    Article  Google Scholar 

  14. Belnap, N.D., Jr., and Steel, T.B., Jr., The Logic of Questions and Answers, New Haven, Conn.: Yale Univ. Press, 1976.

    MATH  Google Scholar 

  15. Golitsyna, O.L., Maksimov, N.V., Okropishina, O.V., and Strogonov, V.I., The ontological approach to the identification of informationin tasks of document retrieval, Autom. Doc. Math. Linguist., 2012, vol. 46, no. 3, pp. 125–132.  https://doi.org/10.3103/S0005105512030028

    Article  Google Scholar 

  16. Maksimov, N.V., Golitsyna, O.L., Monankov, K.V., and Gavrilkina, A.S., Methods of visual graphoanalytical representation and search of scientific and technical texts, Nauchn. Visualizatsiya, 2021, vol. 13, no. 1, pp. 138–161.

    Google Scholar 

  17. Cognitive search. https://ru.wikipedia.org/wiki/%D0% 9A%D0%BE%D0%B3%D0%BD%D0%B8%D1%82% D0%B8%D0%B2%D0%BD%D1%8B%D0%B9_% D0%BF%D0%BE%D0%B8%D1%81%D0%BA. Cited April 2, 2022.

  18. Golitsyna, O.L. and Maksimov, N.V., Information retrieval models in the context of retrieval tasks, Autom. Doc. Math. Linguist., 2011, vol. 45, no. 1, pp. 20–32.  https://doi.org/10.3103/S0005105511010079

    Article  Google Scholar 

  19. Van Dijk, T.A., et al., On macrostructures, mental models, and other inventions: a brief personal history of the kintsch-van dijk theory, Discourse Comprehension: Essays in Honor of Walter Kintsch, Lawrence Erlbaum Associates, 1995, pp. 383–410.

    Google Scholar 

  20. Van Dijk, T.A. and Kintsch, W., Strategies of Discourse Comprehension, New York: Academic Press, 1983.

    Google Scholar 

  21. Zhang, X., Yang, A., Li, S., and Wang, Y., Machine reading comprehension: a literature review, 2019. ar-Xiv:1907.01686 [cs.CL]

  22. Chen, S., Wang, Y., Liu, J., and Wang, Y., Bidirectional machine reading comprehension for aspect sentiment triplet extraction, Proc. AAAI Conf. Artif. Intell., 2021, vol. 35, no. 14, pp. 12666–12674.

  23. Zheng, Y., Mao, J., Liu, Y., Ye, Z., Zhang, M., and Ma, S., Human behavior inspired machine reading comprehension, SIGIR’19: Proc. 42nd Int. ACM SIGIR Conf. on Research and Development in Information Retrieval, Paris, 2019, New York: Association for Computing Machinery, 2019, pp. 425–434.  https://doi.org/10.1145/3331184.3331231

  24. Hyona, J., Lorch, R.F., and Kaakinen, J.K., Individual differences in reading to summarize expository text: evidence from eye fixation patterns, J. Educ. Psychol., 2002, vol. 94, no. 1, pp. 44–55.  https://doi.org/10.1037/0022-0663.94.1.44

    Article  Google Scholar 

  25. Kintsch, W. and Van Dijk, T.A., Toward a model of text comprehension and production, Psychol. Rev., 1978, vol. 85, no. 5, pp. 363–394.  https://doi.org/10.1037/0033-295X.85.5.363

    Article  Google Scholar 

  26. Shah, P., A model of the cognitive and perceptual processes in graphical display comprehension, AAAI Technical Report FS-97-03, 1997, pp. 94–101.

  27. Peirce, C.S., Existential graphs. http://www.jfsowa.com/peirce/ms514.htm. Cited October 2, 2021.

  28. Golitsina, O.L. and Gavrilkina, A.S., On one approach to the extraction of entity and relationships names in the task of building a semantic search image, Autom. Doc. Math. Linguist., 2021, vol. 55, no. 2, pp. 54–62.  https://doi.org/10.3103/S0005105521020023

    Article  Google Scholar 

  29. Maksimov, N.V. and Lebedev, A.A., Ontological system knowledge–activity, Ontol. Proektirovaniya, 2021, vol. 11, no. 2, pp. 185–211.  https://doi.org/10.18287/2223-9537-2021-11-2-185-211

    Article  Google Scholar 

  30. Chernavskii, D.S., Sinergetika i informatsiya (Synergetics and Information), Moskva: Editorial URSS, 2004.

Download references

ACKNOWLEDGMENTS

The work was supported by the Ministry of Science and Higher Education of the Russian Federation (draft state assignment no. 0723-2020-0036).

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to N. V. Maksimov or O. L. Golitsyna.

Ethics declarations

The authors declare that they have no conflicts of interest.

Additional information

Translated by S. Avodkova

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Maksimov, N.V., Golitsyna, O.L. From Semantic to Cognitive Information Search: The Fundamental Principles and Models of Deep Semantic Search. Autom. Doc. Math. Linguist. 56, 145–159 (2022). https://doi.org/10.3103/S0005105522030074

Download citation

  • Received:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.3103/S0005105522030074

Keywords:

Navigation