Skip to main content
Log in

On the Capacity of Families of Characteristic Functions That Ensure Diagnostic Problems Are Solved Correctly

  • Published:
Scientific and Technical Information Processing Aims and scope

Abstract

The capabilities and tools for quality evaluation of the results of intelligent data analysis (IDA) in problems of a diagnostic type are discussed. The reliability (indisputability) of the empirical dependencies formed during machine learning (interpolation-extrapolation) by precedents is evaluated by means of special logical tools, viz. characteristic functions. To generate characteristic functions using the available sample of precedents, the similarity of precedent descriptions, formalized as a binary algebraic operations, is analyzed. A method for evaluating the representativeness of the initial training sample is proposed to ensure that diagnostic problems are correctly solved. This method leverages a procedural scheme that reconstructs what causes diagnosable effects to arise and is implemented by the IDA tools in hand. The complexity of the computations associated with the application of the proposed mathematical toolset based on characteristic functions is estimated.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Notes

  1. Given only the precedents listed in the respective sample.

  2. Operating with meaningful concepts of the subject domain involved.

  3. That is, satisfying the BCE condition, see above.

  4. Taking into account the respective closures—see above.

  5. Which allows an expert in the respective subject domain (because it is this expert rather than a computer system who makes the final decision) to match the results of the performed computer data analysis with his already accumulated professional experience and meaningful ideas on the nature of the effects being diagnosed.

REFERENCES

  1. Zhuravlev, Y.I., Correct algebras over sets of incorrect (heuristic) algorithms, Kibernetika, 1977, no. 6, pp. 21–27.

  2. Vapnik, V.N., Statistical Learning Theory, Willey, 1998.

    MATH  Google Scholar 

  3. Vorontsov, K.V., Combinatorial theory of learning by precedents, Doctoral Dissertation, Moscow: Computational Center of the Russian Academy of Sciences, 2010. www.dissercat.com/content/kombinatornaya-teoriya-nadezhnosti-obucheniya-po-pretsedentam.

    Google Scholar 

  4. Vinogradov, D.V., Probabilistic-combinatorial formal method of learning based on lattice theory, Doctoral Dissertation, Moscow: Federal Research Center Computer Science and Control, Russian Academy of Science, 2018. http://www.frccsc.ru/disscouncil/00207305/disflist/vinogradov_dv.

    Google Scholar 

  5. Principles of Data Mining and Knowledge Discovery: First European Symposium, PKDD ’97, Trondheim, Norway, June 24–27, 1997 Proceedings, Komorowski, J. and Zytkow, J., Eds., Lecture Notes in Computer Science, vol. 1263, Berlin: Springer, 1997.  https://doi.org/10.1007/3-540-63223-9

  6. Oresky, C.M., Lenat, D.B., Clarkson, A., and Kaisler, S., Strategic automatic discovery system (STRADS), Knowledge-Based Simulation, Fishwick, P.A. and Modjeski, R.B., Eds., Advances in Simulation, vol. 4, New York: Springer, 1991, pp. 223–260. https://doi.org/10.1007/978-1-4612-3040-3_11

  7. Machine Learning: An Artificial Intelligence Approach, Michalski, R.S., Carbonell, J.G., and Mitchell, T.M., Eds., Symbolic Computation, Berlin: Springer, 1983. https://doi.org/10.1007/978-3-662-12405-5

    Book  Google Scholar 

  8. Finn, V.K. and Shestemikova, O.P., The heuristics of detection of empirical regularities by JSM reasoning, Autom. Doc. Math. Linguist., 2018, vol. 52, no. 5, pp. 215–247.

    Article  Google Scholar 

  9. Finn, V.K., On the heuristics of JSM research (additions to articles), Autom. Doc. Math. Linguist., 2019, vol. 53, no. 5, pp. 250–282.  https://doi.org/10.3103/S0005105519050078

    Article  Google Scholar 

  10. Zabezhailo, M.I., To the complexity of characteristic function sets providing correct diagnostic solutions, 19 Vserossiskaya konferentsia Matematicheskye metody raspoznavanya obrazov MMRO-2019 (Mathematical Methods in Pattern Recognition (MMPR-2019)), Moscow, 2019.

  11. Zabezhailo, M.I. and Trunin, Yu.Yu., On the problem of medical diagnostic evidence: intelligent analysis of empirical data on patients in samples of limited size, Autom. Doc. Math. Linguist., 2019, vol. 53, no. 6, pp. 322–328.  https://doi.org/10.3103/S0005105519060086

    Article  Google Scholar 

  12. Grusho, A., Zabezhailo, M., and Timonina, E., On causal representativeness of training samples of precedents in diagnostic type tasks, Inf. Ee Prilozh., 2020, vol. 14, no. 1, pp. 80–86.

    Google Scholar 

  13. Cohn, P.M., Universal Algebra, Mathematics and Its Applications, vol. 6, Dordrecht: Springer, 1981.  https://doi.org/10.1007/978-94-009-8399-1

  14. Simon, J., On the difference between one and many, Automata, Languages and Programming. ICALP 1977, Salomaa, A. and Steinby, M., Eds., Lecture Notes in Computer Science, vol. 52, Berlin: Springer, 1977, pp. 480–491.  https://doi.org/10.1007/3-540-08342-1_37

    Book  Google Scholar 

  15. Valiant, L.G., The complexity of enumeration and reliability problems, SIAM J. Comput., 1979, vol. 8, no. 3, pp. 410–421.  https://doi.org/10.1137/0208032

    Article  MathSciNet  MATH  Google Scholar 

  16. Valiant, L.G., The complexity of computing the permanent, Theor. Comput. Sci., 1979, vol. 8, no. 2, pp. 189–201.  https://doi.org/10.1016/0304-3975(79)90044-6

    Article  MathSciNet  MATH  Google Scholar 

  17. Finn, V.K., J.S. Mill’s inductive methods in artificial intelligence systems. Part I, Sci. Tech. Inf. Process., 2011, vol. 38, no. 6, pp. 385–402.  https://doi.org/10.3103/S0147688211060037

    Article  Google Scholar 

  18. Grusho, A.A., Zabezhailo, M.I., Zatsarinny, A.A., and Timonina, E.E., On some possibilities of resource management for organizing active counteraction to computer attacks, Inf. Ee Prilozh., 2018, vol. 12, no. 1, pp. 62–70.

    Google Scholar 

  19. Zabezhailo, M.I. and Trunin, Y.Y., To the evidence of medical diagnosis: Intelligent data analysis of limited size samples of describing patients empirical data, Tsifrovoe zdravookhranenye: XX Kongress informatsionnye tekhnologii v medicine 2020 (Digital Medicine: 20th Congress on Informational Technologies in Medicine 2019), Moscow: Konsef., 2019, pp. 6–9. https://itmcongress.ru/itm2019/proceedings/1.1Zabezhailo_ITM2019.pdf.

    Google Scholar 

  20. Zabezhailo, M.I., Some estimates of computational complexity when predicting the properties of new objects using characteristic functions, Autom. Doc. Math. Linguist., 2020, vol. 54, no. 6, pp. 298–305.  https://doi.org/10.3103/S0005105520060072

    Article  Google Scholar 

  21. Zabezhailo, M.I. and Trunin, Y.Y., On the reliability of medical diagnosis based on empirical data, Sci. Tech. Inf. Process., 2021, vol. 48, no. 5, pp. 415–422.  https://doi.org/10.3103/S0147688221050142

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to M. I. Zabezhailo.

Ethics declarations

The author declares that he has no conflicts of interest.

Additional information

Translated by M. Talacheva

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zabezhailo, M.I. On the Capacity of Families of Characteristic Functions That Ensure Diagnostic Problems Are Solved Correctly. Sci. Tech. Inf. Proc. 49, 385–392 (2022). https://doi.org/10.3103/S0147688222050148

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.3103/S0147688222050148

Keywords:

Navigation