Abstract
We introduce and study information spaces that arise in the problem of calibration of a measuring system in the case when the measurement model is initially unknown. Information extracted from calibration measurements is proposed to be represented by an element of the corresponding information space endowed with a certain algebraic structure. We also consider the possibility of further improving the estimation accuracy by repeatedly measuring an unknown object of study, which leads to yet another information space of a different type. As a result, a processing algorithm is constructed that contains the accumulation of information of two types and the interaction of information flows with the concurrent arrival of calibration and measurements data. The study presents the mathematical basis for a data processing model in which different types of input data are processed as parallel and independent as possible, and finally, the corresponding types of accumulated information interact to produce a complete processing result.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Rao, C.R.: Linear Statistical Inference and its Applications. Wiley, Hoboken (1973)
Seber, G.A.F., Lee, A.J.: Linear Regression Analysis. Wiley-Interscience Inc. (2003)
Tikhonov, A.N., Arsenin, V.Y.: Solutions of Ill-Posed Problems. V. H. Winston & Sons, Washington, D.C.; Wiley, New York (1977)
Pyt’ev, Yu.P.: Reduction problems in experimental investigations. Math. USSR Sb. 48(1), 237–272 (1984)
Golubtsov, P.V.: The concept of information in big data processing. Autom. Doc. Math. Linguist. 52(1), 38–43 (2018). https://doi.org/10.3103/S000510551801003X
Golubtsov, P.V.: The linear estimation problem and information in big-data systems. Autom. Doc. Math. Linguist. 52(2), 73–79 (2018). https://doi.org/10.3103/S0005105518020024
Golubtsov, P.: Scalability and parallelization of sequential processing: big data demands and information algebras. In: Hu, Z., Petoukhov, S., He, M. (eds.) CSDEIS 2019. AISC, vol. 1127, pp. 274–298. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-39216-1_25
Dean, J., Ghemawat, S.: MapReduce: simplified data processing on large clusters. Commun. ACM 51(1), 107–113 (2008)
Ekanayake, J., Pallickara, S., Fox, G.: MapReduce for data intensive scientific analyses. In: Fourth IEEE International Conference on eScience, Indianapolis, IN, pp. 277–284 (2008)
Palit, I., Reddy, C.K.: Scalable and parallel boosting with MapReduce. IEEE Trans. Knowl. Data Eng. 24(10), 1904–1916 (2012)
Golubtsov, P.V., Pyt’ev, Yu.P., Chulichkov, A.I.: Construction of the reduction operator by test measurements. Discrete information processing systems, Ustinov: Udmurt State University, pp. 68–71 (1986). (in Russian)
Bekkerman, R., Bilenko, M., Langford, J.: Scaling Up Machine Learning: Parallel and Distributed Approaches. Cambridge University Press, Cambridge (2012)
Fan, J., Han, F., Liu, H.: Challenges of big data analysis. Natl. Sci. Rev. 1(2), 293–314 (2013)
Farhan, N., Habib, A., Ali, A.: A study and performance comparison of MapReduce and apache spark on twitter data on hadoop cluster. Int. J. Inf. Technol. Comput. Sci. (IJITCS) 10(7), 61–70 (2018)
Roy, C., Rautaray, S.S., Pandey, M.: Big data optimization techniques: a survey. Int. J. Inf. Eng. Electron. Bus. (IJIEEB) 10(4), 41–48 (2018)
Golubtsov, P.: Information spaces for big data problems in fuzzy bayesian decision making. In: Hu, Z., Gavriushin, S., Petoukhov, S., He, M. (eds.) Advances in Intelligent Systems, Computer Science and Digital Economics III, vol. 121, pp. 102–114. Springer, Cham (2022). https://doi.org/10.1007/978-3-030-97057-4_10
Pyt’ev, Yu.P.: Pseudoinverse operators. Properties and applications. Math. USSR Sb. 46(1), 17–50 (1983)
Pyt’ev, Yu.P.: Methods of mathematical modeling of measurement-computer systems. Fizmatlit, in Russian, Moscow (2012)
White, T.: Hadoop: The Definitive Guide. O’Reilly, Sebastopol (2015)
Ryza, S., Laserson, U., Owen, S., Wills, J.: Advanced Analytics with Spark: Patterns for Learning from Data at Scale. O’Reilly, Sebastopol (2015)
Acknowledgment
The reported study was supported by RFBR, research project number 19-29-09044.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Golubtsov, P. (2023). Information Spaces and Efficient Information Accumulation in Calibration Problems. In: Hu, Z., Wang, Y., He, M. (eds) Advances in Intelligent Systems, Computer Science and Digital Economics IV. CSDEIS 2022. Lecture Notes on Data Engineering and Communications Technologies, vol 158. Springer, Cham. https://doi.org/10.1007/978-3-031-24475-9_5
Download citation
DOI: https://doi.org/10.1007/978-3-031-24475-9_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-24474-2
Online ISBN: 978-3-031-24475-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)