Abstract
The representation of images under the Multiple Instance Learning (MIL) approach tends to generate large bags of instances, particularly when the so-called block decomposition is used as the instance generation strategy. This implies that the computations required to calculate dissimilarities between bags may become extremely expensive, even reaching highly prohibitive execution times. As a solution to this drawback, this paper presents a parallelization strategy of four point set dissimilarity measures for comparing images represented as large bags, allowing to perform a faster MIL-based image classification by accelerating the computation of the pairwise point set distances. The proposed strategy for calculating the point set dissimilarity between two bags distributes the instances of the first bag among the number of available processing units and, concurrently, calculates the selected dissimilarity measure for each partition of the first bag against all the instances of the second bag to be compared. The parallel strategy was experimentally tested for a problem of automatic visual inspection, showing that an acceleration of up to 23 times can be obtained when using 48 processing units on a machine with 24 physical cores.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
In practice, this means that MIL is able to deal with images of different sizes without requiring the application of any previous resize operation. In contrast, convolutional neural networks do require same-sized images.
- 2.
Remember that an operation \(\star \) is associative if \(\forall a,b,c,\ a \star (b \star c) = (a \star b) \star c\).
References
Bier, J.: Is deep learning the solution to all computer vision problems? Blog Post at the Vision Systems Design website. https://tinyurl.com/bdf8yfw2 (2019)
Božič, J., Tabernik, D., Skočaj, D.: End-to-end training of a two-stage neural network for defect detection. In: 2020 25th International Conference on Pattern Recognition (ICPR), pp. 5619–5626 (2021). https://doi.org/10.1109/icpr48806.2021.9412092
Cheplygina, V., Tax, D.M.J., Loog, M.: Multiple instance learning with bag dissimilarities. Pattern Recognit. 48(1), 264–275 (2015). https://doi.org/10.1016/j.patcog.2014.07.022
Javier, S.B., Teresa, A.A., Ignacio, M., Ruben, P.L., Jorge, M.: A public fabric database for defect detection methods and results. AUTEX Res. J. 19(4), 363–374 (2019). https://doi.org/10.2478/aut-2019-0035
Mattson, T.G., Anderson, T.A., Georgakoudis, G.: PyOMP: multithreaded parallel programming in python. Comput. Sci. Eng. 23(6), 77–80 (2021). https://doi.org/10.1109/mcse.2021.3128806
Mera, C., Orozco-Alzate, M., Branch, J.: Incremental learning of concept drift in Multiple Instance Learning for industrial visual inspection. Comput. Ind. 109, 153–164 (2019). https://doi.org/10.1016/j.compind.2019.04.006
Mera-Banguero, S.C.A.: Detección de defectos en sistemas de inspección visual automática a través del aprendizaje de múltiples instancias. Ph.D. thesis, Universidad Nacional de Colombia - Sede Medellín, Medellín, Colombia (2017). https://repositorio.unal.edu.co/handle/unal/59346. Doctorate in Engineering - Systems and Informatics
Nguyen, T., Raich, R.: Incomplete label multiple instance multiple label learning. IEEE Trans. Pattern Anal. Mach. Intell. 44(3), 1320–1337 (2022). https://doi.org/10.1109/tpami.2020.3017456
Pękalska, E., Duin, R.P.W.: The Dissimilarity Representation for Pattern Recognition: Foundations and Applications, Machine Perception and Artificial Intelligence, vol. 64. World Scientific, Singapore (2005). https://doi.org/10.1142/5965
Smith, M.L., Smith, L.N., Hansen, M.F.: The quiet revolution in machine vision - a state-of-the-art survey paper, including historical review, perspectives, and future directions. Comput. Ind. 130, 103472 (2021). https://doi.org/10.1016/j.compind.2021.103472
Zhou, Z.H.: A brief introduction to weakly supervised learning. Natl. Sci. Rev. 5(1), 44–53 (2018). https://doi.org/10.1093/nsr/nwx106
Acknowledgment
This work was supported by Universidad Nacional de Colombia—Sede Manizales.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Villegas-Jaramillo, E.J., Uribe-Hurtado, A.L., Orozco-Alzate, M. (2023). Multi-core Parallelization of Point Set Dissimilarities for Accelerating the Comparison of Bags with Many Instances. In: Omatu, S., Mehmood, R., Sitek, P., Cicerone, S., Rodríguez, S. (eds) Distributed Computing and Artificial Intelligence, 19th International Conference. DCAI 2022. Lecture Notes in Networks and Systems, vol 583. Springer, Cham. https://doi.org/10.1007/978-3-031-20859-1_21
Download citation
DOI: https://doi.org/10.1007/978-3-031-20859-1_21
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-20858-4
Online ISBN: 978-3-031-20859-1
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)