Skip to main content

A General Purpose Method for Image Collection Summarization and Exploration

  • Conference paper
  • First Online:
Image Analysis and Processing - ICIAP 2023 Workshops (ICIAP 2023)

Abstract

We propose a flexible framework that can be used to explore large-scale image datasets and summarize photo albums. Our proposed method first groups images based on their semantic content, and then selects the most diverse and aesthetically pleasing images to represent each category. To ensure the selection of high-quality images, we use features extracted from a Convolutional Neural Network to assess their diversity and perceptual properties. The effectiveness of our method is tested using benchmarking datasets and a qualitative study.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 89.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 119.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. booking. Automated image tagging at booking.com (2017)

    Google Scholar 

  2. Bradley, R.A., Terry, M.E.: Rank analysis of incomplete block designs: I the method of paired comparisons. Biometrika 39(3/4), 324–345 (1952)

    Article  MathSciNet  Google Scholar 

  3. Campadelli, P., Posenato, R., Schettini, R.: An algorithm for the selection of high-contrast color sets. Color Res. App. 24(2), 132–138 (1999)

    Article  Google Scholar 

  4. Ceroni, A., Solachidis, V., Niederée, C., Papadopoulou, O., Kanhabua, N., Mezaris, V.: To keep or not to keep: an expectation-oriented photo selection method for personal photo collections. In: Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, pp. 187–194 (2015)

    Google Scholar 

  5. Chang, H., Fisher, Yu., Wang, J., Ashley, D., Finkelstein, A.: Automatic triage for a photo series. ACM Trans. Graphics (TOG) 35(4), 1–10 (2016)

    Article  Google Scholar 

  6. Cheng, Y.: Mean shift, mode seeking, and clustering. IEEE Trans. Pattern Anal. Mach. Intell. 17(8), 790–799 (1995)

    Article  Google Scholar 

  7. Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., Li, F.-F.: Imagenet: a large-scale hierarchical image database. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255. IEEE (2009)

    Google Scholar 

  8. Ignatov, A., Kobyshev, N., Timofte, R., Vanhoey, K., Gool, L.V.: Wespe: weakly supervised photo enhancer for digital cameras. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 691–700 (2018)

    Google Scholar 

  9. Instagram. Our story: A quick walk through our history as a company (2016)

    Google Scholar 

  10. kayak. Hotel image categorization with deep learning (2018)

    Google Scholar 

  11. Kittur, A., Chi, E.H., Suh, B.: Crowdsourcing user studies with mechanical Turk. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 453–456 (2008)

    Google Scholar 

  12. Leonardi, M., Napoletano, P., Rozza, A., Schettini, R.: Modeling image aesthetics through aesthetic-related attributes. In: London Imaging Meeting, vol. 2021, Society for Imaging Science and Technology (2021)

    Google Scholar 

  13. Leonardi, M., Napoletano, P., Schettini, R., Rozza, A.: No reference, opinion unaware image quality assessment by anomaly detection. Sensors 21(3), 994 (2021)

    Article  Google Scholar 

  14. Li, C.-H., Chiu, C.-Y., Huang, C.-R., Chen, C.-S., Chien, L.-F.: Image content clustering and summarization for photo collections. In: 2006 IEEE International Conference on Multimedia and Expo, pp. 1033–1036. IEEE (2006)

    Google Scholar 

  15. Li, C., Loui, A.C., Chen, T.: Towards aesthetics: a photo quality assessment and photo selection system. In: Proceedings of the 18th ACM International Conference on Multimedia, pp. 827–830 (2010)

    Google Scholar 

  16. Lin, T.-Y., et al.: Microsoft COCO: common objects in context. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8693, pp. 740–755. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10602-1_48

    Chapter  Google Scholar 

  17. Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vision 60(2), 91–110 (2004)

    Article  Google Scholar 

  18. MacQueen, J., et al.: Some methods for classification and analysis of multivariate observations. In: Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, vol. 1, pp. 281–297. Oakland, CA, USA (1967)

    Google Scholar 

  19. Mahajan, D., et al.: Exploring the limits of weakly supervised pretraining. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11206, pp. 185–201. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01216-8_12

    Chapter  Google Scholar 

  20. Pouget, A., et al.: Fast and accurate camera scene detection on smartphones. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2569–2580 (2021)

    Google Scholar 

  21. Sinha, P., Mehrotra, S., Jain, R.: Summarization of personal photologs using multidimensional content and context. In: Proceedings of the 1st ACM International Conference on Multimedia Retrieval, pp. 1–8 (2011)

    Google Scholar 

  22. Trivago. How we build the image gallery on Trivago (2021)

    Google Scholar 

  23. Wang, X.-J., Zhang, L., Liu, C.: Duplicate discovery on 2 billion internet images. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 429–436 (2013)

    Google Scholar 

  24. Wang, Y., Lin, Z., Shen, X., Mech, R., Miller, G., Cottrell, G.W.: Event-specific image importance. In proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4810–4819 (2016)

    Google Scholar 

  25. Wang, Y., Lin, Z., Shen, X., Mech, R., Miller, G., Cottrell, G.W.: Recognizing and curating photo albums via event-specific image importance. arXiv preprint arXiv:1707.05911 (2017)

  26. Xie, S., Girshick, R., Dollár, P., Tu, Z., He, K.: Aggregated residual transformations for deep neural networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1492–1500 (2017)

    Google Scholar 

  27. Zephoria: The top 20 valuable Facebook statistics (2015)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Marco Leonardi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Leonardi, M., Napoletano, P., Rozza, A., Schettini, R. (2024). A General Purpose Method for Image Collection Summarization and Exploration. In: Foresti, G.L., Fusiello, A., Hancock, E. (eds) Image Analysis and Processing - ICIAP 2023 Workshops. ICIAP 2023. Lecture Notes in Computer Science, vol 14365. Springer, Cham. https://doi.org/10.1007/978-3-031-51023-6_7

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-51023-6_7

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-51022-9

  • Online ISBN: 978-3-031-51023-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics