Architecture for Aggregation, Processing and Provisioning of Data from Heterogeneous Scientific Information Services

  • Cezary Mazurek
  • Marcin Mielnicki
  • Aleksandra Nowak
  • Maciej Stroiński
  • Marcin Werla
  • Jan Węglarz
Part of the Studies in Computational Intelligence book series (SCI, volume 467)


One of the tasks undertaken at PSNC within the SYNAT project is the design of an architecture for a scientific information system that integrates the heterogeneous, distributed services needed to build a knowledge base and innovative applications on top of it. This paper gives an overall description of the designed architecture and analyzes its most important technical aspects, features, and possible weak points. It also presents initial results from a test deployment of the first prototype of several architecture components, including the initial aggregation and processing of several million metadata records from several tens of network services.


data aggregation, REST architecture, distributed systems, metadata processing





Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Cezary Mazurek (1)
  • Marcin Mielnicki (1)
  • Aleksandra Nowak (1)
  • Maciej Stroiński (1)
  • Marcin Werla (1)
  • Jan Węglarz (1)

  1. Poznań Supercomputing and Networking Center, Poznań, Poland
