MEDCollector: Multisource Epidemic Data Collector

  • João Zamite
  • Fabrício A. B. Silva
  • Francisco Couto
  • Mário J. Silva
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6266)


This paper analyzes the requirements for, and presents a novel approach to, the development of a system for epidemiological data collection and integration based on the principles of interoperability and modularity. Accurate and timely epidemic models require the integration of large, fresh datasets. Thus, from an e-science perspective, collected data should be shared seamlessly across multiple applications. Our approach, MEDCollector, addresses this through workflow design, which enables the extraction of data from multiple Web sources. Mapping the extracted entities to ontologies will guarantee consistency within the gathered datasets and thereby enhance epidemic modeling tools.


Epidemic Surveillance, Data Collection, Information Integration, Workflow Design





Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • João Zamite
    • 1
  • Fabrício A. B. Silva
    • 1
  • Francisco Couto
    • 1
  • Mário J. Silva
    • 1
  1. LaSIGE, Faculty of Science, University of Lisbon, Portugal
