International Journal of Public Health

, Volume 59, Issue 5, pp 841–849

Demand-based web surveillance of sexually transmitted infections in Russia

  • Alexander Domnich
  • Eva K. Arbuzova
  • Alessio Signori
  • Daniela Amicizia
  • Donatella Panatto
  • Roberto Gasparini
Original Article



To investigate the possibility of using HIV- and syphilis-related web queries to predict incident diagnosis rates of sexually transmitted infections in Russia.


The regional volume of HIV/syphilis queries, normalized to the total number of queries submitted to the most popular search engine, was used to predict the notification rates of HIV/syphilis in each region by applying both global non-spatial and spatial statistics.


Nationwide, both search volumes and regional HIV/syphilis diagnosis rates were positively spatially auto-correlated, indicating a clustered pattern of spatial distribution. A high positive correlation between notification rates and search volume was observed. Compared with linear models, spatially explicit geographically weighted models adjusted for broadband Internet diffusion proved superior in predicting the regional level of the HIV/syphilis epidemic on the basis of their search volume.


Timeliness, easy availability, low cost, and transparency make HIV- and syphilis-related web queries a promising addition to traditional methods of disease surveillance in Russia. Geographically weighted regression provides useful insights, as it is able to capture the spatial heterogeneity of the relationship between search volume and disease incidence.


Disease surveillance HIV Search engine Search volume Infodemiology 

Supplementary material

38_2014_581_MOESM1_ESM.pdf (527 kb)
Supplementary material 1 (PDF 526 kb)


  1. Amstadter AB, Broman-Fulks J, Zinzow H, Ruggiero KJ, Cercone J (2009) Internet-based interventions for traumatic stress-related mental health problems: a review and suggestion for future research. Clin Psychol Rev 29:410–420PubMedCrossRefPubMedCentralGoogle Scholar
  2. Anselin L (2002) Under the hood issues in the specification and interpretation of spatial regression models. Agric Econ 27:247–267CrossRefGoogle Scholar
  3. Atkinson NL, Saperstein SL, Pleis J (2009) Using the internet for health-related activities: findings from a national probability sample. J Med Internet Res 11:e4PubMedCrossRefPubMedCentralGoogle Scholar
  4. Bernardo TM, Rajic A, Young I, Robiadek K, Pham MT, Funk JA (2013) Scoping review on search queries and social media for disease surveillance: a chronology of innovation. J Med Internet Res 15:e147PubMedCrossRefPubMedCentralGoogle Scholar
  5. Brunsdon C, Charlton M, Harris P (2012) Living with collinearity in local regression models. In: Proceedings of the 10th international symposium on spatial accuracy assessment in natural resources and environmental sciences. Accessed 29 November 2013
  6. Carneiro HA, Mylonakis E (2009) Google trends: a web-based tool for real-time surveillance of disease outbreaks. Clin Infect Dis 49:1557–1564PubMedCrossRefGoogle Scholar
  7. Centralized Information System for Infectious Diseases (2013) Syphilis, incidence rate. Accessed 29 November 2013
  8. Chen W, Wellman B (2004) The global digital divide—within and between countries. It Soc 1:39–45Google Scholar
  9. Dehne KL, Khodakevich L, Hamers FF, Schwartländer B (1999) The HIV/AIDS epidemic in eastern Europe: recent patterns and trends and their implications for policy-making. AIDS 13:741–749PubMedCrossRefGoogle Scholar
  10. Development Core Team R (2012) R: A language and environment for statistical computing. R Foundation for Statistical Computing, ViennaGoogle Scholar
  11. Duncan DT, Castro MC, Gortmaker SL, Aldstadt J, Melly SJ, Bennett GG (2012) Racial differences in the built environment–body mass index relationship? A geospatial analysis of adolescents in urban neighborhoods. Int J Health Geogr 11:11PubMedCrossRefPubMedCentralGoogle Scholar
  12. Eysenbach G (2009) Infodemiology and infoveillance: framework for an emerging set of public health informatics methods to analyze search, communication and publication behavior on the Internet. J Med Internet Res 11:e11PubMedCrossRefPubMedCentralGoogle Scholar
  13. Federal AIDS centre. Statistics. 28 November 2013
  14. Fotheringham S, Brunsdon C, Charlton M (2002) Geographically weighted regression: the analysis of spatially varying relationships. Wiley, ChichesterGoogle Scholar
  15. Hulth A, Rydevik G (2011) Web query-based surveillance in Sweden during the influenza A(H1N1)2009 pandemic, April 2009 to February 2010. Euro Surveill 16:pii: 19856Google Scholar
  16. Jena AB, Karaca-Mandic P, Weaver L, Seabury SA (2013) Predicting new diagnoses of HIV infection using internet search engine data. Clin Infect Dis 56:1352–1353PubMedCrossRefGoogle Scholar
  17. Koniuchova K, Rozova M, Savicheva A, Sokolovskiy E, Domeika M, Unemo M (2013) Epidemiology of sexually transmitted infections in Tver. Sex Transm Infect 89(Suppl 1):A158CrossRefGoogle Scholar
  18. Marcus U, Hickson F, Weatherburn P, Schmidt AJ (2013) Estimating the size of the MSM populations for 38 European countries by calculating the survey-surveillance discrepancies (SSD) between self-reported new HIV diagnoses from the European MSM internet survey (EMIS) and surveillance-reported HIV diagnoses among MSM in 2009. BMC Public Health 13:919PubMedCrossRefPubMedCentralGoogle Scholar
  19. Miles J, Shevlin M (2001) Applying regression & correlation: a guide for students and researchers. SAGE Publications Ltd, LondonGoogle Scholar
  20. Moran D, Jordaan JA (2007) HIV/AIDS in Russia: determinants of regional prevalence. Int J Health Geogr 6:22PubMedCrossRefPubMedCentralGoogle Scholar
  21. Netesov SV, Conrad JL (2001) Emerging infectious diseases in Russia, 1990-1999. Emerg Infect Dis 7:1–5PubMedCrossRefPubMedCentralGoogle Scholar
  22. Noll-Hussong M (2012) Syphilis and the internet. Euro Surveill 17:pii:20249Google Scholar
  23. Polgreen PM, Chen Y, Pennock DM, Nelson FD (2008) Using internet searches for influenza surveillance. Clin Infect Dis 47:1443–1448PubMedCrossRefGoogle Scholar
  24. Prieger JE (2007) The supply side of the digital divide: is there equal availability in the broadband Internet access market? Econ Inq 41:346–363CrossRefGoogle Scholar
  25. Reeves PM (2000) Coping in cyberspace: the impact of Internet use on the ability of HIV-positive individuals to deal with their illness. J Health Commun 5(Suppl):47–59PubMedCrossRefGoogle Scholar
  26. Reeves PM (2001) How individuals coping with HIV/AIDS use the Internet. Health Educ Res 16:709–719PubMedCrossRefGoogle Scholar
  27. Regushevskaya E, Dubikaytis T, Laanpere M, Nikula M, Kuznetsova O, Karro H, Haavio-Mannila E, Hemminki E (2010) The determinants of sexually transmitted infections among reproductive age women in St. Petersburg, Estonia and Finland. Int J Public Health 55:581–589PubMedCrossRefGoogle Scholar
  28. Reis BY, Brownstein JS (2010) Measuring the impact of health policies using Internet search patterns: the case of abortion. BMC Public Health 10:514PubMedCrossRefPubMedCentralGoogle Scholar
  29. Riedner G, Dehne KL, Gromyko A (2000) Recent declines in reported syphilis rates in eastern Europe and central Asia: are the epidemics over? Sex Transm Infect 76:363–365PubMedCrossRefPubMedCentralGoogle Scholar
  30. Russian Federation Federal State Statistics Service (2012) Households with a personal computer and Internet access, as a percentage of the total number of households of the correspondent federal subject in 2011. Accessed 28 November 2013
  31. Russian Ministry of Communications and Mass Media (2012) Industry statistics. Accessed 28 November 2013
  32. Russian Ministry of Health and Social Development (2013) Statistics. Accessed 28 November 2013
  33. Webster P (2005) Russia underestimates HIV/AIDS incidence. CMAJ 172:985PubMedCrossRefPubMedCentralGoogle Scholar
  34. World Health Organization (WHO) (2011a) Introduction of Second-generation HIV Surveillance Guidelines in some Newly Independent States of Eastern Europe. Report on a WHO meeting. Accessed 28 November 2013
  35. World Health Organization (WHO) (2011b) UNAIDS. Guidelines on surveillance among populations most at risk for HIV, WHO, GenevaGoogle Scholar
  36. Yandex Wordstat© (2013) Search terms statistics. Accessed 28 November 2013
  37. Zhan W, Krasnoselskikh TV, Niccolai LM, Golovanov S, Kozlov AP, Abdala N (2011) Concurrent sexual partnerships and sexually transmitted diseases in Russia. Sex Transm Dis 38:543–547PubMedPubMedCentralGoogle Scholar
  38. Zheluk A, Gillespie JA, Quinn C (2012) Searching for truth: internet search patterns as a method of investigating online responses to a Russian illicit drug policy debate. J Med Internet Res 14:e165PubMedCrossRefPubMedCentralGoogle Scholar
  39. Zheluk A, Quinn C, Hercz D, Gillespie JA (2013) Internet search patterns of human immunodeficiency virus and the digital divide in the Russian Federation: infoveillance study. J Med Internet Res 15:e256PubMedCrossRefPubMedCentralGoogle Scholar
  40. Zhou XC, Shen HB (2010) Notifiable infectious disease surveillance with data collected by search engine. J Zhejiang Univ-Sci C 11:241–248CrossRefGoogle Scholar

Copyright information

© Swiss School of Public Health 2014

Authors and Affiliations

  • Alexander Domnich
    • 1
  • Eva K. Arbuzova
    • 2
  • Alessio Signori
    • 1
  • Daniela Amicizia
    • 1
  • Donatella Panatto
    • 1
  • Roberto Gasparini
    • 1
  1. 1.Department of Health SciencesUniversity of GenoaGenoaItaly
  2. 2.State Budgetary Healthcare Institution “Specialized Clinical Hospital for Infectious Diseases”Healthcare Department of Krasnodar RegionKrasnodarRussia

Personalised recommendations