Advertisement

How to Find Correlated Internet Failures

  • Ramakrishna PadmanabhanEmail author
  • Aaron Schulman
  • Alberto Dainotti
  • Dave Levin
  • Neil Spring
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 11419)

Abstract

Even as residential users increasingly rely upon the Internet, connectivity sometimes fails. Characterizing small-scale failures of last mile networks is essential to improving Internet reliability.

In this paper, we develop and evaluate an approach to detect Internet failure events that affect multiple users simultaneously using measurements from the Thunderping project. Thunderping probes addresses across the U.S. When the areas in which they are geo-located are affected by severe weather alerts. It detects a disruption event when an IP address ceases to respond to pings. In this paper, we focus on simultaneous disruptions of multiple addresses that are related to each other by geography and ISP, and thus are indicative of a shared cause. Using binomial testing, we detect groups of per-IP disruptions that are unlikely to have happened independently. We characterize these dependent disruption events and present results that challenge conventional wisdom on how such outages affect Internet address blocks.

Notes

Acknowledgments

We thank Arthur Berger, Philipp Richter, our shepherd Georgios Smaragdakis, and the anonymous reviewers for their thoughtful feedback. This research is supported by the U.S. Department of Homeland Security (DHS) Science and Technology Directorate, Cyber Security Division (DHS S&T/CSD) via contract number 70RSAT18CB0000015 and by NSF grants CNS-1619048 and CNS-1526635.

References

  1. 1.
    Argon, O., Bremler-Barr, A., Mokryn, O., Schirman, D., Shavitt, Y., Weinsberg, U.: On the dynamics of IP address allocation and availability of end-hosts. arXiv preprint arXiv:1011.2324 (2010)
  2. 2.
    Bischof, Z., Bustamante, F., Feamster, N.: The growing importance of being always on - a first look at the reliability of broadband internet access. In: Research Conference on Communications, Information and Internet Policy (TPRC), vol. 46 (2018)Google Scholar
  3. 3.
    Bischof, Z.S., Bustamante, F.E., Stanojevic, R.: Need, want. Broadband markets and the behavior of users. In: IMC, Can Afford (2014)Google Scholar
  4. 4.
    Dainotti, A., et al.: Analysis of country-wide Internet outages caused by censorship. In: IMC (2011)Google Scholar
  5. 5.
    Grover, S., et al.: Peeking behind the NAT: an empirical study of home networks. In: IMC (2013)Google Scholar
  6. 6.
    Heidemann, J., Pradkin, Y., Govindan, R., Papadopoulos, C., Bartlett, G., Bannister, J.: Census and survey of the visible Internet. In: IMC (2008)Google Scholar
  7. 7.
    Internet Outage Detection and Analysis (IODA). https://www.caida.org/projects/ioda/
  8. 8.
    National Hurricane Center Tropical Cyclone Report: Hurricane Irma. https://www.nhc.noaa.gov/data/tcr/AL112017_Irma.pdf
  9. 9.
    Katz-Basset, E., Madhyastha, H.V., John, J.P., Krishnamurthy, A., Wetherall, D., Anderson, T.: Studying black holes in the internet with Hubble. In: NSDI (2008)Google Scholar
  10. 10.
  11. 11.
    Northeast Storm Undergoes Bombogenesis, Bringing 70 MPH Gusts, Almost 350 Reports of Wind Damage, Flooding—The Weather Channel. https://weather.com/forecast/regional/news/2017-10-30-northeast-storm-damaging-winds-flooding
  12. 12.
    29–30 October 2017 damaging winds, heavy rainfall & flooding. https://www.weather.gov/aly/October29-302017
  13. 13.
    More than 1 million power outages in the Northeast after blockbuster fall storm - The Washington Post. https://www.washingtonpost.com/news/capital-weather-gang/wp/2017/10/30/over-one-million-power-outages-in-the-northeast-after-blockbuster-fall-storm/
  14. 14.
    Comcast outage on Sep 13 2017 in the Outages Mailing List. https://puck.nether.net/pipermail/outages/2017-September/010754.html
  15. 15.
    Padmanabhan, R.: Analyzing internet reliability remotely with probing-based techniques. Ph.D. thesis, University of Maryland (2018)Google Scholar
  16. 16.
    Padmanabhan, R., Dhamdhere, A., Aben, E., Claffy, K., Spring, N.: Reasons dynamic addresses change. In: IMC (2016)Google Scholar
  17. 17.
    Padmanabhan, R., Owen, P., Schulman, A., Spring, N.: Timeouts: beware surprisingly high delay. In: IMC (2015)Google Scholar
  18. 18.
    Quan, L., Heidemann, J., Pradkin, Y.: Trinocular: understanding internet reliability through adaptive probing. In: SIGCOMM (2013)Google Scholar
  19. 19.
    Richter, P., Padmanabhan, R., Plonka, D., Berger, A., Clark, D.: Advancing the art of internet edge outage detection. In: IMC (2018)Google Scholar
  20. 20.
    Sánchez, M.A., et al.: Dasu: pushing experiments to the internet’s edge. In: NSDI (2013)Google Scholar
  21. 21.
    Schulman, A., Spring, N.: Pingin’ in the rain. In: IMC (2011)Google Scholar
  22. 22.
    Shah, A., Fontugne, R., Aben, E., Pelsser, C., Bush, R.: Disco: fast, good, and cheap outage detection. In: TMA (2017)Google Scholar
  23. 23.
    Shavitt, Y., Shir, E.: DIMES: let the internet measure itself. SIGCOMM Comput. Commun. Rev. 35, 71–74 (2005)CrossRefGoogle Scholar
  24. 24.
    Sundaresan, S., Burnett, S., Feamster, N., de Donato, W.: BISmark: a testbed for deploying measurements and applications in broadband access networks. In: USENIX ATC, June 2014Google Scholar
  25. 25.
    van Belle, G., Heagerty, P.J., Fischer, L.D., Lumley, T.S.: Biostatistics: A Methodology for the Health Sciences, 2nd edn. Wiley, Hoboken (2004)CrossRefGoogle Scholar

Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  • Ramakrishna Padmanabhan
    • 1
    • 2
    Email author
  • Aaron Schulman
    • 3
  • Alberto Dainotti
    • 2
  • Dave Levin
    • 1
  • Neil Spring
    • 1
  1. 1.University of MarylandCollege ParkUSA
  2. 2.CAID/UCSDLa JollaUSA
  3. 3.UCSDSan DiegoUSA

Personalised recommendations