This chapter includes several larger examples of web scrapers. Unlike most of the examples in the previous chapters, the examples here serve a twofold purpose. First, they use real-life websites instead of a curated, safe environment. We have not used many real-life examples so far because of the dynamic nature of the web: the examples covered here may no longer produce exactly the same results, or may be broken altogether, by the time you read them. That being said, we have tried to select sites that are rather scraper friendly and not very prone to change. Second, these examples highlight how the various concepts seen throughout the book “fall together” and interact, and hint at some interesting data science-oriented use cases.



Copyright information

© Seppe vanden Broucke and Bart Baesens 2018

Authors and Affiliations

  1. KU Leuven, Leuven, Belgium
  2. Dept of Decision Sci & Info Managem, KU Leuven, Leuven, Belgium
