Scrapy spider to scrape the data of people listed on the EU Whoiswho website.
If you're looking for the dataset, go to this page.
- Clone this repository or simply download the Python files
- Install Scrapy
- Run the following command to start scraping (the whole website takes 9 hours to scrape!):
scrapy runspider spider.py -o data.json
- (optional) Run json2csv.py to convert the results from data.json to data.csv
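
For readers who want to see what the conversion step involves, here is a minimal sketch of a JSON-to-CSV converter. It is not the json2csv.py shipped in this repository. It assumes that data.json is a JSON array of flat objects (which is what Scrapy's `-o data.json` feed export produces for simple items), and the function name `json_to_csv` is hypothetical.

```python
import csv
import json


def json_to_csv(json_path, csv_path):
    """Convert a JSON array of flat objects into a CSV file.

    Hypothetical helper for illustration; returns the number of rows written.
    """
    with open(json_path, encoding="utf-8") as f:
        items = json.load(f)

    # Items may not all share the same keys, so collect every key
    # in first-seen order to build the CSV header.
    fieldnames = []
    for item in items:
        for key in item:
            if key not in fieldnames:
                fieldnames.append(key)

    # newline="" lets the csv module control line endings itself.
    with open(csv_path, "w", newline="", encoding="utf-8") as f:
        # restval="" fills in an empty cell when an item lacks a key.
        writer = csv.DictWriter(f, fieldnames=fieldnames, restval="")
        writer.writeheader()
        writer.writerows(items)

    return len(items)
```

Calling `json_to_csv("data.json", "data.csv")` after the scrape finishes would write one row per scraped item. Nested values such as lists would be written as their Python string representation, so a real converter may need to flatten them first.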