Scrapy spider to scrape the data of people listed on the EU Whoiswho website

openstate/EU_whoiswho_scraper

EU Whoiswho Scraper

Scrapy spider to scrape the data of people listed on the EU Whoiswho website.

If you're looking for the dataset, go to this page.

Instructions

  • Clone this repository, or simply download the Python files.
  • Install Scrapy.
  • Run the following command to start scraping (scraping the whole website takes 9 hours!):
    scrapy runspider spider.py -o data.json
  • (Optional) Run json2csv.py to convert the results from data.json to data.csv.
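
For readers who want to see what the optional conversion step involves, here is a minimal sketch of turning Scrapy's JSON output into CSV. It is not the repository's json2csv.py, whose contents are not shown here. It assumes data.json holds a JSON array of flat objects, which is what `-o data.json` normally produces. The helper names `items_to_csv` and `convert` and the sample field names are hypothetical.

```python
import csv
import io
import json


def items_to_csv(items, out_file):
    """Write a list of flat dicts to out_file as CSV and return the column names."""
    # Build the header from the union of all keys, in first-seen order,
    # because scraped items do not necessarily share the same fields.
    fieldnames = []
    for item in items:
        for key in item:
            if key not in fieldnames:
                fieldnames.append(key)

    # restval fills in an empty cell when an item lacks a column.
    writer = csv.DictWriter(out_file, fieldnames=fieldnames, restval="")
    writer.writeheader()
    for item in items:
        writer.writerow(item)
    return fieldnames


def convert(json_path="data.json", csv_path="data.csv"):
    """Read a JSON array of items from json_path and write it to csv_path."""
    with open(json_path, encoding="utf-8") as src:
        items = json.load(src)
    # newline="" lets the csv module control line endings itself.
    with open(csv_path, "w", newline="", encoding="utf-8") as dst:
        items_to_csv(items, dst)


# In-memory demonstration with made-up records (field names are assumptions).
sample = [{"name": "A", "role": "X"}, {"name": "B", "unit": "Y"}]
buffer = io.StringIO()
items_to_csv(sample, buffer)
print(buffer.getvalue())
```

Calling `convert()` with no arguments reads data.json and writes data.csv in the current directory. Nested values such as lists would be written as their Python string representation, so a real converter may need to flatten them first.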
