LPSE Parser - as reflected by its name - is a data parser for LPSE websites
Developed for KPP Pratama Kotabumi.
- Python3.5.3+
- urllib
- requests
- ssl
- BeautifulSoup4
In vars.py, several parameters need to be configured
govNameis the name of the website (without the trailing go.id or any subdomain)staticCodeis the last 3 numbers inKodecolumn of/eproc4/lelanglowNumandhighNumis the lower and upper bounds for the iteration.
Use python main.py to scrape several information from pengumumanlelang page.
The result will be recorded in results folder in csv format.
I am not responsible to any damage to the server due to the excessive requests sent from the script.
The project is under development where other pages will be scraped in addition to the existing ones. A database might also be needed to manage complex data structure, rather than the current mechanism of storing the data to csv format.
- Create other pages' scrapers
- Build a database system
- Build a reporting system (POD might be interesting)
- Auto update for new data