This is a Scrapy project that scrapes the ePlanning website (http://eplanning.ie/) and extracts agent details.
This project is only meant for educational purposes.
Main Site
Country Url Selection
Select Received Application
Form Request Data
Application URL
New Page URL
Select Agent Button
Select Agent Data
This project extracts Agent Data. The extracted data looks like this sample:
{
    "name": " Sean Boyle Architects",
    "address": [
        "Unit 3, Second Floor",
        "Donohoe Building, Kennedy Centre",
        "Kennedy Road, Navan",
        "Co. Meath "
    ],
    "phone": "046 9023797 ",
    "fax": " ",
    "email": "info@boylearchitects.ie",
    "url": "http://www.eplanning.ie/MeathCC/AppFileRefDetails/aa200649/0"
}
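As the sample shows, the scraped values can carry leading or trailing whitespace, and an empty field such as fax may come back as a single space. The snippet below is a minimal sketch of one way to tidy an item after scraping. The function name clean_agent is hypothetical and is not part of this project; it only assumes the field layout shown in the sample above.

```python
def clean_agent(item):
    """Return a copy of a scraped agent item with surrounding whitespace removed.

    Hypothetical helper, not part of the project: it assumes every value is
    either a string or a list of strings, as in the sample item.
    """
    cleaned = {}
    for key, value in item.items():
        if isinstance(value, list):
            # Strip each address line and drop lines that end up empty.
            stripped = [line.strip() for line in value]
            cleaned[key] = [line for line in stripped if line]
        elif isinstance(value, str):
            cleaned[key] = value.strip()
        else:
            # Leave any other type untouched.
            cleaned[key] = value
    return cleaned


# The sample item from this README, copied as-is.
sample = {
    "name": " Sean Boyle Architects",
    "address": [
        "Unit 3, Second Floor",
        "Donohoe Building, Kennedy Centre",
        "Kennedy Road, Navan",
        "Co. Meath ",
    ],
    "phone": "046 9023797 ",
    "fax": " ",
    "email": "info@boylearchitects.ie",
    "url": "http://www.eplanning.ie/MeathCC/AppFileRefDetails/aa200649/0",
}

cleaned = clean_agent(sample)
print(cleaned["name"])
print(cleaned["address"])
```

A function like this could also be called from a Scrapy item pipeline, but that wiring is left out here because it depends on how the project is configured.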
This project contains one spider, which you can list using the list
command:
$ scrapy list
eplanningSpider
The spider extracts agent data from the ePlanning site.
You can run the spider using the scrapy crawl
command:
$ scrapy crawl eplanningSpider
If you want to save the scraped data to a file, you can pass the -o
option:
$ scrapy crawl eplanningSpider -o output.json
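Scrapy's JSON feed export writes the scraped items as a single JSON array, so the output file can be read back with Python's standard json module. The following is a small sketch of post-processing that output. The helper name agents_with_email and the example records are hypothetical, chosen only to illustrate the idea.

```python
import json


def agents_with_email(json_text):
    """Parse a JSON array of agent items and keep those with a non-blank email.

    Hypothetical helper: it assumes each item is an object that may have an
    "email" field, as in the sample item shown earlier.
    """
    agents = json.loads(json_text)
    # Treat a missing, null, or whitespace-only email as "no email".
    return [a for a in agents if (a.get("email") or "").strip()]


# Made-up records standing in for the contents of output.json.
example_output = """
[
    {"name": "Agent A", "email": "a@example.com"},
    {"name": "Agent B", "email": " "},
    {"name": "Agent C"}
]
"""

selected = agents_with_email(example_output)
print([a["name"] for a in selected])
```

To run this against a real crawl, the text could be read from the file first, for example with open("output.json", encoding="utf-8") as f: selected = agents_with_email(f.read()).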