karthik-ballullaya/TimesJobSpyder_Scrapy

TimesJobSpyder_Scrapy

A spider bot that scrapes timesjobs.com for job listings and saves them to a database, built with Scrapy.

Table of contents:

  1. Installation
  2. Data scraped
  3. How to run
  4. How to view the results

Installation:

Run the following command to install the required packages:

    pip install -r requirements.txt

Data scraped:

The spider scrapes the following fields:

| Field | Datatype | Description |
| --- | --- | --- |
| jobType | string | Type of job |
| moreDetails | string | Link (href) to the page with more details about the job listing |
| companyName | string | Name of the company |
| reqExp | string | Required experience |
| location | string | Location of the office |
| compensation | string | Compensation for the job |
| jobDescription | string | Description of the job |
| skillSet | string | Skill set required for the job |
| postedTime | string | When the job was listed |
| isWFHAvailable | string | Whether a work-from-home option is available |
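
As an illustration of the record shape described above, here is a minimal sketch that models one scraped listing as a plain Python dataclass. The class name `JobListing` and the empty-string defaults are assumptions made for this example. The project itself most likely uses a Scrapy item, which is not shown here so that the sketch runs without Scrapy installed.

```python
from dataclasses import dataclass, asdict, fields


@dataclass
class JobListing:
    """Hypothetical container mirroring the scraped fields (all strings)."""
    jobType: str = ""
    moreDetails: str = ""
    companyName: str = ""
    reqExp: str = ""
    location: str = ""
    compensation: str = ""
    jobDescription: str = ""
    skillSet: str = ""
    postedTime: str = ""
    isWFHAvailable: str = ""


def field_names():
    """Return the field names in declaration order."""
    return [f.name for f in fields(JobListing)]


# Illustrative values only; these are not real scraped data.
example = JobListing(jobType="Full time", location="Mumbai")
example_row = asdict(example)  # plain dict, convenient for a database insert
```

Keeping every field as a string matches the table above. Any conversion, for example parsing `reqExp` into a numeric range, would be a separate step.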

How to run:

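Run the spider from the project directory, passing the search parameters as spider arguments with -a. Example command:

    scrapy crawl timesjob -a keywords="Data science" -a location="Mumbai" -a workexp="1" -a maxpages="100"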
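
When the crawl needs to be launched from a script, for instance to loop over several keywords, the command line can be assembled programmatically. The sketch below is one way to do that. The helper `build_crawl_command` is hypothetical and not part of this repository. It only builds the argument list and does not start Scrapy itself.

```python
import shlex


def build_crawl_command(spider, **spider_args):
    """Build the argv list for `scrapy crawl <spider> -a name=value ...`."""
    cmd = ["scrapy", "crawl", spider]
    for name, value in spider_args.items():
        # Each spider argument becomes its own `-a name=value` pair.
        cmd += ["-a", f"{name}={value}"]
    return cmd


cmd = build_crawl_command(
    "timesjob",
    keywords="Data science",
    location="Mumbai",
    workexp=1,
    maxpages=100,
)

# shlex.join quotes arguments that contain spaces, giving a shell-safe string.
print(shlex.join(cmd))
```

The resulting list can be passed to `subprocess.run(cmd, check=True)` from the project directory. Scrapy hands `-a` values to the spider as strings, so numeric arguments such as `workexp` and `maxpages` arrive as text.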

How to view the results:

The scraped details are stored in a SQLite database named JobListing.db. You can use sqliteonline for quick viewing of the database.

Example query to view all the entries:

    SELECT * FROM job_listing_tb;
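
The same query can also be run locally with Python's built-in `sqlite3` module. The following is a minimal sketch that assumes `JobListing.db` is in the current directory and contains the `job_listing_tb` table. The function name `fetch_listings` and its `limit` parameter are introduced here for illustration only.

```python
import sqlite3
from contextlib import closing


def fetch_listings(db_path="JobListing.db", limit=None):
    """Return rows from job_listing_tb as a list of dicts keyed by column name."""
    query = "SELECT * FROM job_listing_tb"
    params = ()
    if limit is not None:
        query += " LIMIT ?"
        params = (limit,)
    # closing() ensures the connection is closed when the block exits.
    with closing(sqlite3.connect(db_path)) as conn:
        conn.row_factory = sqlite3.Row  # allows access to columns by name
        rows = conn.execute(query, params).fetchall()
    return [dict(row) for row in rows]


# Example usage (run after a crawl has created the database):
#     for row in fetch_listings(limit=5):
#         print(row)
```

Returning plain dicts keeps the result independent of the open connection, so the rows can be printed, filtered, or exported after the database is closed.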
