cricinfo-web-scraping

Using Node.js (cheerio module)

About

This is a web scraping project that extracts information from the cricinfo website. It carries out the following three activities:

Print the commentary for the last ball.
Print the name of the winning team and the bowler from that team who took the most wickets (with the bowler's name and number of wickets).
Print the birthday of every batsman who played.
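
The second activity comes down to picking the bowler with the highest wicket count once the winning team's bowling figures have been parsed. A minimal sketch of that selection step follows. The function name topWicketTaker, the row shape ({ name, wickets }) and the sample data are all hypothetical and are not taken from the files in the activities directory.

```javascript
// Hypothetical row shape: one object per bowler of the winning team,
// as it might look after the bowling table has been parsed.
function topWicketTaker(bowlers) {
  if (bowlers.length === 0) return null;
  // Keep the earlier bowler on a tie (strict comparison).
  return bowlers.reduce((best, b) => (b.wickets > best.wickets ? b : best));
}

// Made-up sample data, for illustration only.
const sampleBowlers = [
  { name: "Bowler A", wickets: 1 },
  { name: "Bowler B", wickets: 3 },
  { name: "Bowler C", wickets: 2 },
];

const top = topWicketTaker(sampleBowlers);
console.log(`${top.name}: ${top.wickets} wickets`);
```

On a tie this sketch returns the bowler who appears first in the list; the project itself may resolve ties differently.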

How to run this project

Clone this repository to your local environment.
Run the command npm install to install all the required packages.
Run each file in the activities directory one by one to get the desired output.

Insights-

A separate file implements each activity.
The cheerio module is used for web scraping.
Disadvantage of the cheerio module: it parses and extracts only the initially loaded HTML, so the first-ball commentary cannot be retrieved with it.
HTML segregation is done using a separate file (table.html) to make information extraction easier.
Multi-page scraping is used to print the birthday of every batsman.

swatijha-2906/cricinfo-web-scraping
