Web Scrapping Financial News

We often have plenty of unstructured data available for free on the internet. Some of this data may be useful for combining with other structured or unstructured data available in the organization.

The project aims to automate the process of gathering unstructured (raw HTML) finance data using Python library BeautifulSoup & transform into structured data JSON and save as CSV

Objectives :

Automate the process of gathering unstructured data which is in the form of raw HTML.
Learn to web scrap Financial News of specific listed companies on the Stock Market.
Use BeautifulSoup4 Python library for web scraping - Install, Exception Handling, Advanced HTML Parsing.
How to traverse a single domain to fetch data from many HTML pages.
Process gathered (scrapped) data and transform it into structured format JSON and save as CSV.

Set up and Installation:

pip install --upgrade pip
pip install -r requirements.txt

Create a sub-directory 'content' in project-directory to save CSV files

This is what a general DIY web scraping process looks like:

Identify the target website
Collect URLs of the pages where you want to extract data from
Make a request to these URLs to get the HTML of the page
Use locators to find the data in the HTML
Save the data in a JSON or CSV file or some other structured format

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
content		content
1.Introduction_to_WebScraping.ipynb		1.Introduction_to_WebScraping.ipynb
2.WebScraping_News_Blog.ipynb		2.WebScraping_News_Blog.ipynb
3.Parsing_Pages.ipynb		3.Parsing_Pages.ipynb
4.Automate_Scrapping.ipynb		4.Automate_Scrapping.ipynb
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

content

content

1.Introduction_to_WebScraping.ipynb

1.Introduction_to_WebScraping.ipynb

2.WebScraping_News_Blog.ipynb

2.WebScraping_News_Blog.ipynb

3.Parsing_Pages.ipynb

3.Parsing_Pages.ipynb

4.Automate_Scrapping.ipynb

4.Automate_Scrapping.ipynb

README.md

README.md

requirements.txt

requirements.txt

Repository files navigation

Web Scrapping Financial News

Objectives :

Set up and Installation:

This is what a general DIY web scraping process looks like:

About

Releases

Packages

Languages

Jigisha-p/Automated-Financial-News-Scraping-and-Structured-Data-Conversion

Folders and files

Latest commit

History

Repository files navigation

Web Scrapping Financial News

Objectives :

Set up and Installation:

This is what a general DIY web scraping process looks like:

About

Topics

Resources

Stars

Watchers

Forks

Languages