Amazon-Data-Scraper

This repository contains two Python scripts that perform web scraping on HTML files and convert the extracted data to JSON and CSV formats.

Files

csv_output.csv: This file is the output of the app.py script. It contains the scraped data from a single HTML file in CSV format.
json_output.json: This file is the output of the app(2).py script. It contains the scraped data from multiple HTML files in JSON format.
app.py: This Python script performs web scraping on a single HTML file (index.html) and extracts specific information such as the image link, title, rating, and price of a product. The scraped data is then saved to a JSON file (data.json).
app(2).py: This Python script performs web scraping on multiple HTML files stored in the Pages Snapshots directory. It extracts similar information to that in app.py from each file and saves the scraped data to a JSON file (all_data.json).

Prerequisites

Python 3.x
json library
os library
BeautifulSoup library

Usage

Ensure that you have Python 3.x installed on your system.
Install the required libraries by running the following command:
```
pip install beautifulsoup4
```
Run the app.py script to perform web scraping on the index.html file and save the data to a JSON file (data.json).
```
python app.py
```
Place the HTML files you want to scrape in the Pages Snapshots directory for the app(2).py script.
Run the app(2).py script to perform web scraping on the HTML files and save the data to a JSON file (all_data.json).
```
python app(2).py
```

Output

After running app.py, the scraped data from the index.html file will be saved as data.json.
After running app(2).py, the scraped data from the HTML files in the Pages Snapshots directory will be saved as all_data.json.
The data.json and all_data.json files can be further processed or used as desired.

FAQ

Can I follow you on your social media platforms ?

Yes you can !

You'll find me

Posting memes or talking about data on Twitter
Writing articles about complex data concepts and making them digestible on Medium
Posting data vizualizations inspiration and data infographics on Instagram

License

Distributed under the no License. See LICENSE.txt for more information.

Show Your Support

Please ⭐️ this repository if this project helped you or buy me coffee!

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
CSV Output		CSV Output
JSON Output		JSON Output
README.md		README.md
app(s).py		app(s).py
app.py		app.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CSV Output

CSV Output

JSON Output

JSON Output

README.md

README.md

app(s).py

app(s).py

app.py

app.py

Repository files navigation

Amazon-Data-Scraper

Files

Prerequisites

Usage

Output

FAQ

Can I follow you on your social media platforms ?

License

Show Your Support

About

Languages

LowCodeDataGirl/Amazon-Data-Scraper

Folders and files

Latest commit

History

Repository files navigation

Amazon-Data-Scraper

Files

Prerequisites

Usage

Output

FAQ

Can I follow you on your social media platforms ?

License

Show Your Support

About

Topics

Resources

Stars

Watchers

Forks

Languages