A Python application to scrape and clone a static website
You need to install Python 3 to run this script. Install the dependencies with:
pip3 install -r requirements.txt
Example command
python3 app/main.py --url https://chungta.vn --output www --resource-threads=50 --threads=50 --force=true --download_resources=False
This crawls https://chungta.vn and writes the output into the www folder.
usage: main.py [-h] [--url URL] [--output OUTPUT] [--threads THREADS]
               [--resource-threads RESOURCE_THREADS] [--force FORCE]
               [--download_resources DOWNLOAD_RESOURCES]

Website scrapper info.

optional arguments:
  -h, --help            show this help message and exit
  --url URL             Url of website
  --output OUTPUT       Output folder
  --threads THREADS     Number of threads to run to fetch html page in
                        concurences
  --resource-threads RESOURCE_THREADS
                        Number of threads to run to fetch resources just as
                        image, video
  --force FORCE         Remove history and download everything again
  --download_resources DOWNLOAD_RESOURCES
                        Download images, js, css, and other files
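
The options above could be declared with argparse roughly as follows. This is a minimal sketch, not the actual contents of app/main.py: the helper str_to_bool, the function build_parser, and all default values are assumptions made for illustration. It shows one reason to parse --force and --download_resources explicitly: argparse's type=bool would treat the string "False" as true.

```python
import argparse


def str_to_bool(value: str) -> bool:
    """Hypothetical helper: turn 'true'/'False'-style strings into a bool."""
    lowered = value.strip().lower()
    if lowered in ("true", "1", "yes", "y"):
        return True
    if lowered in ("false", "0", "no", "n"):
        return False
    raise argparse.ArgumentTypeError(f"expected a boolean, got {value!r}")


def build_parser() -> argparse.ArgumentParser:
    """Build a parser mirroring the usage text; defaults are assumed."""
    parser = argparse.ArgumentParser(
        prog="main.py", description="Website scrapper info."
    )
    parser.add_argument("--url", help="Url of website")
    parser.add_argument("--output", help="Output folder")
    parser.add_argument("--threads", type=int, default=1,
                        help="Number of threads used to fetch HTML pages")
    parser.add_argument("--resource-threads", type=int, default=1,
                        help="Number of threads used to fetch resources")
    parser.add_argument("--force", type=str_to_bool, default=False,
                        help="Remove history and download everything again")
    parser.add_argument("--download_resources", type=str_to_bool, default=True,
                        help="Download images, js, css, and other files")
    return parser


# Parse the same arguments as the example command in this README.
args = build_parser().parse_args([
    "--url", "https://chungta.vn",
    "--output", "www",
    "--resource-threads=50",
    "--threads=50",
    "--force=true",
    "--download_resources=False",
])
```

With this sketch, args.resource_threads and args.threads are integers, and args.force and args.download_resources are real booleans rather than strings.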
- Restore the session from the database to avoid duplicate downloads
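
One way such a session store might work is sketched below. This README does not say which database the script uses, so sqlite3, the table name visited, and the functions open_session and mark_visited are all assumptions chosen for illustration. The idea is that each crawled URL is recorded once, so a restarted run can skip pages it has already fetched, and a forced run clears the history first.

```python
import sqlite3


def open_session(db_path: str, force: bool = False) -> sqlite3.Connection:
    """Open (or create) the hypothetical session database.

    When force is True, the stored history is wiped so that everything
    is downloaded again.
    """
    conn = sqlite3.connect(db_path)
    conn.execute("CREATE TABLE IF NOT EXISTS visited (url TEXT PRIMARY KEY)")
    if force:
        conn.execute("DELETE FROM visited")
    conn.commit()
    return conn


def mark_visited(conn: sqlite3.Connection, url: str) -> bool:
    """Record url; return True if it was new, False if already recorded."""
    cur = conn.execute(
        "INSERT OR IGNORE INTO visited (url) VALUES (?)", (url,)
    )
    conn.commit()
    return cur.rowcount == 1


# Usage: only fetch a page when mark_visited reports it as new.
conn = open_session(":memory:")
for url in ["https://chungta.vn/", "https://chungta.vn/"]:
    if mark_visited(conn, url):
        print("fetch", url)
    else:
        print("skip", url)
```

The PRIMARY KEY constraint together with INSERT OR IGNORE makes the duplicate check and the insert a single statement, which keeps the check safe when several threads share the same database file through separate connections.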