Thank you Georgios! #52

Open
lexciobotariu opened this issue May 13, 2024 · 4 comments

Comments

@lexciobotariu

I've been running the script with 5k queries for the last 10 hours, and it has got to the point where it is using over 200 GB of RAM; I've set it to use 35 cores.

It scraped over 300k businesses.

I'm just a bit worried that it won't finish the entire list of queries before crashing due to lack of RAM.
Any suggestions on how to continue the scraping once it crashes and pick up from where it left off?

@admbyz

admbyz commented May 18, 2024

Try fewer cores, or split your keywords and run the chunks one after another; I shared a script for that in the closed issue #35. Also make sure you are running the latest version.
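
For reference, a minimal sketch of that chunk-and-run idea in Python (this is not the actual script from #35; the binary name `./google-maps-scraper` and the `-input`/`-results`/`-c` flags are assumptions, so check the project README for the exact options):

```python
#!/usr/bin/env python3
"""Split a large keyword file into chunks and run the scraper on one chunk
at a time, so memory use stays bounded between runs."""
import subprocess
from pathlib import Path

CHUNK_SIZE = 500   # keywords per run; tune to your available RAM
CORES = 8          # fewer cores generally means less memory pressure

keywords = [k.strip() for k in Path("keywords.txt").read_text().splitlines() if k.strip()]

for n, start in enumerate(range(0, len(keywords), CHUNK_SIZE)):
    chunk_file = Path(f"chunk_{n:04d}.txt")
    chunk_file.write_text("\n".join(keywords[start:start + CHUNK_SIZE]) + "\n")

    # One results file per chunk, so a crash never clobbers earlier output.
    subprocess.run(
        ["./google-maps-scraper",
         "-input", str(chunk_file),   # flag names are assumptions -- verify against the README
         "-results", f"results_{n:04d}.csv",
         "-c", str(CORES)],
        check=True,
    )
```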

@gosom
Owner

gosom commented May 20, 2024

@lexciobotariu what was the outcome of this? Did you manage to scrape all your keywords?

@lexciobotariu
Author

Hello there, it did manage to scrape all the information, around 500k results.
@admbyz I have used your suggestion in the past and it was working perfectly.

@admbyz

admbyz commented May 27, 2024

Eh, I misunderstood your problem. What you are asking for seems really hard, though, because I don't think Google returns static results for the requests you make, so to resume, the program would also need to validate the data Google sends back. Skipping already-scraped data is certainly more performant, but in the end the total number of requests will be the same, unless you only check the exact URL and skip the entire result set.
I didn't pay attention to the terminal output, but maybe you can extract the processed queries from there, build a new keyword list (or remove the already-processed entries from your existing one), and, whenever the scraper is not running and the keyword list is not empty, run it again. Before that, though, you need to check whether the scraper appends results to the file after a restart or replaces them; if it doesn't append, you would have to create a new results file on every restart. I don't recommend handling the scraping this way; it's wonky and not reliable.
Running the scraper with fewer cores will be your best bet, I guess.
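
A rough sketch of that keyword-filtering idea, for anyone who wants to try it anyway. It assumes the results CSV records the originating query in a column (called `query` here purely for illustration; check what your output actually contains) and that keywords.txt holds one query per line:

```python
#!/usr/bin/env python3
"""Drop keywords whose queries already appear in the results file, then write
the remainder so the scraper can be restarted on just those."""
import csv
from pathlib import Path

done = set()
results = Path("results.csv")
if results.exists():
    with results.open(newline="", encoding="utf-8") as f:
        for row in csv.DictReader(f):
            # "query" is a placeholder column name -- adjust to your output format
            q = (row.get("query") or "").strip()
            if q:
                done.add(q)

keywords = [k.strip() for k in Path("keywords.txt").read_text().splitlines() if k.strip()]
remaining = [k for k in keywords if k not in done]

Path("keywords.remaining.txt").write_text("\n".join(remaining) + "\n")
print(f"{len(done)} queries already in results, {len(remaining)} keywords left to run")
```

As noted above, whether restarting like this is safe depends on whether the scraper appends to the existing results file or replaces it; if it replaces, point each restart at a fresh results file and merge afterwards.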
