Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

请教怎样控制爬虫延时或者暂停? #1129

Open
Mr-LiuDC opened this issue Sep 23, 2023 · 3 comments
Open

请教怎样控制爬虫延时或者暂停? #1129

Mr-LiuDC opened this issue Sep 23, 2023 · 3 comments

Comments

@Mr-LiuDC
Copy link

例如我在爬取网站时触发了网站的防护机制,当我判断出网站有进行防护时,我该怎样控制爬虫让它过一段时间再抓取?

@18547601391
Copy link

在processor中有个site变量,里面有控制抓取间隔、重试次数

@Mr-LiuDC
Copy link
Author

在processor中有个site变量,里面有控制抓取间隔、重试次数

这是全局的配置,没法对某次的请求进行设置。

@18547601391
Copy link

你是怎样判断出网站有进行防护的?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants