git clone ()
cd Aiohttp_spider
- To use your own IP source (`Ip_source`):
  - If you use the xx source:
    - Edit `Ip_pool_spider.py` and replace the id with your own.
  - If you use another source:
    - Change `get_ip` in `Ip_pool_spider`.
    - After the change, `get_ip` should yield a dict of this shape, where `canUseTime` is the proxy's remaining usable time as an integer number of seconds:
      { "proxies": {"http": Address, "https": Address}, "canUseTime": int }
- Configure the Redis connection in `Main_comments_spider.py`.
  - Create the database in Redis and use the same name in both Redis and `Main_comments_spider.py`.
- Configure the crawl target in `judge()` in `Main_comments_spider.py`.
- Configure the download folder in `Main_comments_spider.py`.
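For the custom IP source step, the dict that `get_ip` is expected to yield can be sketched as follows. This is a minimal illustration, not the project's actual code: `fetch_raw_proxies` is a hypothetical stand-in for whatever call your proxy provider requires, the addresses are made up, and `get_ip` is assumed here to be a plain generator. If the real `get_ip` is a coroutine or async generator, adapt accordingly.

```python
from typing import Dict, Iterator, List, Tuple, Union

ProxyEntry = Dict[str, Union[Dict[str, str], int]]


def fetch_raw_proxies() -> List[Tuple[str, int]]:
    """Hypothetical provider call: returns (address, seconds left) pairs.

    Replace the hard-coded sample data with a real request to your source.
    """
    return [("http://127.0.0.1:8080", 60), ("http://127.0.0.1:8081", 120)]


def get_ip() -> Iterator[ProxyEntry]:
    """Yield one entry per proxy in the shape the spider expects."""
    for address, seconds_left in fetch_raw_proxies():
        yield {
            # The same address is used for both schemes.
            "proxies": {"http": address, "https": address},
            # Remaining usable time of this proxy, in whole seconds.
            "canUseTime": int(seconds_left),
        }
```

Iterating with `for entry in get_ip(): ...` then gives one dict per proxy, each carrying both the `proxies` mapping and its `canUseTime`.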
python Main_comments_spider.py
├─Spider
│ ├─Main_comments_spider.py
│ ├─Ip_pool_spider.py
│ ├─Cookies_spider.py