Skip to content

William-An/DFB_Final

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DFB Final Crawler

This git project is for DFB Data mining competition. It contains a crawler based on Scrapy that collects TV series data from Youku Index and Douban.

DFB_Series_data_crawler

The crawler.
To activate spiders, run scrapy crawl -o OUTPUT -a list=tv.txt micro or scrapy crawl -o OUTPUT -a list=tv.txt macro For more information, Scrapy Docs

Virtualenv

The virtual python environment for this project. It contains all the packages that are used in crawler.