Data analysis of Tsinghua Student Community Management and Service website advice
|--data-preprocessing
| |---crawler.py # a simple Python script for crawling websites
| |---crawler.R # R script for crawling websites into html files
| |---html2csv.R # convert raw HTMLs into sorted data frame in CSV format
An example of jiayuan.csv