Skip to content

Latest commit

 

History

History
14 lines (11 loc) · 426 Bytes

README.md

File metadata and controls

14 lines (11 loc) · 426 Bytes

myhome-analysis

Data analysis of Tsinghua Student Community Management and Service website advice

Data Preprocessing

|--data-preprocessing
| 	|---crawler.py  # a simple Python script for crawling websites
|	|---crawler.R  	# R script for crawling websites into html files
|	|---html2csv.R  # convert raw HTMLs into sorted data frame in CSV format

An example of jiayuan.csv