DataClean

DataClean is a package that collects the functions most heavily used in the process of data cleaning. It is designed for people who work with large data sets. Because the motivation for creating this package came from my own research projects, my goal is to include every function that might be needed in a data analysis study.

The structure of this package is as follows:

  1. A set of functions is designed for data collection from “html”, “xls”, or “xlsx” files. You can also use several functions to automatically pull data from specified websites.

  2. A set of functions is designed for data extraction.