Introduction to using RStudio to understand and analyze big data sets. Importance of collecting and cleaning data. Methodology for understanding big data sets. Understanding the process and the visualization techniques used to make meaningful interpretations.
6. Understanding the Data
List of components – names()
Dimensions of object – dim()
Structure of the data – str()
Header + first 6 observations – head()
Apply a function to every column – sapply()
Summary – summary()
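As a minimal sketch, the functions listed above can be tried on R's built-in `mtcars` data frame. The choice of `mtcars` and the variable names (`df`, `col_names`, and so on) are assumptions for illustration only; substitute whatever data set you have loaded.

```r
# Assumption: mtcars (ships with base R) stands in for your own data set
df <- mtcars

col_names   <- names(df)          # list of components (column names)
dimensions  <- dim(df)            # number of rows, number of columns
str(df)                           # structure: type and first values of each column
first_rows  <- head(df)           # header + first 6 observations
col_classes <- sapply(df, class)  # apply class() to every column
col_means   <- sapply(df, mean)   # apply mean() to every column
data_summary <- summary(df)       # min, quartiles, median, mean, max per column

print(col_names)
print(dimensions)
print(first_rows)
print(col_classes)
print(data_summary)
```

`head()` accepts an `n` argument if you want more or fewer than 6 rows, for example `head(df, n = 10)`. `sapply()` works column by column here because a data frame is a list of columns.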
Jahartig.com 11/11/2015