Big Data Analytics and Applications Final Project, finding out how variables influence the dependent variable.
• Data based on the 2010 Basic questionnaire at NTU, a total of 1030 people.
• Implemented data cleaning and added the feature field on excel.
• Imported the final data set in python and maked exploratory data analysis with the Pandas package and the Matplotlib package.
• Plotted cross-tabulation and produced a predictive model by using Logistic regression.