2. Objective
What are the significant variables influencing the likelihood of diabetes?
Understanding the relationships between pregnancy history, glucose levels, blood pressure,
skin thickness, insulin levels, BMI, diabetes pedigree function, age, and the outcome of
diabetes is crucial for advancing our knowledge of the disease. The analysis aims to provide
valuable insights for healthcare professionals, researchers, and policymakers working
towards effective diabetes prevention and management
Expected Outcomes:
● Identification of key variables strongly associated with diabetes.
● Insights into potential risk factors and their interplay.
3. Dataset
The dataset is taken from Kaggle.
This dataset is originally from the National Institute of Diabetes and Digestive and
Kidney Diseases. The objective is to predict based on diagnostic measurements whether a
patient has diabetes.
4. Data Cleaning
Number of Missing Values After Removing missing values
Removing duplicate values