Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
AIML PPT.pptx
1. AIML INTERNSHIP
DEPARTMENT OF INFORMATION TECHNOLOGY, SRKREC
Presented by:
MATCHA MANOJ KUMAR
3/4 - IT
Regd.No: 20B91A12B6
HENOTIC TECHNOLOGY PRIVATE LIMITED
7th July 2022 to 06th September 2022
2. CONTE NTS :
INTRODUCTION ABOUT DATASET
OBJECTIVE
DATA SCIENCE PROJECT LIFE CYCLE
RESULTS
CONCLUSIONS
REFERENCES
4. INTRODUCTION ABOUT DATASET :
Every year thousands of peoples travelling from one place to another place for any cause. If the
journey is too expensive or too important, they apply for travel insurance. That one usually tends to
overlook while planning the vacation or a trip.
Travel Insurance covers risks during travel such as loss of passport and personal belonging cover, loss
of checked in baggage etc. Having these risks covered ensures an additional layer of protection
against financial loss.
The dataset contains information such as – agency , agency type , distribution channel , product name
, duration , destination , net sales , commission , gender , age and claim(target variable).
Based on the destination , agency type , channel , product name , duration of journey and some
another data based we decide whether the traveler able to claim the insurance or not.
By implementing ML algorithms on the dataset, we can predict the results efficiently and accurately.
This dataset contains 11 columns and 48260 observations, respectively.
5. OBJECTIVE :
The main agenda of this project is:
In back days people cheat insurance companies in different accepts and claim their
insurance to over come this we build a project.
In this project we study the previous data and now we decide whether the traveler can
claim their insurance or not.
We Build an appropriate Machine Learning Model that will help to predict whether
the traveler can claim their insurance or not .
6. Data Science Project Life Cycle :
1. Data Pre-processing
i. Check the Duplicate and low variation data
ii. Identify and address the missing variables
iii. Handling of Outliers
iv. Categorical data and Encoding Techniques
v. Feature Scaling
vi. As this is unbalanced dataset we apply over sampling (RandomOverSampler) techniques to
balance the dataset.
2. Selection of Dependent and Independent variable
In my dataset, target variable is claim.
3. Training Models
8. Results :
The model results in the following order by considering the model Accuracy, F1 score and
ROC_AUC_score.
1) Decision tree Classifier
2) Random forest classifier
3) KNeighbors Classifier