# Health-Insurance-Cross-Sell-EDA-and-Machine-Learning-Modeling
This is a note book of exploratory data analysis on cross selling of health insurance customers on vehicle insurance product and using machine learning to predict whether a customer is interested or not in vehicle insurancen
## Background Information :
- an Insurance company that provide Health Insurance to its customers, usually they offer other insurance product to the customers through diffirent kind of marketing channel. In this case we will build a model to predict whether the policyholders (customers) from past year will also be interested in Vehicle Insurance provided by the company.
## Problem Statement:
- Un optimize customer reachout process, many insurance worker spend a lot of their time having meeting with prospective client without knowing the probabily of that customer to buy the insurance product
## Business Goals:
- Building a model to predict whether a customer would be interested in Vehicle Insurance is extremely helpful for the company because it can then accordingly plan its communication strategy to reach out to those customers and optimise its business model and revenue
## Business Question:
- How does age of a vehicle determing the response of vehicle insurance advertisment
- How to attract customers from different generation
- what's the major factor that make a health insurance customer not intersted with vehicle insurance
- What's the best machine Learning modeling for this Cross Sell case
## Workflow :
- Data Cleaning
- Recategorize Data
- Binning
- Exploratory Data Analysis to Answer business Question
- Feature Engineering & Selection For Machine Learning Process
- Encoding all the categorical features
- Checking correlation between dependent and independet variable
- Feature Selection
- Model Building :
- Splitting data into Training and Testing
- Apllying the **SMOTE** (Synthetic Minority Oversampling Technique) to the minority target since the data is imbalance
- Creating base model of classification algorithm ( Logistic Regression, KNN Classifier, Decision Tree Classifier, Random Forest Clasifier)
- Check The Evaluation matrix for all the base model
- HyperParameter tuning
- Checking Evaluation Matrix for tuned Model
- Choose which model has the best recall score for this case
<details>
<summary><h2> Click to see the conclusion and recommendation </h2></summary>
## Conclusion:
- From this dataset of health insurance customers almost **95%** of customers have a vehicle age that's less than 2 years. from our analysis, customers who has more than 2 years of vehicle age are more interested with vehicle insurance advertisment, while customers who has less then one year of vehicle age, only **4%** of them are actually interesred with vehicle insurance
- We found out that customer who already have vehicle insurance are almost have no interest in another vehicle insurance. Our analysis shows that **99.9% of customers that have a vehicle insurance is not interested in another vehicle insurance**, while customer who doesn't have a vehicle insurance **22.5 %** of them are interested with vehicle insurance
- we also found out that a newer vehicle are more likely to have a vehicle insurance, with vehicle that's less than one year **66% of those are insured** , vehicle that's older than one year but less than 2 years are **33% insured**, while less than **one percent** of vehicle that's older than 2 years are insured. This should explain why customer who owns a newer vehicle are less likely to be intersted with insurance promotion, because they probably alredy have one.
- **Customers who never had vehicle damaged** only 0.5 % of those customers are intersted with vehicle insurance, **87%** of customers who never had any vehicle damaged already have a vehicle insurance
- **Which Customer Generation are less likely to be intersted with vehicle insurance** the answer is **Millenials** (people in age group of 18 - 34) only **6% of millenials are actually interested with vehicle insurance**, and why is so?
1. almost **63% of millenials already have vehicle insurance**, from our analysis before owning vehicle insurance is a major factor why someone is not interested with another vehicle insurance
2. **90% of millenials have a vehicle that's less than one year of age**, and from our analysis before that vehicle that's less than one year are **66% already insured**
This conculed that millenials are more likely to already have a vehicle insurance before our vehicle insurance team approached, and that's a major factor why millenials are least likely to be interested with our vehicle insurance, because they already have one
#### So who's actually interested with our vehicle insurance ?
From the responses there are **12 % of our health insurance customers are interested with the vehicle insurance product** but who are those people?
1. **First, Customer who does not have a a vehicle insurance**, out of all customers who does not have a vehicle insurance **22.5 %** of them says that they're interested with vehicle insurance product
2. **customers who has vehicle that's older than 2 years** our analysis before mentioned that only less than **one percent** of car that's older than 2 years are previouly insured, by not having a vehicle insurance they're more likely to be intersted with our vehicle insurance, our data show's that customer who has car that's more than 2 years are **7 times** more likely to be intersted with vehicle insurance compared to customer who own a vehicle less than one year
3. **customers who have had a vehicle damaged in the past** from our analysis we found out that **97%** customers who actually intersted with vehicle insurance have had their vehicle damaged in the past
1. **95 % of customers who have had vehicle damage in the past still doesn't have a vehicle insurance**
**Which Customer Generation that's most likely to be interested in Vehicle insurance ?**
- **GEN X** (Age Gen X : 35 - 50)
- our analysis shows that GEN X has the highest percentage to be intersteed with vehicle insurance, to be precise, **21 %** of GEN X are interested with vehicle insurance
- This might be because **72% GEN X** does not have a vehicle insurace, and GEN X has the highest percentage of vehicle damager the past **(67%)** among other generation
<b>Machine learning could predict whether a customer would be interested or not towards vehicle insurance product with recall 0.965 out of 1</b>
Using logiistic regression that has been tuned, we focus more on recall instead of accuracy here because of we want to reduce the false negative ( The customer who actually interested but the model predicted that customer is not intrested). Since this kind of problem could lead into lost of potential revenue
## Recommendation
### 1. Work with dealership to capture millenial market
as we know from the analysis that millenials are less likely to be intersted with vehicle insurance because of most of them have a vehicle that's less than one year of age, and vehicle with less then one year of age are most likely to be insured so in conclution they already have one, and so they're not interested. By working together with a dealership that sells a brand new car, we could tackle this problem, our insurance company could have a bundling product of brand new vehicle and a free promotional vehicle insurance for certain period of months. we hope that by working together with vehicle dealership we could target more millenials customers.
### 2. Target & Educate Customers Who had Vehicle Damage in the past
**95%** customers who have had a vehicle damaged in the past still does not have a vehicle insurance this is a gold mine for our vehicle insurance, since customers are more likely to be interested in vehicle insurance if they've a vehicle damage in th
小夕Coding
- 粉丝: 6347
- 资源: 527
最新资源
- 深度强化学习足球机器人行为控制方法解析源码.zip
- 设备控制与数据采集系统(物联网项目,含源码+项目说明+设计资料+产品溯源功能).zip
- MATLAB实现DBN-SVM深度置信网络结合支持向量机多输入分类预测(含模型描述及示例代码)
- 深度学习人体3D姿态实时检测与未来动作预测python源码+设计报告.zip
- 深度学习光纤传感水声识别源码+技术文档.zip
- 深度学习辅助的区块链智能合约安全检测系统(源码+设计文档).zip
- 深度学习舌苔检测系统(含开题报告+论文).zip
- 深度学习声纹识别系统-基于TensorFlow架构(含源码+项目说明+全部资料).zip
- 深度学习书籍推荐系统-个性化推荐引擎(基于用户行为分析,含源码+项目说明+实验报告).zip
- 《基于区块链的宠物收养平台》(毕业设计,源码,教程)简单部署即可运行。功能完善、操作简单,适合毕设或课程设计.zip
- 《基于区块链的宠物店运营程序设计》(毕业设计,源码,教程)简单部署即可运行。功能完善、操作简单,适合毕设或课程设计.zip
- 《基于 Algorand 区块链的制药供应链系统》(毕业设计,源码,教程)简单部署即可运行。功能完善、操作简单,适合毕设或课程设计.zip
- 《基于区块链的电子健康记录存储和检索系统》(毕业设计,源码,教程)简单部署即可运行。功能完善、操作简单,适合毕设或课程设计.zip
- 《基于区块链的电子病历 (EMR) 系统》(毕业设计,源码,教程)简单部署即可运行。功能完善、操作简单,适合毕设或课程设计.zip
- 《基于区块链的电子投票系统》(毕业设计,源码,教程)简单部署即可运行。功能完善、操作简单,适合毕设或课程设计.zip
- 《基于区块链的电子商务保修系统》(毕业设计,源码,教程)简单部署即可运行。功能完善、操作简单,适合毕设或课程设计.zip
资源上传下载、课程学习等过程中有任何疑问或建议,欢迎提出宝贵意见哦~我们会及时处理!
点击此处反馈