Insurance company especially health insurance company need to analyze how a customer is disease prone. We will try to find the risk based on the customers data.
If the Health_Risk can be estimated accurately then insurance cost can be calculated using this formula.
Health Insurance Cost = Base Cost + α * Health Risk
Data Source: National Center for Health Statistics
Total records : 1.04 M Has disease and non disease records. Disease related records : 0.56 M No. of feature: 34
Used Box-Whisker plot to detect outlier.
We applied K-Nearest Neighbour, Random Forest, Neural Network and Support Vector Machine algorithms to classify the different health risk.