This is the legendary Titanic ML kaggle competition
Create a model that predicts the probability of survival of passengers in a shipwreck.
Selection of classification algorithms:
- Logistic Regression
- Gaussian Naive Bayes
- K-nearest Neighbors
- Linear Support Vector Machine
- Random Forest
Random Forest > Multiple Random Decision Trees
Chances to survive:
~80% accuracy
~71% f1-score
~82% ROC-AUC score
The model is quite good in predicting the probability of survival of the passengers but there is still room for improvement.