- Kaggles Titanic Dataset (Python)
- Kaggles Heart Disease Dataset (R)
- See RandomForestTuningExample.ipynb
- In this file, I do an iteration of randomsearch tuning to find a subset of hyperparameter values to tune more thoroughly in suubsequent gridsearch.
- Some basic figure generation using seaborn.
- Some basic object-oriented programming (OOP).
- Look into imputing missing values
- consider using xgboost instead of a random forest