MSc thesis project, working on imbalanced classification on large data sets in different time points.
Using data from CAMH and Environics Analytics to identify the segments who are facing a decline in mental health during COVID-19
Using tools to create a interactive and management-friendly report
Finding the best segmentation model among customers in a survey conducted by an airline
Determined the key drivers of return to airline for past flyers based on a survey and developed a predictive model
Using R to clean and prepare a raw survey data for the analytics team
Using Ensemble Decision Trees, Logistic Regression and etc. in R to Predict Life Expectancy based on Public Health Factors