This repository contains the final code, analyses, and supplementary documentation for my MSc Data Science dissertation, titled: "Remote Work on Mental Health"
This project explores the psychological and productivity-related effects of remote and hybrid work arrangements, using both primary survey data and publicly available secondary datasets.
The analysis includes:
- Data cleaning and preprocessing
- Descriptive statistics
- ANOVA testing
- OLS Regression (Linear)
- Binary Logistic Regression
- Decision-Tree Classifier
- Random Forest Classifier
- Correlation matrix generation
- Linear regression modeling
- Discussion of implications
- Python (Pandas, NumPy, Statsmodels, Seaborn, Matplotlib)
- Jupyter Notebooks
- Git/GitHub for version control
- Personally identifiable information (PII) has been removed or anonymized from the datasets.
- The public version of this repository excludes any sensitive raw data.
The project was originally tracked privately over the course of several months via a private GitHub repository. This public version contains the cleaned and documented final deliverables.
This work was supervised by Dr. Vinayak Deshpande and Prof. Babita Kashroo, and I’d like to express my gratitude for their support throughout the project.