Skip to content

Sreeharipavithran/DevTown-Data_Science_Training-Project

Repository files navigation

📊 DevTown Data Science Training Projects

Python Jupyter scikit-learn Status

A collection of end-to-end Data Science projects completed during DevTown training — covering classification, sentiment analysis, and real-world datasets.


🗂️ Projects

✈️ 1. Airline Customer Satisfaction Classification

Dataset: Invistico Airlines (Invisitico_Airline.csv)

Objective: Predict whether a customer is satisfied or dissatisfied with airline service based on various features.

Key Steps:

  • Exploratory Data Analysis (EDA) on passenger demographics & flight data
  • Data cleaning and preprocessing (handling nulls, encoding)
  • Built and evaluated classification models
  • Feature importance analysis

Tech Stack: Python Pandas Scikit-learn Matplotlib Seaborn


🎬 2. Movie Review Sentiment Analysis — Naive Bayes

Objective: Classify movie reviews as positive or negative using Natural Language Processing.

Key Steps:

  • Text preprocessing (tokenization, stopword removal)
  • Feature extraction using Bag of Words / TF-IDF
  • Naive Bayes classifier training and evaluation
  • Accuracy and confusion matrix analysis

Tech Stack: Python NLTK Scikit-learn Pandas


🛠️ How to Run

# Clone the repo
git clone https://github.com/Sreeharipavithran/DevTown-Data_Science_Training-Project.git

# Install dependencies
pip install -r requirements.txt

# Open notebooks
jupyter notebook

📁 Repository Structure

├── Invistico-Airline Customer Satisfaction Classification ML Model Using RandomForest.ipynb
├── Invisitico_Airline.csv
├── Movie Review Sentimental Analysis - Naive Bayes.ipynb
└── README.md

📬 Connect with Me

LinkedIn GitHub

Releases

No releases published

Packages

 
 
 

Contributors