This project predicts whether it will rain tomorrow in Australia using historical weather data and different machine learning algorithms.
- Dataset: Rain in Australia dataset
- Goal: Predict the binary target RainTomorrow (
YesorNo) - Techniques: Data preprocessing, feature engineering, model training, evaluation
- Tools: Python, Pandas, NumPy, Scikit-learn, Matplotlib, Seaborn
-
Data Loading & Exploration
- Handle missing values
- Summary statistics & distributions
- Visualizations of weather patterns
-
Data Preprocessing
- Encode categorical variables
- Feature scaling (MinMax/Standard Scaler)
- Train-test split
-
Modeling
Algorithms used:- Logistic Regression
- K- Nearest Neighbor
- Support Vector Machine
- Decision Tree
-
Evaluation Metrics (Decision Tree)
- Accuracy = 89.04 %
- Precision, Recall, F1-score
- ROC-AUC Curve = 0.8908
-
Results & Insights
- Best performing model and its metrics
- Feature importance analysis
- Logistic Regression: (add results)
- Random Forest: (add results)
- XGBoost: (add results)
- Hyperparameter tuning with GridSearchCV/RandomizedSearchCV
- Try deep learning models (LSTM for time-based weather data)
- Deploy with Streamlit/Flask
- Clone this repository:
git clone https://github.com/your-username/Rain-Prediction-Australia-using-ML-algorithms.git cd Rain-Prediction-Australia-using-ML-algorithms