OptimizationMethods-MLP

🧠 A Multi-Layer Perceptron from scratch using NumPy for Age Regression

Final Project for Optimization Methods for Data Science

Author: Milad Torabi | Date: December 16, 2024

This repository contains the final project for the Optimization Methods for Data Science course. It builds and trains a Multi-Layer Perceptron (MLP) that predicts age from a given set of features using only NumPy, without any library built for neural network models (e.g., PyTorch, TensorFlow, sklearn.neural_network). The project demonstrates L-BFGS-B optimization, efficient weight initialization, and hyperparameter tuning to select the best-performing configuration.

📄Report

The complete project report is available for detailed reference: 📘 Report_MLP.pdf

📚Table of Contents

1-Problem Description
2-Data Preprocessing
3-Optimization Goal and Strategy
4-Neural Network Architecture and Training Details
5-Hyperparameter Tuning
6-Optimization Details
7-Final Model Training and Results
8-Conclusion

📋 Key Highlights

Objective: Age prediction using a Multi-Layer Perceptron (MLP).
Dataset: Custom AGE_REGRESSION.csv with 32 input features and 1 target (age).
Loss Function: Mean Squared Error (MSE) with L2 Regularization.
Architecture: 32 → 128 → 64 → 1 (fully connected layers).
Optimization: L-BFGS-B with gradient clipping and pre-activation clipping.
Best Hyperparameters:
- Hidden Layers: [128, 64]
- Activation Function: Tanh
- Regularization (λ): 0.01
- Initialization: He
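
The loss listed above (MSE with L2 regularization) could be written as the minimal sketch below. The function name `regularized_mse`, the choice to penalize only the weight matrices, and the absence of a 1/2 scaling factor are assumptions made for illustration; the report may use a different convention.

```python
import numpy as np

def regularized_mse(y_pred, y_true, weights, lam=0.01):
    """Mean squared error plus an L2 penalty on the weight matrices.

    Assumption: biases are not penalized and no 1/2 factor is applied.
    """
    y_pred = np.asarray(y_pred, dtype=float)
    y_true = np.asarray(y_true, dtype=float)
    mse = np.mean((y_pred - y_true) ** 2)
    l2_penalty = lam * sum(np.sum(W ** 2) for W in weights)
    return mse + l2_penalty
```

With λ = 0.01 (the value reported as best), the penalty term is small relative to the MSE unless the weights grow large.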

📚 Project Structure

📂 Project Root
├── 📘 Report_MLP.pdf
├── 📄 README.md (This File)
├── 📊 AGE_REGRESSION.csv (Dataset)
├── 📜 Functions_MLP.py (Functions and Utility Codes)
└── 📒 run_MLP.ipynb (Jupyter Notebook for Training and Running the Model)

📈 Performance

| Metric | Train | Validation | Test |
|---|---|---|---|
| Initial Loss | 1681.80 | - | - |
| Final Loss | 99.07 | - | 91.31 |
| Initial MAPE (%) | 99.70 | - | - |
| Final MAPE (%) | 22.93 | 23.06 | 23.56 |
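
For reference, MAPE (mean absolute percentage error) can be computed as in the sketch below. The helper name `mape` is hypothetical, and the code assumes every target age is non-zero; it is not taken from `Functions_MLP.py`.

```python
import numpy as np

def mape(y_true, y_pred):
    """Mean absolute percentage error, in percent.

    Assumption: y_true contains no zeros, otherwise the ratio is undefined.
    """
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    return 100.0 * np.mean(np.abs((y_true - y_pred) / y_true))
```

For example, predicting 25 and 30 for true ages 20 and 40 gives relative errors of 0.25 each, so the MAPE is 25.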

📚 Methodology

Data Preprocessing:
- Features were Z-score normalized for better convergence.
- The dataset was split into features (X) and target (age).
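
Z-score normalization of the features could look like the following sketch. The function names and the synthetic data are illustrative assumptions; whether the statistics were computed on the training split only is not stated here, so the sketch keeps fitting and applying as separate steps.

```python
import numpy as np

def zscore_fit(X):
    """Return the per-feature mean and standard deviation of X."""
    mean = X.mean(axis=0)
    std = X.std(axis=0)
    std[std == 0] = 1.0  # guard against constant columns
    return mean, std

def zscore_apply(X, mean, std):
    """Standardize X using previously computed statistics."""
    return (X - mean) / std

# Synthetic stand-in for the 32 input features (not the real dataset).
rng = np.random.default_rng(0)
X = rng.normal(loc=5.0, scale=3.0, size=(100, 32))
mean, std = zscore_fit(X)
X_norm = zscore_apply(X, mean, std)
```

Keeping `mean` and `std` around lets the same transformation be reused on validation and test data.
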
MLP Architecture:
- 2 Hidden Layers: 128, 64 Neurons (Tanh Activation)
- Custom Backpropagation Implementation
- Weight Initialization: He for Tanh activation
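
The architecture above can be sketched in plain NumPy as follows. This is an illustration under stated assumptions, not the code in `Functions_MLP.py`: the function names, the linear output layer, zero-initialized biases, and the unscaled MSE plus `lam * sum(W**2)` loss are all choices made for the example.

```python
import numpy as np

def he_init(layer_sizes, seed=0):
    """He initialization: W ~ N(0, 2 / fan_in), biases set to zero."""
    rng = np.random.default_rng(seed)
    params = []
    for n_in, n_out in zip(layer_sizes[:-1], layer_sizes[1:]):
        W = rng.normal(0.0, np.sqrt(2.0 / n_in), size=(n_in, n_out))
        params.append((W, np.zeros(n_out)))
    return params

def forward(X, params):
    """Return predictions and the activations of every layer (input included)."""
    activations = [X]
    a = X
    for i, (W, b) in enumerate(params):
        z = a @ W + b
        # Tanh on hidden layers, linear output for regression (assumption).
        a = z if i == len(params) - 1 else np.tanh(z)
        activations.append(a)
    return a.ravel(), activations

def backward(y, params, activations, lam=0.01):
    """Gradients of mean((y_hat - y)**2) + lam * sum(W**2) for each layer."""
    n = y.shape[0]
    y_hat = activations[-1].ravel()
    delta = (2.0 / n) * (y_hat - y).reshape(-1, 1)  # dLoss/dz at the output
    grads = []
    for i in reversed(range(len(params))):
        W, _ = params[i]
        a_prev = activations[i]
        grads.append((a_prev.T @ delta + 2.0 * lam * W, delta.sum(axis=0)))
        if i > 0:
            # tanh'(z) = 1 - tanh(z)**2, and a_prev already stores tanh(z).
            delta = (delta @ W.T) * (1.0 - a_prev ** 2)
    return grads[::-1]

params = he_init([32, 128, 64, 1])
```

A finite-difference check on a few weights is a cheap way to validate a hand-written `backward` before handing its gradients to an optimizer.
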
Optimization:
- L-BFGS-B: Training loss decreased from 1681.80 to 99.07; the final test loss is 91.31.
- Stopping Criteria: 200 iterations, tolerance = $10^{-4}$.
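
One common way to run L-BFGS-B with these stopping criteria is `scipy.optimize.minimize`. The sketch below assumes SciPy is the optimizer backend and maps the tolerance to the `ftol` option; both are assumptions. To stay short it uses a ridge-regression objective as a stand-in for the MLP loss, and the gradient clipping bound `clip=1e3` is a hypothetical value.

```python
import numpy as np
from scipy.optimize import minimize

def loss_and_grad(theta, X, y, lam, clip):
    """Ridge-regression stand-in: returns (loss, element-wise clipped gradient)."""
    residual = X @ theta - y
    loss = np.mean(residual ** 2) + lam * np.sum(theta ** 2)
    grad = (2.0 / len(y)) * (X.T @ residual) + 2.0 * lam * theta
    return loss, np.clip(grad, -clip, clip)

# Synthetic data, not the project dataset.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 32))
y = X @ rng.normal(size=32) + 0.1 * rng.normal(size=200)

theta0 = np.zeros(32)
result = minimize(
    loss_and_grad,
    theta0,
    args=(X, y, 0.01, 1e3),
    method="L-BFGS-B",
    jac=True,  # the objective returns (loss, gradient) together
    options={"maxiter": 200, "ftol": 1e-4},
)
print(result.fun, result.nit)
```

For an MLP, the same pattern applies once all weight matrices and biases are flattened into a single 1-D vector, since `minimize` expects `x0` and the gradient to share that flat shape.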

📧 Contact

For any questions or suggestions, feel free to reach out: Milad Torabi (miladtorabi65@gmail.com)
