MNIST & Fashion-MNIST Machine Learning with CUDA

A C++ implementation of a neural network for MNIST digit classification and Fashion-MNIST clothing classification using CUDA for accelerated backpropagation.

Features

CUDA-accelerated backpropagation: All gradient computations run on GPU
Multiple datasets: Supports both MNIST and Fashion-MNIST datasets
Flexible data loading: Python script supports both PyTorch and TensorFlow datasets
MAT file format: Data stored in .mat format for easy C++ consumption
Multi-layer neural network: Configurable architecture with ReLU activations and softmax output

Requirements

C++ Dependencies

CUDA Toolkit (version 10.0 or higher)
CMake (version 3.10 or higher)
MatIO library (libmatio-dev on Ubuntu/Debian)
C++17 compatible compiler (GCC 7+ or Clang 5+)

Python Dependencies (for data download)

Python 3.6+
NumPy
SciPy
PyTorch OR TensorFlow (at least one)

Installation

1. Install System Dependencies

Ubuntu/Debian:

sudo apt-get update
sudo apt-get install build-essential cmake libmatio-dev

Other Linux distributions: Install equivalent packages for your distribution.

2. Install CUDA

Follow NVIDIA's official CUDA installation guide for your system: https://developer.nvidia.com/cuda-downloads

3. Install Python Dependencies

pip install numpy scipy torch torchvision
# OR
pip install numpy scipy tensorflow

Building the Project

Download dataset data:

For MNIST:

python download_mnist.py --dataset mnist

For Fashion-MNIST:

python download_mnist.py --dataset fashion

Or download both:

python download_mnist.py --dataset mnist
python download_mnist.py --dataset fashion

This will create a data/ directory with:

mnist_train.mat and mnist_test.mat (for MNIST)
fashion_mnist_train.mat and fashion_mnist_test.mat (for Fashion-MNIST)

You can also specify the source (PyTorch or TensorFlow) and output directory:

python download_mnist.py --dataset fashion --source torch --output-dir data

Build the C++ project:

mkdir build
cd build
cmake ..
make

Run training:

For MNIST:

./MNIST_ML_CUDA ../data/mnist_train.mat ../data/mnist_test.mat [epochs] [learning_rate] [batch_size]

For Fashion-MNIST:

./MNIST_ML_CUDA ../data/fashion_mnist_train.mat ../data/fashion_mnist_test.mat [epochs] [learning_rate] [batch_size]

Examples:

# Train on MNIST
./MNIST_ML_CUDA ../data/mnist_train.mat ../data/mnist_test.mat 10 0.01 32

# Train on Fashion-MNIST
./MNIST_ML_CUDA ../data/fashion_mnist_train.mat ../data/fashion_mnist_test.mat 10 0.01 32

Test predictions:

For MNIST:

./test_predictions ../data/mnist_test.mat mnist_model.bin

For Fashion-MNIST:

./test_predictions ../data/fashion_mnist_test.mat fashion_mnist_model.bin

Project Structure

ml_cpp/
├── CMakeLists.txt          # Build configuration
├── README.md               # This file
├── download_mnist.py       # Python script to download and convert MNIST/Fashion-MNIST
├── include/                # Header files
│   ├── neural_network.h    # Neural network class definition
│   ├── mat_reader.h        # MAT file reader
│   └── cuda_kernels.h      # CUDA kernel declarations
└── src/                    # Source files
    ├── main.cpp            # Main training loop
    ├── neural_network.cpp  # Neural network implementation
    ├── mat_reader.cpp      # MAT file reader implementation
    └── cuda_kernels.cu     # CUDA kernel implementations

Architecture

The default neural network architecture is:

Input layer: 784 neurons (28×28 images for both MNIST and Fashion-MNIST)
Hidden layer 1: 128 neurons with ReLU activation
Hidden layer 2: 64 neurons with ReLU activation
Output layer: 10 neurons with softmax activation (one per class)
- For MNIST: 10 digit classes (0-9)
- For Fashion-MNIST: 10 clothing classes (T-shirt/top, Trouser, Pullover, Dress, Coat, Sandal, Shirt, Sneaker, Bag, Ankle boot)

You can modify the architecture in src/main.cpp by changing the layer_sizes vector.

Fashion-MNIST Classes

Fashion-MNIST has 10 classes: 0. T-shirt/top

Trouser
Pullover
Dress
Coat
Sandal
Shirt
Sneaker
Bag
Ankle boot

CUDA Implementation

The following operations are accelerated on GPU:

Matrix multiplications (forward pass)
Activation functions (ReLU, softmax)
Gradient computations (backward pass)
Weight and bias updates

All CUDA kernels are implemented in src/cuda_kernels.cu.

Performance Notes

The current implementation processes samples one at a time for simplicity
For better performance, consider implementing true batch processing on GPU
Adjust CUDA_NVCC_FLAGS in CMakeLists.txt for your GPU architecture (sm_75 = compute capability 7.5)

Troubleshooting

CUDA not found

Ensure CUDA is installed and nvcc is in your PATH
Set CUDA_PATH environment variable if needed

MatIO library not found

Install libmatio-dev package
Or build from source: https://github.com/tbeu/matio

Out of memory errors

Reduce batch size
Use a smaller network architecture

License

This project is provided as-is for educational purposes.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
include		include
src		src
tests		tests
.gitignore		.gitignore
BACKPROP_EXPLANATION.md		BACKPROP_EXPLANATION.md
CMakeLists.txt		CMakeLists.txt
README.md		README.md
download_mnist.py		download_mnist.py
requirements.txt		requirements.txt
test_predictions.cpp		test_predictions.cpp

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MNIST & Fashion-MNIST Machine Learning with CUDA

Features

Requirements

C++ Dependencies

Python Dependencies (for data download)

Installation

1. Install System Dependencies

2. Install CUDA

3. Install Python Dependencies

Building the Project

Project Structure

Architecture

Fashion-MNIST Classes

CUDA Implementation

Performance Notes

Troubleshooting

CUDA not found

MatIO library not found

Out of memory errors

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

MNIST & Fashion-MNIST Machine Learning with CUDA

Features

Requirements

C++ Dependencies

Python Dependencies (for data download)

Installation

1. Install System Dependencies

2. Install CUDA

3. Install Python Dependencies

Building the Project

Project Structure

Architecture

Fashion-MNIST Classes

CUDA Implementation

Performance Notes

Troubleshooting

CUDA not found

MatIO library not found

Out of memory errors

License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages