Practical Machine Learning: Assisting Visually Impaired People

This project uses the YOLOv5 model, optimized for edge devices, to assist visually impaired people in everyday scenarios through AI. It identifies common objects in a visually impaired person's daily walk using a select set of 17 classes, making it easier for them to navigate their surroundings.

Prerequisites

You should have the following installed:

Python and pip
PyTorch

Data Collection and Training

The YOLOv5 model comes with a variety of 80 classes. For our project, we only needed 17 specific classes. Not all of these classes were part of the default YOLOv5 model, so we collected new labeled images for these classes from the Open Images Dataset V6.

We used the OIDv4_ToolKit to convert the labels to a format suitable for our use.

Training

First, download the YOLOv5 Repo from ultralytics and install the requirements using the requirements.txt file.

Create or alter an existing Dataset.yaml file to specify the classes and their sample locations. We used the pretrained weights of the small YOLOv5 model for our training.

Use the following command for training:

python train.py --img 640 --batch 16 --epochs 300 --data yourDataset.yaml --weights yolov5s.pt

For more details on our training process and results, refer to the provided correlation matrix and learning curve graphics.

App and Model Deployment

Our app is based on the official PyTorch Object Detection Demo App. This app includes the necessary post-processing for the YOLO-style object detector to deliver final results, such as non-maximum-suppression (NMS), which is required to purge proposals with lower confidence scores from the result set as long as they overlap ones with higher scores to a certain threshold. To determine this threshold, the intersection over union (IoU) has to be computed. More details can be found here.

We added functionality to estimate distances, which requires knowledge of the perceived focal length of the device. This parameter is hardware-dependent and was determined during development. For a wider range of devices, a comprehensive list or a calibration activity would be necessary.

Before deploying the trained model to a mobile device, convert it to a format optimized for the PyTorch mobile lite interpreter. Place the converted model in the assets folder of the Android Studio project and load it with the LiteModuleLoader from the PyTorch library.

Contact

For more information or any queries, feel free to reach out to me at hello@mustafayasin.com.

License

This project is licensed under MIT license.

Name		Name	Last commit message	Last commit date
Latest commit History 44 Commits
.idea		.idea
abgabe		abgabe
app		app
gradle/wrapper		gradle/wrapper
.gitignore		.gitignore
Klassen und Hoehen.txt		Klassen und Hoehen.txt
README.md		README.md
Training_YOLOv5_approach_2.ipynb		Training_YOLOv5_approach_2.ipynb
build.gradle		build.gradle
convert_annotations.py		convert_annotations.py
correlation.png		correlation.png
gradle.properties		gradle.properties
gradlew		gradlew
gradlew.bat		gradlew.bat
label.png		label.png
learning_curves.png		learning_curves.png
prediction.png		prediction.png
settings.gradle		settings.gradle
torch-to-mobile.py		torch-to-mobile.py
train_freezed_backbone.py		train_freezed_backbone.py
train_freezed_full.py		train_freezed_full.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Practical Machine Learning: Assisting Visually Impaired People

Prerequisites

Data Collection and Training

Training

App and Model Deployment

Contact

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Practical Machine Learning: Assisting Visually Impaired People

Prerequisites

Data Collection and Training

Training

App and Model Deployment

Contact

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages