Comparative analysis of the Moonshine and Faster-Whisper Tiny transcription models, focusing on latency and Word Error Rate (WER)
-
Updated
Oct 27, 2024 - Jupyter Notebook
Comparative analysis of the Moonshine and Faster-Whisper Tiny transcription models, focusing on latency and Word Error Rate (WER)
Correcting the noisy OCR output on Bangla language using seq2seq Model
his repository contains an Automatic Speech Recognition (ASR) system for Amharic built by fine-tuning Facebook’s Wav2Vec2.0 model using Hugging Face Transformers. The goal is to provide an open-source Amharic speech-to-text model, making it easier for developers and researchers to work with Amharic audio data.
Python tool to evaluate Automatic Speech Recognition (ASR) outputs by calculating Word Error Rate (WER) for multiple models. Ideal for NLP research, ASR benchmarking, and improving transcription accuracy.
Add a description, image, and links to the jiwer topic page so that developers can more easily learn about it.
To associate your repository with the jiwer topic, visit your repo's landing page and select "manage topics."