I am a machine learning engineer with over four years of experience in software development.
I currently work at EPR Labs, where I develop software and data pipelines for training, evaluating, and deploying predictive and generative ML models in Python.
Previously, as part of the PRODIS project, I developed the first phoneme-level GPT model for Polish, along with CI pipelines for survey processing, GUI QA tools, a batch ASR wrapper, and a web interface for data collection.
ML & Data
Testing & Deployment
Highlights include:
| Name | Stack | Type | Description |
|---|---|---|---|
| Phoneme-level GPT pipeline | Python, PyTorch, NumPy, Pandas | CLI tool | Pipeline for training a phoneme-level GPT model to predict surprisal in Polish. Custom IPA tokenizer, parallelized formant extraction, and automatic alignment + stress annotation. |
| vroom | C++20, SFML3, ImGui | Game | 2D racing game featuring arcade drift physics, procedurally-generated tracks, and waypoint-based AI. |
| Bulk automatic speech recognition | Python, Whisper, FFmpeg | CLI tool | Pipeline for bulk automatic speech recognition (ASR) using OpenAI Whisper. Also performs stereo-to-mono conversion using FFmpeg. |
| header-warden | C++17 | CLI tool | Multithreaded static analysis tool that reports missing standard library headers in C++ code. |
| aegyo | C++20, SFML3 | Desktop app | GUI app for learning Korean Hangul. |
Full list: ryouze.net/projects
The unlinked projects belong to the science project and remain private.

