Skip to content
View ryouze's full-sized avatar

Block or report ryouze

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ryouze/README.md

About Me

I am a machine learning engineer with over four years of experience in software development.

I currently work at EPR Labs, where I develop software and data pipelines for training, evaluating, and deploying predictive and generative ML models in Python.

Previously, as part of the PRODIS project, I developed the first phoneme-level GPT model for Polish, along with CI pipelines for survey processing, GUI QA tools, a batch ASR wrapper, and a web interface for data collection.

Tech Stack

ML & Data

PyTorch Scikit-learn NumPy Pandas Streamlit

Testing & Deployment

Pytest GitHub Actions Docker

Projects

Highlights include:

Name Stack Type Description
Phoneme-level GPT pipeline Python, PyTorch, NumPy, Pandas CLI tool Pipeline for training a phoneme-level GPT model to predict surprisal in Polish. Custom IPA tokenizer, parallelized formant extraction, and automatic alignment + stress annotation.
vroom C++20, SFML3, ImGui Game 2D racing game featuring arcade drift physics, procedurally-generated tracks, and waypoint-based AI.
Bulk automatic speech recognition Python, Whisper, FFmpeg CLI tool Pipeline for bulk automatic speech recognition (ASR) using OpenAI Whisper. Also performs stereo-to-mono conversion using FFmpeg.
header-warden C++17 CLI tool Multithreaded static analysis tool that reports missing standard library headers in C++ code.
aegyo C++20, SFML3 Desktop app GUI app for learning Korean Hangul.

Full list: ryouze.net/projects

The unlinked projects belong to the science project and remain private.

Pinned Loading

  1. vroom vroom Public archive

    Cross-platform 2D racing game with arcade drift physics, procedurally-generated tracks, and waypoint AI.

    C++ 3 2

  2. home-server home-server Public

    Documentation repository for my home server, covering general Linux setup and Docker services.

  3. aegyo aegyo Public archive

    Cross-platform GUI app for learning Korean Hangul.

    C++ 1

  4. header-warden header-warden Public archive

    Cross-platform multithreaded CLI tool that identifies and reports missing standard library headers in C++ code.

    C++ 1

  5. ungpt ungpt Public archive

    Cross-platform GUI app that converts ChatGPT's smart punctuation and symbols to plain ASCII.

    C++

  6. asset-packer asset-packer Public archive

    *nix CLI tool for embedding assets (e.g., images, sounds, fonts) into C++ headers.

    C