Skip to content

feat: add Windows CUDA GPU model support with NSIS installer#1

Merged
gcmullen merged 3 commits into
mainfrom
feat/windows-cuda-gpu-models
Feb 7, 2026
Merged

feat: add Windows CUDA GPU model support with NSIS installer#1
gcmullen merged 3 commits into
mainfrom
feat/windows-cuda-gpu-models

Conversation

@gcmullen

@gcmullen gcmullen commented Feb 7, 2026

Copy link
Copy Markdown
Owner

Add GPU-accelerated Parakeet model support on Windows via NVIDIA CUDA, with runtime detection, bundled ONNX Runtime DLLs, an NSIS installer with prerequisite checks, and a rewritten model management system.

Model management:

  • Overhaul model.rs with CUDA detection (CUDA_PATH/CUDNN_PATH env vars)
  • Add HuggingFace multi-file download support for GPU model variants
  • Add Whisper Small/Medium/Turbo/Large, Moonshine Base, and Parakeet V2/V3 in both CPU (INT8) and GPU (FP32) variants
  • GPU models only visible when CUDA + cuDNN are detected at runtime

Build and installer:

  • Add NSIS installer hooks checking for CUDA/cuDNN prerequisites
  • Add build.rs logic to copy ONNX Runtime DLLs for Windows builds
  • Add CI step to download ONNX Runtime GPU DLLs during workflow
  • Bundle onnxruntime*.dll in resources/gpu-deps (DLLs gitignored)
  • Change Windows build target to NSIS-only

Runtime:

  • Setup GPU DLL PATH in lib.rs for ONNX Runtime + CUDA discovery
  • Update transcription manager for CUDA-aware engine loading

Dependency changes:

  • Update transcribe-rs to use git fork with CUDA support
  • Dependency bumps (tauri, tauri-apps packages)
  • Frontend updates for new model variants

Also includes: documentation and changelog updates.

Cleanup:

  • Remove upstream-specific GitHub templates (FUNDING, issue/PR templates)

gcmullen and others added 3 commits February 6, 2026 20:17
Add GPU-accelerated Parakeet model support on Windows via NVIDIA CUDA,
with runtime detection, bundled ONNX Runtime DLLs, an NSIS installer
with prerequisite checks, and a rewritten model management system.

Model management:
- Overhaul model.rs with CUDA detection (CUDA_PATH/CUDNN_PATH env vars)
- Add HuggingFace multi-file download support for GPU model variants
- Add Whisper Small/Medium/Turbo/Large, Moonshine Base, and Parakeet
  V2/V3 in both CPU (INT8) and GPU (FP32) variants
- GPU models only visible when CUDA + cuDNN are detected at runtime

Build and installer:
- Add NSIS installer hooks checking for CUDA/cuDNN prerequisites
- Add build.rs logic to copy ONNX Runtime DLLs for Windows builds
- Add CI step to download ONNX Runtime GPU DLLs during workflow
- Bundle onnxruntime*.dll in resources/gpu-deps (DLLs gitignored)
- Change Windows build target to NSIS-only

Runtime:
- Setup GPU DLL PATH in lib.rs for ONNX Runtime + CUDA discovery
- Update transcription manager for CUDA-aware engine loading

Dependency changes:
- Update transcribe-rs to use git fork with CUDA support
- Dependency bumps (tauri, tauri-apps packages)
- Frontend updates for new model variants

Also includes: documentation and changelog updates.

Cleanup:
- Remove upstream-specific GitHub templates (FUNDING, issue/PR templates)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Resolves conflicts in:
- .gitignore (kept both .onnxruntime/ and Playwright dirs)
- package.json (kept Playwright test + newer @tauri-apps/cli@2.10.0)
- bun.lock (regenerated after package.json merge)
@gcmullen gcmullen merged commit 1607a8e into main Feb 7, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant