A Python-based CLI tool for captioning images with WD series, Joy-caption-pre-alpha, Meta Llama 3.2 Vision Instruct, and Qwen2 VL Instruct models.
Updated Oct 30, 2025 - Python
A Python-based CLI tool for tagging images with joy-caption-pre-alpha models.
CaptionForge generates stronger local dataset image captions by combining multiple raw caption witnesses, extracting structured claims, and synthesizing a final auditable caption from accepted semantic evidence.