Ultra-low bitrate speech codec (0.27-1 kbps) with cross-modal alignment and real-time capabilities
-
Updated
Aug 27, 2025 - Python
Ultra-low bitrate speech codec (0.27-1 kbps) with cross-modal alignment and real-time capabilities
[FG 2024] Finite Scalar Quantization as Facial Tokenizer for Dyadic Reaction Generation - Winning Solution in REACT@FG24 Challenge
Track how discrete representations evolve during neural network training — lifecycle events, phase transitions, ontology discovery, and causal verification
Add a description, image, and links to the fsq topic page so that developers can more easily learn about it.
To associate your repository with the fsq topic, visit your repo's landing page and select "manage topics."