MM-Speech

Welcome to MM-Speech 👋

Our lab is committed to cutting-edge research in speech generation, spoken dialogue systems, and spatial audio generation. We strive to develop intelligent, natural, and immersive audio technologies that advance human–machine interaction and multimedia experiences.

Popular repositories

  1. VoxMind (Public)

    [ACL 2026] VoxMind: An End-to-End Agentic Spoken Dialogue System

    Python · 35 stars · 3 forks

  2. DiTReducio (Public)

    [ACL 2026] DiTReducio: A Training-Free Acceleration for DiT-Based TTS via Progressive Calibration

    Python · 12 stars · 1 fork

  3. DualAxisRM (Public)

    [ACL 2026] Dual-Axis Generative Reward Model Toward Semantic and Turn-taking Robustness in Interactive Spoken Dialogue Models

    Python · 12 stars

  4. SwanBench-Speech (Public)

    [ACL 2026] SwanBench-Speech: Comprehensive Benchmarking of Long-Form Speech Generation in Diverse Scenarios

    10 stars

  5. WavAlign (Public)

    [ACL 2026] WavAlign: Enhancing Intelligence and Expressiveness in Spoken Dialogue Models via Adaptive Hybrid Post-Training

    Python · 7 stars · 2 forks

  6. SDiaReward (Public)

    [ACL 2026] SDiaReward: Modeling and Benchmarking Spoken Dialogue Rewards with Modality and Colloquialness

    Python · 6 stars · 1 fork

Repositories

Showing 10 of 12 repositories

People

This organization has no public members.