Code for paper "Omni-SafetyBench: A Benchmark for Safety Evaluation of Audio-Visual Large Language Models".
-
Updated
Sep 28, 2025 - Python
Code for paper "Omni-SafetyBench: A Benchmark for Safety Evaluation of Audio-Visual Large Language Models".
OmniAgent (ICML 2026): the first native omni-modal agent for active video perception — a 7B agent that beats Qwen2.5-VL-72B with 73% fewer frames.
This repo is the official implementation of "Stage-adaptive Token Selection for Efficient Omni-modal LLMs"
Add a description, image, and links to the omni-modal topic page so that developers can more easily learn about it.
To associate your repository with the omni-modal topic, visit your repo's landing page and select "manage topics."