#

multomodal

Here are 2 public repositories matching this topic...

FudanCVL / AVTrack

[ICML 2026] AVTrack: Audio-Visual Tracking in Human-centric Complex Scenes

tracking computer-vision segmentation audiovisual icml omnimodal multomodal

Updated May 14, 2026
Python

marie-jeannesotho844 / AVTrack

Track human speakers in complex scenes using this audio-visual instance segmentation dataset.

linux crawler database spider computer-vision hacking loading magnet-link segmentation magnet hacking-tool qtav icml javlibrary osint-python omnimodal multomodal

Updated May 27, 2026

Improve this page

Add a description, image, and links to the multomodal topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the multomodal topic, visit your repo's landing page and select "manage topics."