Multimodal extractors for video, image, audio, text & PDF — turn any file into searchable vector embeddings (SigLIP, Gemini, E5, CLAP, ArcFace).
machine-learning ocr ai embeddings gemini feature-extraction face-recognition clip semantic-search whisper video-search multimodal rag image-embeddings vector-search vector-database siglip audio-embeddings mixpeek video-embeddings
-
Updated
Jun 16, 2026 - Python