clip-architecture

Here are 2 public repositories matching this topic...

sbmagar13 / VQGAN-CLIP-Text-to-Image

Text-to-Image Synthesis using Multimodal (VQGAN + CLIP) Architectures

python machine-learning deep-neural-networks gan artificial-neural-networks adversarial-networks taming-transformers vqgan clip-architecture

Updated Nov 14, 2024
Jupyter Notebook

R0GUE-A5H / CLIP-Style-Multimodal

A multimodal AI search engine built from scratch using a CLIP-style architecture (ViT + MPNet). It can search images via text or image queries and reports 27.6% Recall@1 on Flickr8k.

nlp search-engine computer-vision deep-learning pytorch gradio multimodal-learning flickr8k-dataset flickr8k hugging-face clip-architecture

Updated Jun 23, 2026
Jupyter Notebook
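
For readers unfamiliar with the Recall@1 figure quoted in the CLIP-Style-Multimodal description, here is a minimal sketch of how Recall@K is commonly computed for text-to-image retrieval. It is not taken from that repository: the function name `recall_at_k`, the toy embeddings, and the one-correct-image-per-query setup are all assumptions made for illustration.

```python
import numpy as np

def recall_at_k(text_emb, image_emb, gt_image_idx, k=1):
    """Fraction of text queries whose ground-truth image is in the top-k results.

    Hypothetical helper: assumes one correct image per text query.
    """
    # L2-normalize so the dot product equals cosine similarity.
    t = text_emb / np.linalg.norm(text_emb, axis=1, keepdims=True)
    v = image_emb / np.linalg.norm(image_emb, axis=1, keepdims=True)
    sims = t @ v.T                              # (num_texts, num_images)
    topk = np.argsort(-sims, axis=1)[:, :k]     # indices of the k most similar images
    hits = (topk == np.asarray(gt_image_idx)[:, None]).any(axis=1)
    return float(hits.mean())

# Toy data (made up): 3 images and 3 captions in a 3-dimensional embedding space.
image_emb = np.eye(3)
text_emb = np.array([
    [0.9, 0.1, 0.0],   # closest to image 0 (correct)
    [0.1, 0.8, 0.1],   # closest to image 1 (correct)
    [0.7, 0.2, 0.1],   # closest to image 0, but its ground truth is image 2
])
gt_image_idx = [0, 1, 2]

r1 = recall_at_k(text_emb, image_emb, gt_image_idx, k=1)
r3 = recall_at_k(text_emb, image_emb, gt_image_idx, k=3)
print(f"Recall@1 = {r1:.3f}, Recall@3 = {r3:.3f}")
```

In this toy case two of the three captions rank their own image first, so Recall@1 is 2/3. A real evaluation would replace the toy arrays with embeddings produced by the trained image and text encoders.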
