AsymVLM: Official PyTorch Implementation

This repository contains the implementation of our paper, "Exploiting the Asymmetric Uncertainty Structure of Pre-trained VLMs on the Unit Hypersphere," accepted at NeurIPS 2025.

Overview

AsymVLM is a post-hoc adaptation method for pre-trained vision-language models (VLMs) that models the uncertainty of text embeddings on the unit hypersphere. This implementation supports two distributional families for this purpose:

von Mises-Fisher (vMF)
Power Spherical (PSD)

llustrative Example

AsymVLM captures aleatoric uncertainty in text embeddings. As a prompt becomes more descriptive, its uncertainty measure (i.e., the inverse concentration $\kappa^{-1}$ for the vMF/PS distribution) decreases monotonically:

Prompt                                             Uncertainty
-------------------------------------------------  ----------
a photo                                            4.4939e-01
a photo of a cat                                   2.1371e-01
a photo of a black cat                             1.6452e-01
a photo of a black cat with green eyes             1.3495e-01
a photo of a black cat with big dark green eyes    1.3059e-01

Installation

Install the required dependencies:

pip install -r requirements.txt

Usage

1. Cache Embeddings

First, cache the CLIP embeddings for the dataset. Before running the code, you need to specify the path to the data and corresponding annotation files in cache_embeddings.py, line 11-19. Then run

python cache_embeddings.py --dataset coco

2. Train the Adaptor

Train the AsymVLM adaptor with either VMF or PSD distribution:

python train.py --dataset coco --method asymvlm-psd --seed 0

Options for --method:

asymvlm-psd: Power Spherical Distribution
asymvlm-vmf: von Mises-Fisher Distribution
probvlm: ProbVLM
pfe: PFE
pcmepp: PCME++
prolip: ProLIP

3. Evaluate the Model

Evaluate the trained model on cross-modal retrieval tasks:

python eval.py --dataset coco --method asymvlm-psd --seed 0 --uncer_levels 10

Project Structure

.
├── datasets/
│   ├── coco.py
│   └── embedding.py
├── models/                 # Method implementation
│   ├── asymvlm/
│   ├── pcmepp/
│   ├── pfe/
│   ├── probvlm/
│   └── prolip/
├── utils/
│   ├── preprocess.py
│   └── seed.py
├── cache_embeddings.py     # Script for CLIP embeddings caching
├── train.py                # Training script
├── eval.py                 # Evaluation script
└── requirements.txt        # Project dependencies

Citation

If you find this code useful for your research, please cite our paper:

@article{ju2025exploiting,
    title={Exploiting the Asymmetric Uncertainty Structure of Pre-trained VLMs on the Unit Hypersphere},
    author={Ju, Li and Andersson, Max and Fredriksson, Stina and Gl{\"o}ckner, Edward and Hellander, Andreas and Vats, Ekta and Singh, Prashant},
    booktitle = {NeurIPS},
    year = {2025}
}

License

CC-BY-4.0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AsymVLM: Official PyTorch Implementation

Overview

llustrative Example

Installation

Usage

1. Cache Embeddings

2. Train the Adaptor

3. Evaluate the Model

Project Structure

Citation

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
assets		assets
datasets		datasets
models		models
utils		utils
.gitignore		.gitignore
LICENSE.md		LICENSE.md
README.md		README.md
cache_embeddings.py		cache_embeddings.py
eval.py		eval.py
requirements.txt		requirements.txt
train.py		train.py

Folders and files

Latest commit

History

Repository files navigation

AsymVLM: Official PyTorch Implementation

Overview

llustrative Example

Installation

Usage

1. Cache Embeddings

2. Train the Adaptor

3. Evaluate the Model

Project Structure

Citation

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages