LWAIL: Latent Wasserstein Adversarial Imitation Learning

This is the official PyTorch implementation of the paper Latent Wasserstein Adversarial Imitation Learning (LWAIL).

📝 Abstract

Imitation Learning (IL) enables agents to mimic expert behavior by learning from demonstrations. However, traditional IL methods require large amounts of medium-to-high-quality demonstrations as well as actions of expert demonstrations, both of which are often unavailable. To reduce this need, we propose Latent Wasserstein Adversarial Imitation Learning (LWAIL), a novel adversarial imitation learning framework that focuses on state-only distribution matching. It benefits from the Wasserstein distance computed in a dynamics-aware latent space. This dynamics-aware latent space differs from prior work and is obtained via a pre-training stage, where we train the Intention Conditioned Value Function (ICVF) to capture a dynamics-aware structure of the state space using a small set of randomly generated state-only data. We show that this enhances the policy’s understanding of state transitions, enabling the learning process to use only one or a few state-only expert episodes to achieve expert-level performance. Through experiments on multiple MuJoCo environments, we demonstrate that our method outperforms prior Wasserstein-based IL methods and prior adversarial IL methods, achieving better results across various tasks.
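
As a toy illustration of state-only distribution matching in a latent space, the sketch below computes the empirical 1-Wasserstein distance between expert and agent states after mapping each state to a scalar latent value. It is not the adversarial training procedure used by LWAIL: the embedding function `toy_embed` is a hypothetical stand-in for a learned ICVF embedding, and the closed form used here holds only for one-dimensional samples of equal size.

```python
from typing import Callable, List, Sequence

State = Sequence[float]


def wasserstein_1d(xs: List[float], ys: List[float]) -> float:
    """Empirical 1-Wasserstein distance between two equal-size 1D samples.

    In one dimension the optimal transport plan pairs the sorted samples,
    so the distance is the mean absolute difference of the order statistics.
    """
    if not xs or len(xs) != len(ys):
        raise ValueError("samples must be non-empty and of equal size")
    return sum(abs(a - b) for a, b in zip(sorted(xs), sorted(ys))) / len(xs)


def latent_w1(
    expert_states: Sequence[State],
    agent_states: Sequence[State],
    embed: Callable[[State], float],
) -> float:
    """Embed states into a scalar latent space, then compare distributions."""
    expert_latents = [embed(s) for s in expert_states]
    agent_latents = [embed(s) for s in agent_states]
    return wasserstein_1d(expert_latents, agent_latents)


def toy_embed(state: State) -> float:
    """Hypothetical embedding (assumption): keep only the first coordinate."""
    return state[0]


if __name__ == "__main__":
    expert = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0)]
    agent = [(0.5, 9.0), (1.5, 9.0), (2.5, 9.0)]
    # The second coordinate is ignored by toy_embed, so only the shift
    # along the first coordinate contributes to the distance.
    print(latent_w1(expert, agent, toy_embed))
```

No action labels appear anywhere in the computation, which is the sense in which the matching is state-only; the choice of embedding determines which differences between states count.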

⚙️ Environment Settings

To run the experiments, please set up the environment using the following commands.

Prerequisites

Python 3.8
PyTorch 2.4.1 with CUDA 12.4
D4RL
dm_control (optional, not required for main experiments)

Installation

We recommend using Anaconda to manage the environment; the repository includes an environment.yml file.

Install the remaining Python dependencies via:

pip install -r requirements.txt

Note: Other versions of Python and PyTorch may also work. The most important requirement is that D4RL and its dependencies are installed successfully.
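
A quick way to see whether the prerequisites are visible to the active Python environment is sketched below. The import names `torch`, `d4rl`, and `dm_control` are assumptions based on the usual module names for these packages, and the check only confirms that Python can locate each package, not that it imports without errors (D4RL in particular may still fail at import time if one of its own dependencies is broken).

```python
import importlib.util
from typing import Dict, Iterable

# Assumed import names for the prerequisites listed above.
REQUIRED = ("torch", "d4rl")
OPTIONAL = ("dm_control",)


def check_packages(names: Iterable[str]) -> Dict[str, bool]:
    """Map each top-level module name to whether Python can locate it."""
    return {name: importlib.util.find_spec(name) is not None for name in names}


if __name__ == "__main__":
    for name, found in check_packages(REQUIRED + OPTIONAL).items():
        label = "optional" if name in OPTIONAL else "required"
        status = "found" if found else "missing"
        print(f"{name} ({label}): {status}")
```

If a required package is reported missing, rerun the installation step inside the activated environment before launching any experiment.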

📂 Models and Datasets

Pre-trained Models

We provide our basic ICVF (Intention Conditioned Value Function) model in the icvf_model/ directory; it can be loaded directly for evaluation or fine-tuning.

For other model variants or to access the training code for the base model, please refer to the external codebase:

ICVF-PyTorch Repository (TODO)

Datasets

Due to GitHub storage limits, this repository includes only a standard one-trajectory dataset as a sample.

To generate the full datasets or acquire more trajectories, navigate to the datasets/ folder.

🚀 How to Run Experiments

We provide shell scripts to streamline the training and evaluation process.

Single Experiment

To run a specific experiment configuration:

bash run.bash

Reproduce Main Results

To quickly reproduce the main experiments presented in the paper, execute the following script. It runs the training loop across the defined environments:

bash all_run.bash

📍 Citation

If you find this code or paper useful for your research, please cite:

@inproceedings{yang2026lwail,
  title={Latent Wasserstein Adversarial Imitation Learning},
  author={Siqi Yang and Kai Yan and Alexander G. Schwing and Yu-Xiong Wang},
  booktitle={ICLR},
  year={2026}
}

📧 Contact

For any questions, please open an issue in this repository (preferred) or reach out to Siqi Yang at siqiyang@illinois.edu.
