ddpo

Here are 2 public repositories matching this topic...

arthur-x / AlmostPerfect

Simple end-to-end RLHF (Reinforcement Learning from Human Feedback) for diffusion models (DDPO) on personal hardware.

reinforcement-learning diffusion-models stable-diffusion rlhf ddpo

Updated Feb 26, 2025
Python

meghanaNanuvala / Diffusion-Personalization

Comparative study of six diffusion model personalization methods (DreamBooth, LoRA, Textual Inversion, Custom Diffusion, LCM, DDPO) using HuggingFace Diffusers on Stable Diffusion v1.5 | NVIDIA H100 | IU Quartz HPC

personalization lora diffusion-models stable-diffusion textual-inversion dreambooth latent-consistency-model ddpo custom-diffusion

Updated Apr 27, 2026
Python
