Skip to content

feat(loss): split surrogate (--policy-loss) and IS granularity (--is-level) out of --advantage-estimator#2

Open
EazyReal wants to merge 1 commit into
mainfrom
upstream-pr/policy-loss-axis
Open

feat(loss): split surrogate (--policy-loss) and IS granularity (--is-level) out of --advantage-estimator#2
EazyReal wants to merge 1 commit into
mainfrom
upstream-pr/policy-loss-axis

Commits

Commits on Jun 21, 2026