### Description We need to support SFT as one mode besides RL to allow warmup. ### Additional Information _No response_
Description
We need to support SFT as one mode besides RL to allow warmup.
Additional Information
No response