a-kaa

Follow

Hongshaorou a-kaa

Follow

SUSTech master 12433300@mail.sustech.edu.cn

1 follower · 4 following

Shenzhen
01:34 (UTC +08:00)

Achievements

Achievements

Pinned Loading

RL-Align/RL-Kernel RL-Align/RL-Kernel Public

High-performance RL post-training infrastructure. Designed to achieve bitwise operator-level train-inference consistency across heterogeneous engines and extreme memory efficiency for GRPO, PPO, etc.

Python 166 39