fix(mtp): support multi-head MTP loss logging (mtp-num-layers > 1)#2132
Draft
ZiyiTsang wants to merge 1 commit into
Draft
fix(mtp): support multi-head MTP loss logging (mtp-num-layers > 1)#2132ZiyiTsang wants to merge 1 commit into
ZiyiTsang wants to merge 1 commit into
background
wait
wait-all
cancel
parallel
Loading