Hi! I'm asking about `train_megatron.py`: are you using the parallel mechanisms from `fairscale`? I don't see any sources for the `megatron` library. Is this your own custom Megatron built on `fairscale`?