Hi, thanks for the great post-training acceleration work for VAR!
I have two questions:
1. It seems flash attention is not applied when only the standard flash attn lib is installed. Could you share how to install your customized flash attention lib?
2. When the h_w div ratio is changed from 1.0 to any other value, the acceleration no longer seems to work. Is that expected?
Thanks in advance for your reply!