cosyvoice2:by deploying with vllm, the first chunk (15 tokens) of LLM model will take about 75ms and with flow + hift model, it can get a first frame time about 150ms, so with rwkv, with same 15 tokens, how long will it take and how many resoures will use?
cosyvoice2:by deploying with vllm, the first chunk (15 tokens) of LLM model will take about 75ms and with flow + hift model, it can get a first frame time about 150ms, so with rwkv, with same 15 tokens, how long will it take and how many resoures will use?