I’m trying to understand how to run the KV cache workload in a multi-node setup, but the README does not provide guidance.
Could you please clarify:
- How should KV cache work across multiple nodes?
- What are the rules for closed submissions?
It would be very helpful if you could include a few example run commands for:
- KV cache workload (single-node)
- KV cache workload (multi-node, if applicable)
Thanks!
I’m trying to understand how to run the KV cache workload in a multi-node setup, but the README does not provide guidance.
Could you please clarify:
It would be very helpful if you could include a few example run commands for:
Thanks!