eBPF agent and MCP server for GPU causal observability
Updated May 16, 2026 - C
Monitor low-utilization time, idle-state episodes, and workload starvation signals on NVIDIA datacenter GPUs.
Cluster-side OpenTelemetry Collector distribution and MCP-queryable event store for GPU clusters