ml-infra

Here are 3 public repositories matching this topic...

msunda17 / impactarbiter-cli

A deterministic PyTorch autograd verification trap for catching silent KV-cache routing and block-alignment failures in vLLM and SGLang serving infrastructure.

cli inference pytorch autograd multi-agent fuzzing sympy formal-verification mlops kv-cache llm-serving vllm pagedattention sglang agentic-workflow ml-infra radixattention

Updated Jun 7, 2026
Python

vgandhi1 / vla-bench

Star

Systematic VLA training optimization on 2× RTX 3090. WebDataset + FlashAttention-2 + FSDP → 3.3× throughput, 26% VRAM reduction. Profiler traces and W&B report linked. Reproducible in one command.

robotics pytorch tensorboard profiling imitation-learning multi-gpu vla gpu-optimization huggingface weights-and-biases webdataset fsdp flash-attention vision-language-action training-efficiency ml-infra

Updated May 16, 2026
Python

SahibTaj / Regression-Safe-RAG-Guardrails-Evaluation-Platform

Star

Regression-safe evaluation framework for RAG systems with faithfulness and coverage-based deployment gating.

evaluation ai-safety rag guardrails llm genai ml-infra

Updated Jan 27, 2026
Python

Improve this page

Add a description, image, and links to the ml-infra topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the ml-infra topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly