
Commit 526fd79

zhoward-1, claude, and austingreco authored
docs: add integrations landing page for operator guides (#1043)
## Summary

- Adds `docs/operator-guides/integrations/index.md` as a navigation hub for all integration content
- Organizes guides into three sections: Built-in Components (Model Registry), External System Integrations (Experiment Tracking), and Extending Built-in Components (Custom Serving Backend, Custom Scheduler, Register a Compute Cluster)
- Links back to related platform guides via a "Next Steps" section (auth, network, monitoring, troubleshooting)

## Why

The individual integration guides exist, but without a landing page operators have no single place to discover all integration surfaces. This completes the integrations section started by #1036 (operator index restructure).

## Test plan

- [x] All relative links resolve (verified after #1041 and #1042 merged from main)
- [x] Hub matches voice and structure of sibling index pages (`jobs/index.md`, `serving/index.md`)

---------

Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
Co-authored-by: Austin Greco <austingreco@gmail.com>
Co-authored-by: Austin Greco <agreco@uber.com>
1 parent 4c00dee commit 526fd79

1 file changed (37 additions & 0 deletions) in `docs/operator-guides/integrations`
@@ -0,0 +1,37 @@
# Integrations

Michelangelo runs alongside the ML infrastructure your organization already has. This page is the operator reference for connecting external systems and extending Michelangelo's built-in components. It targets platform engineers responsible for running Michelangelo in production.

## Built-in Components

These guides cover components that ship with Michelangelo. Operators configure them; they are not external systems.

| Guide | Description |
|-------|-------------|
| [Model Registry](model-registry.md) | Verify the registry is healthy, configure object store and RBAC, and integrate registered models with serving and CI/CD |

## External System Integrations

These guides cover connecting Michelangelo to systems your organization already runs.

| Guide | Description |
|-------|-------------|
| [Experiment Tracking](experiment-tracking.md) | Expose an external experiment tracking server to task pods — network setup, URI injection, and operator/user boundary |

## Extending Built-in Components

Michelangelo exposes extension points for replacing or augmenting its core subsystems. Use these when the defaults don't fit your infrastructure.

| Guide | Description |
|-------|-------------|
| [Custom Serving Backend](../serving/integrate-custom-backend.md) | Add support for any inference framework — Triton, vLLM, TensorRT-LLM, or your own |
| [Extend the Job Scheduler](../jobs/extend-michelangelo-batch-job-scheduler-system.md) | Replace or extend the scheduler — integrate Kueue, Volcano, or implement a custom `JobQueue` and `AssignmentStrategy` |
| [Register a Compute Cluster](../jobs/register-a-compute-cluster-to-michelangelo-control-plane.md) | Connect an existing Kubernetes cluster so Michelangelo can dispatch Ray jobs to it |

## Next Steps

- [Platform Setup](../platform-setup.md) — ConfigMap reference for all components
- [Authentication](../authentication.md) — OIDC, RBAC, and service-to-service auth
- [Network & Ingress](../network.md) — Ingress setup, Envoy proxy config, TLS, multi-cluster networking
- [Monitoring](../monitoring.md) — Prometheus metrics, alerting, Grafana dashboards
- [Troubleshooting](../troubleshooting.md) — Common failure modes and `kubectl` diagnostic commands
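
The test plan's "all relative links resolve" item can be checked mechanically. The following is a minimal sketch, not part of this commit or of Michelangelo's tooling: the helper names `relative_links` and `broken_links` are hypothetical, the regex only handles simple inline Markdown links, and the page path is assumed from the commit summary.

```python
import re
from pathlib import Path

# Matches simple inline Markdown links of the form [text](target).
LINK_RE = re.compile(r"\[[^\]]*\]\(([^)\s]+)\)")


def relative_links(markdown: str) -> list[str]:
    """Return relative link targets, skipping URLs, mailto links, and pure anchors."""
    targets = []
    for target in LINK_RE.findall(markdown):
        if "://" in target or target.startswith(("#", "mailto:")):
            continue
        # Drop any "#fragment" so only the file path is checked.
        targets.append(target.split("#", 1)[0])
    return targets


def broken_links(index_path: Path) -> list[str]:
    """Return relative targets in index_path that do not exist on disk."""
    base = index_path.parent
    text = index_path.read_text(encoding="utf-8")
    return [t for t in relative_links(text) if not (base / t).exists()]


if __name__ == "__main__":
    # Assumed location, taken from the commit summary; adjust to your checkout.
    page = Path("docs/operator-guides/integrations/index.md")
    if page.exists():
        for target in broken_links(page):
            print(f"broken link: {target}")
```

Run from the repository root, this prints one line per unresolved target and nothing when every relative link points at an existing file.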
