GCP Template Forge

An AI-driven pipeline that designs, deploys, and validates production-ready GKE reference architectures — dual-path (Terraform + Helm and Config Connector) — with every merge.

Objectives

Design — Use an AI agent (Gemini CLI + Claude) to author complete, enterprise-grade IaC templates from Google Cloud reference architectures, covering both Terraform/Helm and Config Connector deployment paths.
Deploy & Test — Run every template through a full apply → verify → destroy cycle in a real GCP sandbox project before any PR is merged, and again after merge to confirm the published artifact works end-to-end.
Consolidate — Act as a living, continuously-validated library of GKE patterns drawn from Google Cloud's public reference repositories, so teams can adopt them with confidence.

System Architecture

The forge is powered by the operator stack from gke-labs/gemini-for-kubernetes-development, running on a GKE Standard control-plane cluster.

flowchart LR
    DEV["👤 Developer\nopens issue"]

    subgraph OPS ["gke-labs Operator Stack"]
        OV["🎯 Overseer + Repo-Agent\ncreates branch & PR"]
        AG["🤖 Agent Sandbox\nauthors Terraform + KCC templates"]
        OV --> AG
    end

    subgraph GH ["GitHub — gcp-template-forge"]
        PR["🔀 Pull Request\nlint · deploy-test-tf ∥ deploy-test-kcc"]
        MAIN["✅ main\nvalidated template library"]
        PR -->|approved + merged| MAIN
    end

    subgraph GCP ["GCP Sandbox"]
        TF["🏗️ TF + Helm\ncluster"]
        KCC["☸️ Config Connector\ncluster"]
    end

    DEV --> OV
    AG -->|commits templates| PR
    PR -->|deploy-test-tf| TF
    PR -->|deploy-test-kcc| KCC
    MAIN -->|validate-tf-helm ∥ validate-kcc\nthen publish-validated| TF
    MAIN -->|validate-tf-helm ∥ validate-kcc\nthen publish-validated| KCC

Key Components

Component	Role	Repo
Overseer	Kubernetes operator that watches GitHub for new issues, coordinates the agent lifecycle, and manages PR state	`gke-labs/gemini-for-kubernetes-development`
Repo-Agent	Creates GitHub issues, branches, and PRs; posts status comments; triggers the agent sandbox	same
AgentSandboxes	Kubernetes Jobs that spin up an isolated Gemini CLI session per template; the agent authors all IaC files and commits them	same
CI Service Account	GCP service account used by GitHub Actions CI to authenticate and run Terraform/Helm/KCC against the sandbox project	`agent-infra/`

Repository Layout

.github/
  workflows/
    sandbox-validation-*.yml  ← lint · deploy-test-tf ∥ deploy-test-kcc (PR) · validate-tf-helm ∥ validate-kcc · publish-validated (push)
  ISSUE_TEMPLATE/           ← template request form
agent-infra/
  terraform/                ← control-plane GKE cluster + CI service account
  manifests/                ← Overseer + Repo-Agent + AgentSandboxes deployments
templates/                  ← validated template library (see Templates section below)
GEMINI.md                   ← guardrails and instructions for the Gemini CLI agent
GUIDANCE.md                 ← manual setup steps (identity, Secret Manager)

CI Pipeline

Job dependency graph

flowchart LR
    DC(["🔍 detect-changes\nDiff PR head SHA or push\nagainst base — outputs\nchanged template list"])

    subgraph pr ["━━━  Pull Request  ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━"]
        direction LR
        L(["🔎 lint\nper template\ntf fmt · tf validate\nhelm lint · KCC YAML\nboth-paths check"])
        DTTF(["🏗️ deploy-test-tf\nper template\nTF apply → verify → Helm deploy\n→ security scan → TF destroy\nPost PR summary comment"])
        DTKCC(["☸️ deploy-test-kcc\nper template\nKCC apply → wait Ready\n→ delete\nPost PR summary comment"])
    end

    subgraph push ["━━━  Push to main  ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━"]
        direction LR
        VTH(["🏗️ validate-tf-helm\nper template · 120 min timeout\nTF apply → verify cluster\nRUNNING → TF destroy\nSaves results artifact"])
        VKCC(["☸️ validate-kcc\nper template · 120 min timeout\nFresh runner — KCC cluster\ncreds from the start\nKCC apply → wait Ready\n→ delete\nSaves results artifact"])
        PUB(["📋 publish-validated\nper template\nAccumulate agent metrics\nUpdate README + .validated\n.agent-metrics-cumulative\ngit push [skip ci]"])
    end

    DC --> L
    L --> DTTF
    L --> DTKCC
    DC --> VTH
    DC --> VKCC
    VTH --> PUB
    VKCC --> PUB

deploy-test-tf and deploy-test-kcc run in parallel on separate runners after lint passes. Likewise, validate-tf-helm and validate-kcc both run in parallel on push to main — GCP resource name collisions are avoided by the -tf / -kcc suffix convention on all resource names. publish-validated waits for both validate jobs to complete before updating the README and .validated marker.

End-to-end sequence

sequenceDiagram
    actor Dev as Developer / Overseer
    participant GH as GitHub
    participant CI as GitHub Actions
    participant GCP as GCP Sandbox

    Dev->>GH: Open issue (template request)
    GH->>CI: Overseer creates branch + triggers AgentSandbox
    CI->>GH: Agent commits terraform-helm/ + config-connector/

    Note over GH,CI: ── Pull Request CI Gate ──────────────────────────────
    GH->>CI: PR opened → detect-changes (PR head SHA diff)
    CI->>CI: lint: tf fmt/validate · helm lint · KCC YAML · both-paths check
    par deploy-test-tf (parallel)
        CI->>GCP: TF apply (VPC · cluster · node pool)
        GCP-->>CI: cluster RUNNING ✓
        CI->>GCP: helm upgrade --install
        GCP-->>CI: workload ready ✓
        CI->>GCP: TF destroy
    and deploy-test-kcc (parallel)
        CI->>GCP: KCC apply (Config Connector manifests)
        GCP-->>CI: ContainerCluster Ready ✓
        CI->>GCP: kubectl delete (KCC teardown)
    end
    CI->>GH: Post deploy summary comment to PR (both paths)
    Dev->>GH: Review + merge PR

    Note over GH,CI: ── Post-Merge CI Gate (4 independent jobs) ──────────
    GH->>CI: push to main → detect-changes
    par validate-tf-helm (parallel)
        CI->>CI: skip-check (changed since last .validated?)
        CI->>GCP: TF apply (VPC · cluster · node pool · Helm workload)
        GCP-->>CI: cluster RUNNING ✓  nodes ready ✓
        CI->>GCP: TF destroy (full teardown)
    and validate-kcc (parallel)
        CI->>CI: skip-check (changed since last .validated?)
        CI->>GCP: kubectl apply (KCC manifests)
        GCP-->>CI: ContainerCluster Ready ✓
        CI->>GCP: kubectl delete (KCC teardown)
    end
    CI->>CI: publish-validated starts (waits for both above)
    CI->>CI: accumulate .agent-metrics across all sandbox sessions
    CI->>GH: Commit README.md + .validated + .agent-metrics-cumulative

Template structure

templates/<name>/
├── terraform-helm/              ← Terraform + Helm deployment path
│   ├── main.tf                  ← VPC · cluster · workload resources
│   ├── variables.tf
│   ├── versions.tf              ← pinned provider versions + GCS backend
│   ├── outputs.tf               ← cluster_name + cluster_location (required by CI)
│   └── workload/                ← Helm chart for the workload
│       ├── Chart.yaml
│       ├── values.yaml
│       └── templates/
├── config-connector/            ← Config Connector (KCC) deployment path
│   ├── network.yaml             ← ComputeNetwork + ComputeSubnetwork
│   ├── cluster.yaml             ← ContainerCluster (+ NodePool if standard)
│   └── workload/                ← Kubernetes manifests for the workload (required)
│       └── *.yaml               ← Deployment · Service · HPA · NetworkPolicy etc.
├── README.md                    ← auto-updated by CI with validation record
├── .validated                   ← CI marker: commit + status after successful deploy
├── .agent-metrics               ← written by agent sandbox (latest session)
└── .agent-metrics-cumulative    ← CI-maintained running total across all sessions

CI enforcement rules:

Both terraform-helm/ and config-connector/ must exist (lint fails otherwise)
google_container_cluster must have deletion_protection = false
KCC manifests must not use cnrm.cloud.google.com/deletion-policy: abandon
Resources must use template-based names (e.g., enterprise-gke-vpc) not issue numbers
validate-tf-helm / validate-kcc re-run whenever the template changes since last .validated commit

Templates

Template	TF+Helm	KCC	Validated
basic-gke-hello-world	GKE Standard + hello-world	GKE Standard + hello-world	—
enterprise-gke	GKE Standard + security stack + Helm workload	GKE Standard + security stack + KCC workload	—
latest-gke-features	GKE Standard + Gateway API + NAP + Native Sidecars	GKE Standard + Native Sidecars + Gateway API	—
gke-fqdn-egress-security	GKE Standard + FQDN Network Policies + AI Egress	GKE Standard + KCC Networking	—
gke-topology-aware-routing	GKE Standard + Topology-Aware Routing + Gateway API	GKE Standard + Topology-Aware Routing + Gateway API	—

Public Reference Sources

The forge validates patterns drawn from:

Source	Focus
Cloud Foundation Toolkit	GCP security baselines
Cluster Toolkit	HPC + AI/ML clusters
Kubernetes Engine Samples	GKE workload patterns
Terraform GKE Modules	Reusable TF modules
GKE AI Labs	AI/ML on GKE
Gemini for Kubernetes Development	Operator stack powering this forge
Accelerated Platforms	GPU/TPU workloads
GKE Policy Automation	Policy as code
LLM-D	LLM inference on GKE

Name		Name	Last commit message	Last commit date
Latest commit History 448 Commits
.gemini		.gemini
.github		.github
agent-infra		agent-infra
templates		templates
.gitignore		.gitignore
GEMINI.md		GEMINI.md
GUIDANCE.md		GUIDANCE.md
LICENSE		LICENSE
README.md		README.md
devcontainer.json		devcontainer.json
llms.txt		llms.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GCP Template Forge

Objectives

System Architecture

Key Components

Repository Layout

CI Pipeline

Job dependency graph

End-to-end sequence

Template structure

Templates

Public Reference Sources

Test Janitor

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

GCP Template Forge

Objectives

System Architecture

Key Components

Repository Layout

CI Pipeline

Job dependency graph

End-to-end sequence

Template structure

Templates

Public Reference Sources

Test Janitor

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages