-
Notifications
You must be signed in to change notification settings - Fork 0
Home
Welcome to the cloud-telemetry-agent wiki!
A production-grade Python telemetry agent that collects real-time system metrics, publishes them to AWS CloudWatch, and fires threshold-based alerts via AWS SNS.
cloud-telemetry-agent is a lightweight, deployable monitoring agent built to mirror real-world NOC and CloudOps infrastructure patterns. It runs on any host, collects live system metrics, and ships telemetry directly to AWS — no third-party monitoring platforms required.
This is not a tutorial project. Every design decision reflects production engineering practices: config-driven architecture, structured logging, modular components, and resilient AWS integration with automatic retry logic.
| Page | Description |
|---|---|
| Architecture | System design, component breakdown, and data flow |
| Setup & Configuration | Environment setup, AWS configuration, and config.ini reference |
| Usage Guide | Running the agent, interpreting output, and operational modes |
| Runbook | NOC-style operational procedures and escalation steps |
| Troubleshooting | Common issues, error messages, and fixes |
| Roadmap | Planned features and future integrations |
| Property | Detail |
|---|---|
| Language | Python 3.x |
| Cloud Provider | AWS (Free Tier compatible) |
| Key Services | CloudWatch, SNS, IAM |
| Collection Interval | Configurable (default: 60s) |
| Alert Delivery | Email via AWS SNS |
| Log Format | Structured JSON |
Your Machine → cloud-telemetry-agent → AWS CloudWatch → Metrics Dashboard
→ AWS SNS → Alert Email
→ Local Logs → JSON Log File
Part of the Alex-CloudOps observability portfolio — built to demonstrate production-grade CloudOps and NOC engineering practices.