First name
Shashank
Last name
Agarwal
Startup domain
https://noveum.ai
GitHub repo or gist to share (optional)
No response
Twitter / X handle (optional)
itsshashank
Talk summary
AI agents are already silently failing in production — most enterprises just don’t realize it yet. At Noveum.ai, we’re building autonomous reliability infrastructure for AI agents: replay testing, continuous evals, synthetic voice testing, regression detection, and PR-based auto-remediation workflows. The hardest problem has been closing the loop between detecting failures and safely validating fixes before deployment. I’ll share real-world lessons from deploying these systems across enterprise voice agents, customer support automation, and multi-step AI workflows.
First name
Shashank
Last name
Agarwal
Startup domain
https://noveum.ai
GitHub repo or gist to share (optional)
No response
Twitter / X handle (optional)
itsshashank
Talk summary
AI agents are already silently failing in production — most enterprises just don’t realize it yet. At Noveum.ai, we’re building autonomous reliability infrastructure for AI agents: replay testing, continuous evals, synthetic voice testing, regression detection, and PR-based auto-remediation workflows. The hardest problem has been closing the loop between detecting failures and safely validating fixes before deployment. I’ll share real-world lessons from deploying these systems across enterprise voice agents, customer support automation, and multi-step AI workflows.