Talk: Your AI Agents Are Already Broken In Production (You Just Don’t Know It Yet)

### First name

Shashank

### Last name

Agarwal

### Startup domain

https://noveum.ai

### GitHub repo or gist to share (optional)

_No response_

### Twitter / X handle (optional)

itsshashank

### Talk summary

AI agents are already failing silently in production; most enterprises just don’t realize it yet. At Noveum.ai, we’re building autonomous reliability infrastructure for AI agents: replay testing, continuous evals, synthetic voice testing, regression detection, and PR-based auto-remediation workflows. The hardest problem has been closing the loop between detecting a failure and safely validating its fix before deployment. I’ll share real-world lessons from deploying these systems across enterprise voice agents, customer support automation, and multi-step AI workflows.
