Why AI Agents May Fail by 2025

Why AI Agents May Fail by 2025

Category: Technology
Duration: 3 minutes
Added: July 20, 2025
Source: utkarshkanwat.com

Description

In this episode of Tech Talks Today, host Alex sits down with AI expert Utkarsh Kanwat, who shares his insights on the future of AI agents. With over 12 production systems built in development and DevOps, Utkarsh argues that the excitement surrounding autonomous agents in 2025 may be overly optimistic. He delves into critical mathematical realities that challenge their reliability, particularly in multi-step workflows where error compounding can drop overall success rates significantly. Additionally, he highlights the skyrocketing costs associated with maintaining context in AI conversations. Utkarsh advocates for a balanced approach that combines automation with human oversight, emphasizing that while the potential for AI agents is promising, we need to be realistic about their current limitations. Tune in to learn more about the factors influencing the future of AI agents and the lessons learned from real-world applications.

Show Notes

## Key Takeaways

1. AI agent success is heavily influenced by error compounding in multi-step processes, leading to lower overall reliability.
2. The costs of maintaining context in AI interactions can be prohibitively high, impacting implementation viability.
3. A balanced approach that incorporates human oversight is essential for effective AI systems.

## Topics Discussed

- The hype vs. reality of AI agents
- Mathematical challenges in AI workflows
- Cost implications of context in AI systems
- The importance of human checkpoints in automation

Topics

AI agents artificial intelligence automation software engineering DevOps 2025 predictions error compounding production systems context windows human oversight AI reliability Utkarsh Kanwat Tech Talks Today automation challenges AI hype

Transcript

H

Host

Welcome back to Tech Talks Today! I'm your host, Alex, and today we’re diving into a hot topic: AI agents. There’s a lot of buzz around them, especially with predictions that 2025 will be their breakthrough year. But our guest today, Utkarsh Kanwat, has built over a dozen of these systems and has some surprising insights. Utkarsh, welcome!

E

Expert

Thanks for having me, Alex! I’m excited to share my perspective.

H

Host

So, Utkarsh, you’ve been in the trenches building AI agents for various applications. What’s your take on the hype surrounding AI agents for 2025?

E

Expert

Well, I think a lot of what’s being said is overly optimistic. While I’ve built systems that deliver real value, the current excitement overlooks some critical mathematical realities.

H

Host

Interesting! Can you break that down for us? What are the key points that make you skeptical?

E

Expert

Sure! First, there's the issue of error compounding in multi-step workflows. If you have an AI agent performing a task, and each step has, say, a 95% reliability rate, the overall success rate drops significantly as you add more steps.

H

Host

Right, so if I’m understanding this correctly, 95% reliability sounds good in isolation, but when you're chaining tasks together…

E

Expert

Exactly! A 95% success rate over 20 steps only nets a 36% success overall. That’s far below what’s needed for production systems.

H

Host

So, it’s not just about how good the AI is at one individual task?

E

Expert

Exactly! It’s a mathematical reality. Even if you could miraculously achieve 99% per step, you still wouldn't meet the reliability required for production.

H

Host

Wow, that’s a real eye-opener. What about the costs associated with these systems?

E

Expert

Another crucial point! The costs can skyrocket due to context windows in language models. Each time an agent needs to keep track of a longer conversation or context, the computational cost increases exponentially.

H

Host

So, the longer the conversation, the more expensive it gets? Can you give us an example?

E

Expert

Sure! Imagine a customer service agent that needs to remember past interactions to assist a customer. As the conversation goes on, maintaining that context becomes prohibitively expensive.

H

Host

That sounds like a big hurdle for companies looking to implement these agents!

E

Expert

Absolutely. It’s a challenge that needs to be addressed. Moreover, successful agents I’ve designed rely on bounded contexts and human checkpoints, rather than trying to automate everything.

H

Host

So, it’s about finding a balance between automation and human oversight?

E

Expert

Precisely! Building systems that can operate effectively within our current limitations is crucial. Autonomous workflows are just not feasible at scale right now.

H

Host

It sounds like you’re advocating for a more measured approach to AI agents.

E

Expert

Exactly! Focusing on what works and acknowledging the limitations is key. The future of AI agents is bright, but it’s important to ground ourselves in reality.

H

Host

Thank you, Utkarsh! This has been an enlightening conversation, and I’m sure our listeners appreciate your insights.

E

Expert

Thanks for having me, Alex! I hope it helps people make informed decisions in the AI landscape.

H

Host

Absolutely! And for our listeners, stay tuned for more discussions on the future of technology. Until next time!

Create Your Own Podcast Library

Sign up to save articles and build your personalized podcast feed.