Hi,šŸ‘‹ we have updated the app and fixed multiple bugs. We are lacking funds, request to free user not to use Adblock. Ads are non intrusive. 😊

@PawelHuryn: After an interview with @karpa...

@PawelHuryn
22 views Oct 23, 2025
1
After an interview with @karpathy, everyone is talking about what AI agents can/can't do.

But an opinion without data is just a hypothesis.

So, I tested 3x185 workflow executions for a market researcher agent.

The results have shocked me🧵
Media image
2
I tested three variants:

I. LLM Workflow: No agency, the entire logic carefully orchestrated.

What was expected:
- An LLM workflow was 2x faster (the same model) compared to an AI Agent.
- An LLM workflow consumed 12x less tokens to an AI Agent.

3/185 "errors" are minor formatting results.
Media image
3
II. Agentic Workflow: Deterministic logic moved to the orchestration layer.

More time, more tokens.
100% task success.

GPT-5 (a reasoning model) consumed less tokens than GPT-4o due to better compression.

None of this was surprising. But then:
Media image
4
III. AI Agent: Full autonomy without steps to take, just an objective

I were staring at the screen.

An AI agent without predefined reasoning steps succeeded 185/185 times (100%).
Media image
5
This is different from my previous observations for the same models:

6
Conclusions & learnings:

1. For simple use cases, we can already achieve 99%+ reliability
2. A verifier agent with a high TPR would push it even further
3. For complex or critical processes, you still need orchestration
4. Orchestration is faster, cheaper, and more reliable
7
@karpathy might be right.

We might need 10 years to achieve true AI intelligence.

But autonomy and reliability for most processes seem more like ~12 months away.

Agree? Disagree?

Let me know in the comments.

P.S....
8
A. Free n8n templates I used for testing: productcompass.pm/p/the-ultimate…
9
B. Enjoy this?

- Follow me @PawelHuryn for deep researched AI & PM
- Share this thread with others

I appreciate it!

Actions
Visual Editor Carousel Maker NEW
Update Thread
What You Can Do
  • Download as PDF
  • Save to Notion
  • Export as Markdown
  • Visual Editor
  • LinkedIn & Instagram Carousel Maker
Create Free Account

Includes 7-day Premium trial