✨ Visual Editor

close

Thread Truncated

Only the first 20 tweets are shown to ensure high-quality rendering and prevent image size issues.

palette Canvas & Background

Gradient:arrow_forward
Text Color:
135°

style Card Style

40px
16px

text_fields Typography

16px
hoeem
@hooeem
Your agentic workflows are wasting your tokens. They’re wasting your money repeatedly for an orchestration loop. Here’s how you fix it and make your workflows 100x cheaper (yes, 100x cheaper).
Thread image
hoeem
@hooeem
In fact in testing it was found to be 128x, 296x, and 462x cheaper in the three tested domains, so 100x is an understatement.
hoeem
@hooeem
The paper this research has come from has been written by Simon Dennis, Riviaan Patil, Kevin Shabahang, & Hao Guo from the University of Melbourne (I’ll link the paper in full at the end).
hoeem
@hooeem
This article is going to tell you how to utilise their research so that you can make your agentic workflows 100x cheaper and the contents of this article is the following:
hoeem
@hooeem
1. Why your agentic workflow costs so much
hoeem
@hooeem
1. The one idea to take away
hoeem
@hooeem
1. How it's actually done (theory)
hoeem
@hooeem
1. Does it hold up?
hoeem
@hooeem
1. Run your own numbers (find how much it cost)
hoeem
@hooeem
1. Is this for you?
hoeem
@hooeem
1. Full build guide
hoeem
@hooeem
This guide can make your conversations up to 462x cheaper whilst keeping 87-98% of the frontier quality kept, so let's get started!
hoeem
@hooeem
## 1: Why your agentic workflow costs so much
hoeem
@hooeem
Okay, so you have a fixed procedure - an agentic workflow.
hoeem
@hooeem
Then you have where this agentic workflow lives and depending on that single choice is exactly what drives the cost of your agentic workflow.
hoeem
@hooeem
A: Orchestration
hoeem
@hooeem
This is the most common setup you see today, this is where software sits on top of the model and, every single turn, injects instructions and decides where the conversation goes next.
hoeem
@hooeem
The cost? $0.05-0.17 per conversation.
hoeem
@hooeem
B: In-context
hoeem
@hooeem
This is the "just prompt it" route, this is where you paste the whole workflow into the model's system prompt and you let it run yourself.
Generated by Thread Navigator
100%
view_carousel Carousel Studio NEW
Press + S to quick-export