@itsalexvacca: 8 Google engineers wrote the p...
@itsalexvacca
24 views
Sep 30, 2025
1
8 Google engineers wrote the paper that every AI company now uses as their bible. OpenAI built GPT on it, Anthropic built Claude on it, and Meta built LLaMA on it.
Every LLM worth billions uses this paper's transformer architecture as the foundation...
Every LLM worth billions uses this paper's transformer architecture as the foundation...
3
They published an 8-page paper titled "Attention Is All You Need"
The idea was simple: Instead of reading word by word, why not look at everything at once? Like how you can glance at a page and immediately see which words relate to each other.
They called it a Transformer.
The idea was simple: Instead of reading word by word, why not look at everything at once? Like how you can glance at a page and immediately see which words relate to each other.
They called it a Transformer.
9
Encoders need paired data - English sentence, German translation.
Whereas decoders only need raw text, maybe the entire internet.
Just predict the next word which needs no translation needed.
OpenAI turned Google's translation machine into a universal intelligence engine.
Whereas decoders only need raw text, maybe the entire internet.
Just predict the next word which needs no translation needed.
OpenAI turned Google's translation machine into a universal intelligence engine.
11
Then came RLHF - humans rating millions of Claude's responses.
Do this millions of times. The transformer learns what humans actually want.
Same 8-page architecture underneath. But Meta went even further.
Do this millions of times. The transformer learns what humans actually want.
Same 8-page architecture underneath. But Meta went even further.
13
Zuck's play: Let 100,000 developers improve LLaMA. They debug it, optimize it and build tools. Meta gets all innovations back.
While Google/OpenAI charge fees, Meta built an army of unpaid developers. Genius move? I don't know
While Google/OpenAI charge fees, Meta built an army of unpaid developers. Genius move? I don't know
14
Today, transformers power everything:
ChatGPT: Decoder transformer
Claude: Standard transformer
DALL-E: Vision transformer
Copilot: Code transformer
Same architecture. Different products.
ChatGPT: Decoder transformer
Claude: Standard transformer
DALL-E: Vision transformer
Copilot: Code transformer
Same architecture. Different products.
15
Thanks for making it to the end!
I'm Alex, co-founder at ColdIQ. Built a $6M ARR business in under 2 years. We're a remote team across 10 countries, helping 400+ businesses.
Here's how I make $450k+ every month with AI:
tinyurl.com/5n79rd5w
I'm Alex, co-founder at ColdIQ. Built a $6M ARR business in under 2 years. We're a remote team across 10 countries, helping 400+ businesses.
Here's how I make $450k+ every month with AI:
tinyurl.com/5n79rd5w
16
RT the first tweet if you found this thread valuable.
Follow me @itsalexvacca for more threads on outbound and GTM strategy, AI-powered sales systems, and how to build profitable businesses that don't depend on you.
I share what worked (and what didn't) in real time.
Follow me @itsalexvacca for more threads on outbound and GTM strategy, AI-powered sales systems, and how to build profitable businesses that don't depend on you.
I share what worked (and what didn't) in real time.
View Tweet








