Hi,👋 we have updated the app and fixed multiple bugs. We are lacking funds, request to free user not to use Adblock. Ads are non intrusive. 😊

✨ Visual Editor

close

Thread Truncated

Only the first 20 tweets are shown to ensure high-quality rendering and prevent image size issues.

palette Canvas & Background

Gradient:arrow_forward
Text Color:
135°

style Card Style

40px
16px

text_fields Typography

16px
Avid
@Av1dlive
You don't need billions to train the next ChatGPT

All you need is a $100 and Andrej's Karpathy's Nanochat

I used it for the last week here is what I found
Thread image
Avid
@Av1dlive
---
Avid
@Av1dlive

Disclaimer: the cost of compute is expected to go down the next decade. Even though my statement is hyperbolic , you can get a usable version in less than $100. This is a build and not a bold statement. I do agree that the capex right now to train these AI models is insanely high, but I'm expecting that one day will come a time where we will be able to train awesome frontier models at really economical prices.
Avid
@Av1dlive
---
Avid
@Av1dlive
I spent ~$100 and one weekend training a ChatGPT-style model from scratch on my own notes, writing, and exported AI chats.

It now answers in my voice and recalls my own ideas, with no API and no rented brain.
Avid
@Av1dlive
This guide is the version I wish I'd had: every command, every code change, and plain-English explanations of the jargon so you don't get stuck.
Avid
@Av1dlive
If you've never trained a model before, you're the target reader. Take it one step at a time.
Avid
@Av1dlive
## Read this first (what you're signing up for)
Avid
@Av1dlive
What you'll end up with: a small GPT, roughly as capable as OpenAI's original GPT-2 (2019), fine-tuned on your own data so it sounds like you and knows your stuff. You can chat with it in a ChatGPT-style web page.
Avid
@Av1dlive
Honest expectations: this is not GPT-4. It's "a kindergartener with your memories", charming, useful for recall and drafting, and confidently wrong sometimes. The magic isn't raw IQ; it's that it's yours, it's private, and you understand every part of it.
Avid
@Av1dlive
What it costs: about $48–$100 in rented GPU time for the full run. You can learn the entire pipeline for ~$0 first (more on that below).
Avid
@Av1dlive
Skills you need:
Avid
@Av1dlive
• Comfort typing commands into a terminal (copy-paste is fine).
Avid
@Av1dlive
• Basic Python literacy helps for the data step, but I'll give you working scripts.
Avid
@Av1dlive
• No machine-learning background required. I'll explain the concepts as we go.
Avid
@Av1dlive
Time: budget a weekend. The actual training is ~3 hours; the rest is setup and preparing your data.
Avid
@Av1dlive
## The 60-second mental model
Avid
@Av1dlive
Training a chatbot happens in two big phases. Keep these straight and everything else makes sense.
Avid
@Av1dlive
1. Pretraining → produces the base model. The model reads a huge pile of internet text and learns one skill: predict the next word. This is where it learns grammar, facts, and reasoning. It's expensive (this is the ~3 hours of GPU time). The result talks like the internet, it can complete text but can't chat.
Avid
@Av1dlive
1. Fine-tuning (SFT) → produces the chat model. You show the base model thousands of example conversations so it learns to answer like an assistant. This is cheap and fast (minutes). This is where your personal data goes in.
Generated by Thread Navigator
100%
view_carousel Carousel Studio NEW
Press ⌘ + S to quick-export