Visualize Thread by @Av1dlive

✨ Visual Editor

Thread Truncated

Only the first 20 tweets are shown to ensure high-quality rendering and prevent image size issues.

palette Canvas & Background

Presets

Custom Colors

Gradient:arrow_forward

Text Color:

Gradient Angle135°

Background Pattern

Grain Texture

Aspect Ratio

style Card Style

Preset

Padding40px

Card Radius16px

Enable Card Shadow

Glassmorphism Effect

Show Watermark AGENCY

Show Timestamps

Show X Logo

text_fields Typography

Font Family

Font Size16px

Avid

@Av1dlive

You don't need billions to train the next ChatGPT

All you need is a $100 and Andrej's Karpathy's Nanochat

I used it for the last week here is what I found

Avid

@Av1dlive

---

Avid

@Av1dlive

Disclaimer: the cost of compute is expected to go down the next decade. Even though my statement is hyperbolic , you can get a usable version in less than $100. This is a build and not a bold statement. I do agree that the capex right now to train these AI models is insanely high, but I'm expecting that one day will come a time where we will be able to train awesome frontier models at really economical prices.

Avid

@Av1dlive

---

Avid

@Av1dlive

I spent ~$100 and one weekend training a ChatGPT-style model from scratch on my own notes, writing, and exported AI chats.

It now answers in my voice and recalls my own ideas, with no API and no rented brain.

Avid

@Av1dlive

This guide is the version I wish I'd had: every command, every code change, and plain-English explanations of the jargon so you don't get stuck.

Avid

@Av1dlive

If you've never trained a model before, you're the target reader. Take it one step at a time.

Avid

@Av1dlive

## Read this first (what you're signing up for)

Avid

@Av1dlive

What you'll end up with: a small GPT, roughly as capable as OpenAI's original GPT-2 (2019), fine-tuned on your own data so it sounds like you and knows your stuff. You can chat with it in a ChatGPT-style web page.

Avid

@Av1dlive

Honest expectations: this is not GPT-4. It's "a kindergartener with your memories", charming, useful for recall and drafting, and confidently wrong sometimes. The magic isn't raw IQ; it's that it's yours, it's private, and you understand every part of it.

Avid

@Av1dlive

What it costs: about $48–$100 in rented GPU time for the full run. You can learn the entire pipeline for ~$0 first (more on that below).

Avid

@Av1dlive

Skills you need:

Avid

@Av1dlive

• Comfort typing commands into a terminal (copy-paste is fine).

Avid

@Av1dlive

• Basic Python literacy helps for the data step, but I'll give you working scripts.

Avid

@Av1dlive

• No machine-learning background required. I'll explain the concepts as we go.

Avid

@Av1dlive

Time: budget a weekend. The actual training is ~3 hours; the rest is setup and preparing your data.

Avid

@Av1dlive

## The 60-second mental model

Avid

@Av1dlive

Training a chatbot happens in two big phases. Keep these straight and everything else makes sense.

Avid

@Av1dlive

1. Pretraining → produces the base model. The model reads a huge pile of internet text and learns one skill: predict the next word. This is where it learns grammar, facts, and reasoning. It's expensive (this is the ~3 hours of GPU time). The result talks like the internet, it can complete text but can't chat.

Avid

@Av1dlive

1. Fine-tuning (SFT) → produces the chat model. You show the base model thousands of example conversations so it learns to answer like an assistant. This is cheap and fast (minutes). This is where your personal data goes in.

Generated by Thread Navigator

100%

view_carousel Carousel Studio NEW

Press ⌘ + S to quick-export