| Thread Navigator

Thread Truncated (Cap Enforced)

Only the first 20 tweets are unrolled into slides to ensure reliable PDF exporting and high server performance.

Canvas & Ratio

Choose your destination platform format

Layout Template

Choose a content structure for your slides

Preset Themes

Typography & Sizing

Font Family

Title Font Size36px

Body Font Size18px

Header & Footer Size12px

Brand Kit Customization

AGENCY

Configure brand assets for headers & footers

MULTI-PROFILES (AGENCY)

Active Brand Profile

Show Brand Watermark

Brand Watermark Text

Social Handle

Brand Logo URL (PNG) AGENCY

SAVE PRESETS (AGENCY)

Save current as Preset

Outro Slide CTA

Customize your closing call-to-action slide

CTA Title

CTA Message & Emojis

Custom CTA Buttons

Background Pattern

Source Content

Build Your Carousel

Drag and drop any post card below onto a slide, or use the quick buttons to insert content/images instantly!

Drag Post #1

gemchanger

@gemchange_ltd

> I had a swarm running. 80 agents on the same task, the kind where you can check the answer at the end. About a third of them were quietly garbage.

Apply Image

Drag Post #2

gemchanger

@gemchange_ltd

I did what everyone does. Averaged all 80. Throw a pile of agents at it, average, the mess washes out. Error came back at 0.99. Useless.

Drag Post #3

gemchanger

@gemchange_ltd

So I tried something else. I let the agents grade each other against a small set of questions where I already knew the answer, and fire the worst. Cut the bad ones, average who's left.

Drag Post #4

gemchanger

@gemchange_ltd

0.135.

Drag Post #5

gemchanger

@gemchange_ltd

86% of the error, gone. Same agents. I didn't add anything. I removed.

Drag Post #6

gemchanger

@gemchange_ltd

## Why more agents was never the answer

Drag Post #7

gemchanger

@gemchange_ltd

If your agents are wrong in random, independent ways, adding more cancels the wrongness out. That's the whole pitch, and it's true.

Drag Post #8

gemchanger

@gemchange_ltd

But they all came off the same model. So they miss together. Same hallucinated convention, same misread of the spec, all leaning the same way. Averaging a stack of numbers that lean the same way doesn't move the lean.

Drag Post #9

gemchanger

@gemchange_ltd

Agent 300, agent 400, doesn't matter. The agent count on the slide is the most worthless number in the system, and nobody wants to hear it.

Drag Post #10

gemchanger

@gemchange_ltd

## So you cut instead

Drag Post #11

gemchanger

@gemchange_ltd

Stop trying to drown the bad agents. Remove them.

Drag Post #12

gemchanger

@gemchange_ltd

You need a verify gate. A few questions where you know the truth. Tests, anchors, whatever you have. Score every agent, cut the worst, average the survivors. 0.99 to 0.135.

Drag Post #13

gemchanger

@gemchange_ltd

A plain median on the same dirty swarm gives 0.56. A 20% trimmed mean, 0.82. The firing, 0.135.

Drag Post #14

gemchanger

@gemchange_ltd

Apply Image

Drag Post #15

gemchanger

@gemchange_ltd

Median and trim are blind. They cut a fixed amount and hope. Firing isn't blind. Same idea as trimming, except it knows where the bodies are buried.

Drag Post #16

gemchanger

@gemchange_ltd

## But you can't just crank it

Drag Post #17

gemchanger

@gemchange_ltd

Firing is not a slider you push to 100.

Drag Post #18

gemchanger

@gemchange_ltd

I pushed it. Error dropped, bottomed out, then climbed straight back up. 128% above the bottom by the time I'd gutted nearly everyone. Cut too deep and four agents are holding the whole answer, and four agents is loud and shaky.

Drag Post #19

gemchanger

@gemchange_ltd

The bottom sits further out than your gut says. 30% of my agents were bad. The best cut was 70%.

Drag Post #20

gemchanger

@gemchange_ltd

Apply Image