Carousel Studio

Repurpose X Threads into LinkedIn & Instagram Carousels

Canvas & Ratio

Choose your destination platform format


Layout Template

Choose a content structure for your slides


Preset Themes


Typography & Sizing

Title Font Size: 36px
Body Font Size: 18px
Header & Footer Size: 12px

Brand Kit Customization

AGENCY

Configure brand assets for headers & footers

MULTI-PROFILES (AGENCY)
SAVE PRESETS (AGENCY)

Outro Slide CTA

Customize your closing call-to-action slide

#1
#2
#3

Background Pattern

Source Content

Build Your Carousel

Drag and drop any post card below onto a slide, or use the quick buttons to insert content/images instantly!

Drag Post #1
METR
@METR_Evals

We’re updating the way we measure model time horizons on software tasks (TH 1.0→1.1). The updated methodology incorporates more of the tasks from HCAST, expanding our total from 170 to 288. This produces tighter estimates, especially at longer horizons.

Apply Image
Drag Post #2
METR
@METR_Evals

Our new time horizon estimates are a bit lower for GPT-4-era models and a bit higher for recent models. This doesn’t change the long-run trend (2019-2025), but it does make the growth since 2023 appear significantly steeper.

Drag Post #3
METR
@METR_Evals

We’re also replacing Vivaria, our original evaluation infrastructure. Our tasks now run on Inspect, an open-source evaluation framework developed & maintained by @AISecurityInst.

Drag Post #4
METR
@METR_Evals

We are exploring additional ways to raise the ceiling for our measurements. Even this updated suite has relatively few long tasks (ones that take humans 8+ hours to complete), while model capabilities are continuing to rapidly improve.

Drag Post #5
METR
@METR_Evals

We've updated our interactive graphs and data to include estimates from time horizon 1.1 in addition to 1.0. For more details on the TH 1.0→1.1 update, check out our blog: https://metr.org/blog/2026-1-29-time-horizon-1-1/