| Thread Navigator

Canvas & Ratio

Choose your destination platform format

Layout Template

Choose a content structure for your slides

Preset Themes

Typography & Sizing

Font Family

Title Font Size36px

Body Font Size18px

Header & Footer Size12px

Brand Kit Customization

AGENCY

Configure brand assets for headers & footers

MULTI-PROFILES (AGENCY)

Active Brand Profile

Show Brand Watermark

Brand Watermark Text

Social Handle

Brand Logo URL (PNG) AGENCY

SAVE PRESETS (AGENCY)

Save current as Preset

Outro Slide CTA

Customize your closing call-to-action slide

CTA Title

CTA Message & Emojis

Custom CTA Buttons

Background Pattern

Source Content

Build Your Carousel

Drag and drop any post card below onto a slide, or use the quick buttons to insert content/images instantly!

Drag Post #1

METR

@METR_Evals

We’re updating the way we measure model time horizons on software tasks (TH 1.0→1.1). The updated methodology incorporates more of the tasks from HCAST, expanding our total from 170 to 288. This produces tighter estimates, especially at longer horizons.

Apply Image

Drag Post #2

METR

@METR_Evals

Our new time horizon estimates are a bit lower for GPT-4-era models and a bit higher for recent models. This doesn’t change the long-run trend (2019-2025), but it does make the growth since 2023 appear significantly steeper.

Drag Post #3

METR

@METR_Evals

We’re also replacing Vivaria, our original evaluation infrastructure. Our tasks now run on Inspect, an open-source evaluation framework developed & maintained by @AISecurityInst.

Drag Post #4

METR

@METR_Evals

We are exploring additional ways to raise the ceiling for our measurements. Even this updated suite has relatively few long tasks (ones that take humans 8+ hours to complete), while model capabilities are continuing to rapidly improve.

Drag Post #5

METR

@METR_Evals

We've updated our interactive graphs and data to include estimates from time horizon 1.1 in addition to 1.0. For more details on the TH 1.0→1.1 update, check out our blog: <a target="_blank" href="https://metr.org/blog/2026-1-29-time-horizon-1-1/" color="blue">metr.org/blog/2026-1-29…</a>