Hi,👋 we have updated the app and fixed multiple bugs. We are lacking funds, request to free user not to use Adblock. Ads are non intrusive. 😊

@karpathy: I quite like the idea using ga...

43 views Feb 03, 2025

1

I quite like the idea using games to evaluate LLMs against each other, instead of fixed evals. Playing against another intelligent entity self-balances and adapts difficulty, so each eval (/environment) is leveraged a lot more. There's some early attempts around. Exciting area.

View Tweet

Actions

Visual Editor Carousel Maker NEW

What You Can Do

Download as PDF
Save to Notion
Export as Markdown
Visual Editor
LinkedIn & Instagram Carousel Maker

Create Free Account

Includes 7-day Premium trial