✨ Visual Editor

close

Thread Truncated

Only the first 20 tweets are shown to ensure high-quality rendering and prevent image size issues.

palette Canvas & Background

Gradient:arrow_forward
Text Color:
135°

style Card Style

40px
16px

text_fields Typography

16px
Delba
@delba_oliveira
As we delegate more ambitious tasks to Claude, it becomes increasingly important that it can verify its own work.

The more Claude can self-verify:
Thread image
Delba
@delba_oliveira
• the more independently it can work on long-running tasks
Delba
@delba_oliveira
• the better the quality of the final result
Delba
@delba_oliveira
• the fewer back and forths it takes to get there
Delba
@delba_oliveira
Video thumbnail
VIDEO
Delba
@delba_oliveira
The good news is that Claude already self-verifies against deterministic signals like type errors, lint errors, failing tests, and runtime errors. And as models improve, this will only get better.
Delba
@delba_oliveira
What Claude can’t always infer are the manual checks you run after it responds, and later on, before you merge code into production.

The more of those checks you can encode, the closer Claude’s first response gets to the final result you had in mind.
Delba
@delba_oliveira
You spend less time babysitting, and Claude can keep going while you work on something else.
Delba
@delba_oliveira
## Write down your processes
Delba
@delba_oliveira
A good place to start is to write down the best-practices version of what you or your team already do.
Delba
@delba_oliveira
For frontend, that's usually: run the dev server, open the browser, check the console for errors, click around as the user would and look out for things like layout shift or slow navigations.
Delba
@delba_oliveira
Every domain has its own version. For each of those steps, there's likely a tool Claude can use for verification:
Delba
@delba_oliveira
Thread image
Delba
@delba_oliveira
## Encode your process as a skill
Delba
@delba_oliveira
Once the process is clear, encode as much of it as possible as a skill. Install the `skill-creator` plugin, then ask Claude to interview you:
Delba
@delba_oliveira
/skill-creator Create a skill for verifying frontend changes end-to-end. Interview me about my workflow.
Delba
@delba_oliveira
If you're struggling to put your process into words, ask Claude for the domain best practices first and let it show you what an end-to-end verification flow might look like.

Taste and judgment are difficult to codify, but many checks have criteria Claude measure against: a performance budget, an accessibility checklist, design system rules, good vs bad examples.
Delba
@delba_oliveira
For example, a frontend skill might include instructions to capture a performance trace through the Chrome DevTools MCP or Agent browser.
Delba
@delba_oliveira
---
name: frontend-verify
description: Verify frontend changes in a browser. Run whenever
a UI (page, component, typography, CSS style) change is made.
---

# Frontend verify

- Run a two-step verification pass in a real browser.
- Fix issues and re-verify before responding to the user.

## Step 1 — Verify the change behaves as expected

1. Open the URL in a browser:
- In the Claude Code desktop app, use the embedded preview.
- In the CLI, use the Chrome DevTools MCP.
2. Interact with the new element and confirm it renders and
behaves as expected.

## Step 2 — Verify the change passes a mobile audit

1. Open the URL in a new page via the Chrome DevTools MCP
2. Run a performance trace and audit Core Web Vitals
Delba
@delba_oliveira
Other checks are more qualitative than pass/fail, like comparing data against historical norms. For these, you can work with Claude to set a rubric for evaluating output.
Generated by Thread Navigator
100%
view_carousel Carousel Studio NEW
Press + S to quick-export