| Thread Navigator

Canvas & Ratio

Choose your destination platform format

Layout Template

Choose a content structure for your slides

Preset Themes

Typography & Sizing

Font Family

Title Font Size36px

Body Font Size18px

Header & Footer Size12px

Brand Kit Customization

AGENCY

Configure brand assets for headers & footers

MULTI-PROFILES (AGENCY)

Active Brand Profile

Show Brand Watermark

Brand Watermark Text

Social Handle

Brand Logo URL (PNG) AGENCY

SAVE PRESETS (AGENCY)

Save current as Preset

Outro Slide CTA

Customize your closing call-to-action slide

CTA Title

CTA Message & Emojis

Custom CTA Buttons

Background Pattern

Source Content

Build Your Carousel

Drag and drop any post card below onto a slide, or use the quick buttons to insert content/images instantly!

Drag Post #1

elvis

@omarsar0

Fine-tuning LLM Agents without Fine-tuning LLMs Catchy title and very cool memory technique to improve deep research agents. Great for continuous, real-time learning without gradient updates. Here are my notes:

Apply Image

Drag Post #2

elvis

@omarsar0

Overview Proposes a memory‑based learning framework that lets deep‑research agents adapt online without updating model weights. The agent is cast as a memory‑augmented MDP with case‑based reasoning, implemented in a planner–executor loop over MCP tools.

Apply Image

Drag Post #3

elvis

@omarsar0

Method Decisions are guided by a learned case‑retrieval policy over an episodic Case Bank. Non‑parametric memory retrieves Top‑K similar cases; parametric memory learns a Q‑function (soft Q‑learning or single‑step CE training in deep‑research settings) to rank cases for reuse and revision.

Drag Post #4

elvis

@omarsar0

Architecture Planner (LLM CBR) + Executor (LLM MCP client) with three memories: Case, Subtask, Tool. It involves planning, tool execution, writing/reading of cases, and a replay buffer. Tools span search, crawl, multimodal document parsing, code execution, and math utilities.

Apply Image

Drag Post #5

elvis

@omarsar0

Results: • GAIA: 87.88% Pass@3 on validation and 79.40% on test, competitive with or above open‑source agent frameworks • DeepResearcher: 66.6 F1 and 80.4 PM average across seven open‑domain QA sets • SimpleQA: 95.0% accuracy, beating recent web‑agent baselines • HLE: 24.4 PM, close to GPT‑5 and ahead of several strong baselines

Apply Image

Drag Post #6

elvis

@omarsar0

Practical takeaways for agent builders: • Use a compact, curated case memory with adaptive retrieval rather than growing prompts. • Keep planning concise. A fast planner outperforms slow‑think planners for multi‑step tool use on GAIA by avoiding verbose or shortcut plans. • Separate planning and execution with explicit Subtask and Tool memories to coordinate long‑horizon work and reduce hallucinations Paper: <a target="_blank" href="https://arxiv.org/abs/2508.16153" color="blue">arxiv.org/abs/2508.16153</a>

Apply Image