Hi,πŸ‘‹ we have updated the app and fixed multiple bugs. We are lacking funds, request to free user not to use Adblock. Ads are non intrusive. 😊
@_avichawla

Avi Chawla (@_avichawla)

View on X 9 Unrolled Threads
Thread Archive
10
πŸ€– AI & Machine Learning

Every generate() call to an LLM runs two distinct computational phases on the same GPU:...

Jun 29, 2026
Thread Archive
9

- Google Maps uses graph ML to predict ETA - Netflix uses graph ML in recommendation - Spotify uses graph ML in recommendation - Pinterest uses graph ML in recommendation Here are 6 must-know ways for graph feature engineering (with code):...

Dec 12, 2025
Thread Archive
23

Fine-tuning LLM Agents without Fine-tuning LLMs! Imagine improving your AI agent's performance from experience without ever touching the model weights. It's just like how humans remember past episodes and learn from them. That's precisely what Memento does. The core concept: Instead of updating...

Oct 24, 2025
Thread Archive
6

KV caching in LLMs, clearly explained (with visuals):...

Oct 07, 2025
Thread Archive
9

The growth of LLM context length with time: - GPT-3.5-turbo β†’ 4k tokens - OpenAI GPT4 β†’ 8k tokens - Claude 2 β†’ 100k tokens - Llama 3 β†’ 128k tokens - Gemini β†’ 1M tokens Let's understand how they extend the context length of LLMs:...

Aug 23, 2025
Thread Archive
10

A new embedding model cuts vector DB costs by ~200x. It also outperforms OpenAI and Cohere models. Here's a complete breakdown (with visuals):...

Aug 14, 2025
Thread Archive
8

An MCP server that makes anyone a database engineer (open-source)! @MongoDB just released an MCP Server that lets AI tools like Claude, Cursor, and GitHub Copilot talk directly to a MongoDB deployment. That means anyone (technical or non-technical) can now say: - β€œShow me the most active users” -...

Aug 07, 2025
Thread Archive
10

A simple technique makes RAG ~32x memory efficient! - Perplexity uses it in its search index - Azure uses it in its search pipeline - HubSpot uses it in its AI assistant Let's understand how to use it in RAG systems (with code):...

Aug 04, 2025
Thread Archive
8

Andrew Ng's team once made a big mistake in a research paper. And it happened due to randomly splitting the data. Here's what happened:...

Jul 19, 2025