Visualize Thread by @akshay_pachaar

✨ Visual Editor

palette Canvas & Background

Presets

Custom Colors

Gradient:arrow_forward

Text Color:

Gradient Angle135°

Background Pattern

Grain Texture

Aspect Ratio

style Card Style

Preset

Padding40px

Card Radius16px

Enable Card Shadow

Glassmorphism Effect

Show Watermark AGENCY

Show Timestamps

Show X Logo

text_fields Typography

Font Family

Font Size16px

Akshay 🚀

@akshay_pachaar

Let's compare DeepSeek-R1 and OpenAI-o1 using RAG:

Akshay 🚀

@akshay_pachaar

DeepSeek-R1 delivers OpenAI-o1 level intelligence at 90% less cost.

Today, we build a Streamlit app to compare and evaluate them using RAG.

Tech stack:

- @Llama_Index for orchestration
- @Cometml Opik for evaluation
- @Streamlit for the UI

Let's go! 🚀

VIDEO

Akshay 🚀

@akshay_pachaar

The architecture presented below illustrates some of the key components & how they interact with each other!

For those who are new, I've provided detailed descriptions & code for each component.

Akshay 🚀

@akshay_pachaar

1️⃣ & 2️⃣ : Loading the knowledge base

A knowledge base is a collection of relevant and up-to-date information that serves as a foundation for RAG. In our case it's the docs stored in a directory.

Here's how you can load it as document objects in LlamaIndex:

Akshay 🚀

@akshay_pachaar

3️⃣ The embedding model

The embedding model Embedding is a meaningful representation of text in form of numbers.

The embedding model is responsible for creating embeddings for the document chunks & user queries.

Akshay 🚀

@akshay_pachaar

4️⃣ Indexing & storing

Embeddings created by embedding model are stored in a vector store that offers fast retrieval and similarity search by creating an index over our data.

By default, LlamaIndex provides a in-memory vector store that’s great for quick experimentation.

Akshay 🚀

@akshay_pachaar

5️⃣ Creating a prompt template

Creating a prompt template A custom prompt template is use to refine the response from LLM & include the context as well:

Akshay 🚀

@akshay_pachaar

6️⃣ Setting up a query engine

The query engine takes a query string & use it to fetch relevant context and then sends them both as a prompt to the LLM to generate a final natural language response.

Here's how you set it up:

Akshay 🚀

@akshay_pachaar

8️⃣ The Chat interface

We create a UI using Streamlit to provide a chat interface for our RAG application.

The code for this & all we discussed so far is shared in the next tweet!

Check this out👇

Akshay 🚀

@akshay_pachaar

Finally, we will conduct a proper evaluation.

For this, we'll use @Cometml's Opik, a 100% open-source platform for evaluation and observability.

I have shared a notebook where you'll find all the code for this evaluation.

Akshay 🚀

@akshay_pachaar

I used @LightningAI ⚡️ Studio for developing this application!

The studio reads like a blog, encapsulating all my code & environment to run it!

Clone a FREE studio now & take it for a spin...👇
lightning.ai/akshay-ddods/s…

Akshay 🚀

@akshay_pachaar

If you're interested in:

- Python 🐍
- Machine Learning 🤖
- AI Engineering ⚙️

Find me → @akshay_pachaar ✔️

Cheers! 🥂

Generated by Thread Navigator

100%

view_carousel Carousel Studio NEW

Press ⌘ + S to quick-export