Visualize Thread by @akshay_pachaar

✨ Visual Editor

palette Canvas & Background

Presets

Custom Colors

Gradient:arrow_forward

Text Color:

Gradient Angle135°

Background Pattern

Grain Texture

Aspect Ratio

style Card Style

Preset

Padding40px

Card Radius16px

Enable Card Shadow

Glassmorphism Effect

Show Watermark AGENCY

Show Timestamps

Show X Logo

text_fields Typography

Font Family

Font Size16px

Akshay 🚀

@akshay_pachaar

Let's build a pipeline to evaluate and monitor a RAG application, using a 100% open-source tool:

Akshay 🚀

@akshay_pachaar

Before we start here's a quick demo what we're building:

Tech Stack:

- @Cometml's Opik for eval and observability
- @Llama_Index to build a RAG app

Track everything from, LLM calls to chunking, embedding, generation and evaluation!

VIDEO

Akshay 🚀

@akshay_pachaar

The architecture diagram presented below illustrates some of the key components & how they interact with each other!

It will be followed by detailed descriptions & code for each component:

Akshay 🚀

@akshay_pachaar

1️⃣ Configuration and setup

First we configure everything to:

- Trace all LLM calls
- Trace all RAG steps

Note: You can also easily use Ollama LLMs, i have shared example in the GitHub below.

Fundamentals would still remain same.

Akshay 🚀

@akshay_pachaar

2️⃣ Create a simple RAG app

This is more a didactic example, but you can always make it more sophisticated.

Here's a simple RAG setup:

Akshay 🚀

@akshay_pachaar

3️⃣ LLM app and Evaluation task

Next we need to create an LLM application function and define an evaluation task.

Here's how we do it...👇

Akshay 🚀

@akshay_pachaar

4️⃣ Prep eval dataset

We triples of the following:

- Questions
- Their answers
- The relevant context for each QA pair

Here's our sample dataset...👇

Akshay 🚀

@akshay_pachaar

5️⃣ Load the dataset into Opik

Next we load this dataset in Opik so that everything is tracked an can be used for evaluation.

Check this out👇

Akshay 🚀

@akshay_pachaar

6️⃣ Load the dataset into Opik

Next we load this dataset in Opik so that everything is tracked an can be used for evaluation.

Check this out👇

Akshay 🚀

@akshay_pachaar

7️⃣ Define Evaluation metrics

Opik provide out of the box for all the popular LLM/RAG evaluation metrics.

Check this out👇

Akshay 🚀

@akshay_pachaar

8️⃣ Evaluate

Finally, it's time to put everything together and run evaluation.

Check this out👇

Akshay 🚀

@akshay_pachaar

You can find all the code and everything you need here!

Don't forget to star the repo: github.com/patchy631/ai-e…

Akshay 🚀

@akshay_pachaar

If you're interested in:

- Python 🐍
- ML/AI Engineering ⚙️

Find me → @akshay_pachaar ✔️
Everyday, I share tutorials on above topics!

Generated by Thread Navigator

100%

view_carousel Carousel Studio NEW

Press ⌘ + S to quick-export