intermediateFree

Building and Evaluating Advanced RAG Applications

Name: Building and Evaluating Advanced RAG Applications
Rating: 4.6 (5500 reviews)

by Jerry Liu & Anupam Datta · DeepLearning.AI

4.6

(5,500 reviews)

90K+ enrolled1 hourUpdated 2024-08

Go to Course

Our Verdict

Worth it — with caveats

Building and Evaluating Advanced RAG Applications is a free, ~2-hour DeepLearning.AI short course taught by Jerry Liu (co-founder/CEO of LlamaIndex) and Anupam Datta (TruEra co-founder, now AI Research Lead at Snowflake), and for engineers who have already built a basic RAG pipeline it is one of the most focused, high-signal introductions to RAG evaluation available. Across six short video lessons and four runnable notebooks it teaches two retrieval upgrades that reliably beat baseline RAG (sentence-window and auto-merging retrieval) and a concrete evaluation framework, the 'RAG triad' of Context Relevance, Groundedness, and Answer Relevance, measured with TruLens/TruEra. Its real value is the eval-and-iterate loop most RAG tutorials skip, not breadth of coverage. The biggest caveats are honesty issues with positioning and aging code: the official page labels it 'Beginner', but Class Central's editors and the prerequisites (solid Python plus prior RAG experience) make clear it is not a true beginner course, and learners report TruLens/LlamaIndex version-compatibility breakage plus a non-pip-installable 'utils' helper that makes local replication outside the provided environment frustrating. Treat it as a sharp, dated-but-still-relevant supplement rather than a from-scratch RAG bootcamp.

It is an excellent, free, expert-taught primer on RAG evaluation and advanced retrieval, but only if you already understand and have built a basic RAG pipeline; it is short (no end-to-end project), the code now suffers from library version drift and a non-standard 'utils' package, and despite the official 'Beginner' tag it is genuinely intermediate. Right fit for that audience, wrong fit for true beginners.

Best for: Developers and ML/LLM engineers who already know Python and have built (or at least understand) a basic RAG pipeline and now want to measure and systematically improve it: people frustrated by inconsistent retrieval quality who need an evaluation methodology (the RAG triad) and two proven retrieval upgrades. It is also a fast, free way to evaluate LlamaIndex + TruLens before adopting them, and a good complement after a broader LLM course.

Skip if: Complete beginners to RAG, LLMs, or Python, despite the official 'Beginner' label, you will struggle without prior RAG context. Also not ideal for people who want a long, comprehensive, project-based build (this is ~2 hours with no capstone), those who need a verifiable, credit-bearing certificate, anyone wanting framework-agnostic theory (it is tightly coupled to LlamaIndex and TruLens), or learners who need fully reproducible, currently-maintained code, as some notebooks rely on a custom 'utils' helper and specific library versions that have drifted.

About This Course

Build production-grade RAG pipelines with sentence-window retrieval, auto-merging, and evaluation using TruLens.

What You'll Learn

Build a baseline RAG pipeline with LlamaIndex and understand its limitations

Apply the RAG triad evaluation framework: Context Relevance, Groundedness, and Answer Relevance

Use TruLens (TruEra) to instrument, score, and track RAG experiments via a leaderboard/dashboard

Implement sentence-window retrieval to add surrounding context to retrieved chunks

Implement auto-merging (hierarchical) retrieval that merges child chunks into parent nodes above a threshold

Iteratively tune parameters (e.g., window size, merge threshold) and compare variants against the baseline using real metrics

Reason about tradeoffs such as larger context windows increasing token cost and potentially overwhelming the LLM

Curriculum

Introduction

~4 min overview video framing RAG evaluation and the course goals.

Advanced RAG Pipeline

~15 min video + code (notebook L1) building a baseline RAG app with LlamaIndex.

RAG Triad of metrics

~42 min video + code (notebook L2) defining Context Relevance, Groundedness, and Answer Relevance and measuring them with TruLens.

Sentence-window retrieval

~29 min video + code (notebook L3) adding surrounding-sentence context to chunks and evaluating window sizes.

Auto-merging Retrieval

~21 min video + code (notebook L4) using hierarchical parent/child chunks merged by threshold for better context.

Conclusion + Quiz

~1 min wrap-up video plus a short reading/quiz.

Prerequisites

Basic-to-solid Python programming (able to read and run Jupyter notebooks)
Prior exposure to or experience building a basic RAG pipeline (recommended despite the official 'Beginner' framing)
Conceptual familiarity with LLMs, embeddings, and vector retrieval
An OpenAI API key (lessons use OpenAI models; note community reports of API-usage issues when running locally)

Instructor

Jerry Liu & Anupam Datta

Instructor · DeepLearning.AI

Pros & Cons

Pros

Taught by the actual creators of the tools: Jerry Liu (LlamaIndex) and Anupam Datta (TruEra/TruLens), giving authoritative, first-party guidance
Tightly focused on RAG evaluation, the part most tutorials skip, with a concrete, reusable framework (the RAG triad) and an iterate-on-metrics workflow
Completely free and only ~2 hours (1h55m), with hands-on, runnable notebooks for two genuinely useful retrieval techniques
High signal-to-noise: sentence-window and auto-merging retrieval are practical upgrades you can port to your own LlamaIndex projects immediately
Independently recommended by Class Central's 2026 RAG roundup as the best advanced-learner RAG applications course

Cons

Mispositioned as 'Beginner' on the official page when it is realistically intermediate (Class Central states 'this isn't for beginners' and assumes prior RAG experience)
Code has aged: learners report TruLens/LlamaIndex version-compatibility issues (e.g., empty TruLens dashboard Records/Evaluations tab) and a non-pip-installable custom 'utils' helper that makes local replication outside the course environment hard
Very short with no end-to-end capstone project; it teaches concepts and snippets rather than a fully deployed system
Tightly coupled to LlamaIndex + TruLens (a single vendor stack) rather than teaching framework-agnostic RAG evaluation

Alternatives To Consider

LangChain for LLM Application Development

DeepLearning.AI

View course

Generative AI with Large Language Models

Coursera

View course

NLP Course

Hugging Face

View course

Frequently Asked Questions

Is Building and Evaluating Advanced RAG Applications free?

Yes — Building and Evaluating Advanced RAG Applications is free to access. Free to take on the DeepLearning.AI learning platform (free during the platform beta). A shareable certificate/accomplishment of completion is generated on learn.deeplearning.ai, but it is a non-accredited, non-credit completion record (the project catalog lists certificate:false; treat it as a portfolio token, not a formal credential).

Who is Building and Evaluating Advanced RAG Applications for?

Developers and ML/LLM engineers who already know Python and have built (or at least understand) a basic RAG pipeline and now want to measure and systematically improve it: people frustrated by inconsistent retrieval quality who need an evaluation methodology (the RAG triad) and two proven retrieval upgrades. It is also a fast, free way to evaluate LlamaIndex + TruLens before adopting them, and a good complement after a broader LLM course.

What will you learn in Building and Evaluating Advanced RAG Applications?

Build a baseline RAG pipeline with LlamaIndex and understand its limitations; Apply the RAG triad evaluation framework: Context Relevance, Groundedness, and Answer Relevance; Use TruLens (TruEra) to instrument, score, and track RAG experiments via a leaderboard/dashboard; Implement sentence-window retrieval to add surrounding context to retrieved chunks.

What are the prerequisites for Building and Evaluating Advanced RAG Applications?

Basic-to-solid Python programming (able to read and run Jupyter notebooks); Prior exposure to or experience building a basic RAG pipeline (recommended despite the official 'Beginner' framing); Conceptual familiarity with LLMs, embeddings, and vector retrieval; An OpenAI API key (lessons use OpenAI models; note community reports of API-usage issues when running locally).

Is Building and Evaluating Advanced RAG Applications worth it?

How we reviewed this course

This is an independent editorial assessment by Cursarium, based on DeepLearning.AI's published course materials and aggregated public learner feedback (last reviewed 2026-06). We have not independently completed the course. Links to providers are standard references, not paid placements.

Sources

Free

Go to Course