Juliacon 2024

The Levels of RAG 🦜
2024-07-11 , Else (1.3)

LLM's can be supercharged using a technique called RAG, allowing us to overcome dealbreaker problems like hallucinations or no access to internal data. RAG is gaining more industry momentum and is becoming rapidly more mature both in the open-source world and at major Cloud vendors. But what can we expect from RAG? What is the current state of the tech in the industry? What use-cases work well and which are more challenging? Let's find out together!


Retrieval Augmented Generation (RAG) is a popular technique to combine retrieval methods like vector search together with Large Language Models (LLM's). This gives us several advantages like retrieving extra information based on a user search query: allowing us to quote and cite LLM-generated answers. Because the underlying techniques are very broadly applicable, many types of data can be used to build up a RAG system, like textual data, tables, graphs or even images.

In this talk, we will deep dive into this popular emerging technique. Together, we will learn about: what the current state of RAG is, what tech is available to support you and what you expect to work well and what is still very challenging.

Join us if you 🫵:
- Are interested in GenAI / LLM's and RAG
- Want to know more about the current state of RAG
- Would like to know when you can most successfully apply RAG

Contents of the talk 📌

  1. [2 min] Intro
  2. [3 min] Why RAG?
    1. The case for RAG
    2. The RAG advantage
    3. … so how-to RAG?
  3. [10 min] Basic RAG: Which ingredients make up a successful RAG system?
    1. Data ingestion (OCR, PDF reading)
    2. Chunking
    3. Vector search
    4. Keyword search
    5. Mapping these components to the tech landscape
    6. 💥 Encountering difficulties
  4. [10 min] Advanced RAG: Going multimodal
    1. Tabular data
    2. Graphs
    3. Images
  5. [4 min] Summing things up
    1. The levels of RAG: from basic to advanced
    2. GenAI community 🫂
    3. Concluding remarks
  6. [1 min] End

[30 minutes total]

❤️ Open Source Software

RAG and LLM’s are presented in a cloud-agnostic way. Many of the software libraries mentioned are open source. There is no agenda for representing any major cloud.