rag-vectorlessemerging

PageIndex RAG

No vector DB. LLM navigates a page tree to find answers.

Overview

Eliminates the vector database entirely. Builds a hierarchical index of document summaries and uses an LLM to navigate the tree — reading page summaries, deciding which branches to explore, and synthesizing final answers. Perfect for prototypes and small corpora.

Architecture

Interactive walkthrough

Indexing

Document Loaderlocal files

Tokenizerlowercase + split

BM25Okapi Indextf-idf scoring

Pickle Persistenceindex.pkl on disk

Retrieval

Query Tokenizersame pipeline

BM25 Scoringlexical relevance

Top-k Chunksranked results

Generation

Prompt Templatecontext window

LLM Callany provider

Indexing

Document Loaderlocal files

Tokenizerlowercase + split

BM25Okapi Indextf-idf scoring

Pickle Persistenceindex.pkl on disk

Retrieval

Query Tokenizersame pipeline

BM25 Scoringlexical relevance

Top-k Chunksranked results

Generation

Prompt Templatecontext window

LLM Callany provider

Summarized pipeline view. For the full interactive, scroll-driven walkthrough with clickable stages → Pipeline detail

When to use

Use when

corpus_size < 500 documents
zero infrastructure preference
prototyping / proof-of-concept
document structure is hierarchical

Avoid when

corpus_size > 1000 documents
latency sensitivity
cost sensitivity (LLM calls per navigation step)
real-time corpus updates

Compatible frameworks

llamaindexraw python

#vectorless#no-vector-db#tree-navigation#llama-index#prototype

Ready to build with PageIndex RAG?

Walk through the wizard to generate a complete, parameterized pipeline.

Generate Pipeline