> STAGE:Ingestion
> RECAP:Load documents, split them into overlapping chunks, embed each chunk, and store the vectors.
> PATHS:Document LoaderDocument LoaderReads raw files in any supported format and normalizes them into clean plain text.➔Text SplitterText SplitterSplits each document into overlapping ~512-token chunks so context isn't lost at boundaries.➔EmbedderEmbedderTurns every chunk into a dense vector that captures its meaning for similarity search.➔Vector StoreVector StorePersists the vectors (with their text) in a vector database for fast nearest-neighbour lookup. > BLUEPRINT:Fuses dense vector semantic retrieval with BM25 keyword matching for optimal recall.