The two modes

GRAIL has one architecture and two ways to feed it. That's the design decision worth understanding before you start.

Direct comparison

	Mode · Knowledge base	Mode · Agentic memory
Who writes	An LLM reads your documents and extracts entities	Your agent writes what it already knows
What goes in	`input/` with PDFs, markdown, code	`memories/<category>/` with markdown observations
Write cost	One LLM call per document chunk	Zero LLM calls — the agent already knows what it meant
When to process	Batch indexing (`grail index`)	Incremental, one observation at a time
Source of communities	Leiden algorithm over the graph	Declared folders + reviewed proposals
Extra search mode	—	`recall`: structural filter, no LLM
Init command	`grail init my-project`	`grail init my-project --memory`
Python API	`GRAIL.from_config(...)`	`MemoryProject(...)`

Same search layer on both sides

This is the critical part: the six search modes work identically on both. When you ask a natural-language question, it doesn't matter which door the facts came through.

local — find entities similar to your question, assemble their context, answer.
cascade — entity gate + BM25/cosine text rescue, combined ranking.
global — map-reduce over thematic community reports.
document — scope the search to a single source file.
agent — the LLM picks which mode to call per question, iterating up to 5 times.
recall — pure pandas filter over date, category, tags. Memory only. Zero LLM.

See Search modes for each one in detail.

When to pick which?

Pick knowledge base if…

You have an existing corpus you want queryable: a legal library, technical manuals, papers, legacy code.
The sources are authoritative (not written by your agent).
You're going to index once or periodically and query many times.
You care about exact provenance to source files for verifiable citations.

Pick agentic memory if…

Your agent needs to remember across sessions what it decided, observed, or learned.
The "sources" don't exist as documents — they're agent observations about the conversation, the code, the user's decisions.
You want the agent itself to control which entities and relationships to create (not an intermediate LLM guessing).
You need writes to be incremental and cheap — no re-indexing cycle.

Or both at once

Nothing stops you from mixing them. A project can have input/ with reference PDFs and memories/ with observations the agent accumulates in use. Both paths feed the same graph, and the six modes query both sources at once.

Layer	Behaviour
Artefacts	Same parquet files (`final_entities`, `final_relationships`, `final_text_units`, …)
Graph	Same NetworkX underneath
Vector store	Same (FAISS / LanceDB / ChromaDB)
Embeddings	Same model
Provenance	Each text unit keeps a pointer to the source file
Search modes	The 5 LLM modes work identically; `recall` is the sixth, memory-only

Next step

If you have documents to index → KB quickstart.
If you have an agent you want to give memory → Memory quickstart.
If you're not sure which mode fits → The decision tree breaks it down by question type.

Direct comparison​

Same search layer on both sides​

When to pick which?​

Pick knowledge base if…​

Pick agentic memory if…​

Or both at once​

What they share​

Next step​