What is the difference between RAG and a knowledge graph?

RAG (Retrieval-Augmented Generation) retrieves semantically similar text chunks from a vector database and injects them into an LLM prompt. A knowledge graph models entities and relationships as structured triples (subject-predicate-object), enabling multi-hop reasoning. RAG answers 'what text is similar to this query?' Knowledge graphs answer 'what is related to X through what relationship?'

What is agent memory and how does it differ from RAG?

Agent memory persists and retrieves state across agent invocations — current task progress, past decisions, user preferences, tool outputs. RAG retrieves external knowledge. Agent memory retrieves internal state. Production agents need both: RAG for domain knowledge, memory for operational continuity.

When should you use a knowledge graph instead of RAG?

Use a knowledge graph when your queries require multi-hop reasoning (find all products made by companies founded by people who attended MIT), when relationships between entities matter as much as entity attributes, or when you need consistent structured answers rather than retrieved text passages.

What vector databases are best for RAG in 2026?

Pinecone and Weaviate for hosted/managed. Qdrant for self-hosted with good performance. pgvector (PostgreSQL extension) for teams already on Postgres who want minimal operational overhead. For agent memory specifically, Redis with vector search handles short-term memory well.

RAG, Knowledge Graphs & Agent Memory Explained

RAG retrieves text chunks; knowledge graphs model relationships; agent memory persists state. Combine all three correctly and you get AI agents that reason over real, current, structured context — not just surface-level similarity matching. Here's how to think about each layer and when to use which.

RAG: What It Does Well and Where It Fails

RAG works by embedding documents into a vector space and retrieving the top-k most semantically similar chunks to a query. It excels at: finding relevant documentation passages, answering questions from a large corpus, and reducing hallucination by grounding responses in retrieved text.

RAG fails at: multi-hop reasoning (find the answer that requires linking three different documents), structured queries (give me all customers in Region A with ARR > $100K), and temporal reasoning (what changed between v1 and v2). For these, you need structured data or a knowledge graph, not cosine similarity.

Knowledge Graphs: When Relationships Matter

A knowledge graph stores entities (Shahriar Labs, LetX, Shihab Shahriar Antor) and typed relationships (Shahriar Labs → founder → Shihab Shahriar Antor, Shahriar Labs → builds → LetX). Queries like "who builds products that compete with Overleaf?" require traversing these relationships — RAG can't answer them reliably from text alone.

At Shahriar Labs, the Organization/Person/SoftwareApplication @graph in our schema.org markup is a lightweight public knowledge graph — it tells search engines and AI models exactly the entity relationships without requiring them to infer from text. Same principle, different scale.

Agent Memory: Three Tiers

In-context memory: The current prompt window — task state, recent tool outputs, working notes. Ephemeral, highest retrieval speed, limited by context window.

Session memory: Redis or DynamoDB with TTL. User preferences, conversation history, task progress within a session. Survives across agent invocations within a workflow run.

Long-term memory: Vector DB (Qdrant, pgvector) + structured store. Past decisions, domain knowledge learned from interactions, user-specific knowledge bases. See common-knowledge skill for an open-source implementation.

For production agent architecture, see our full guide on building production AI agents.

Frequently Asked Questions

RAG vs knowledge graph — what's the difference?: RAG retrieves similar text. Knowledge graphs model entity relationships for multi-hop reasoning.
What is agent memory?: Persistent state across agent invocations — task progress, past decisions, user preferences. Different from RAG's external knowledge retrieval.
When to use knowledge graph over RAG?: Multi-hop reasoning, structured relational queries, when entity relationships matter more than text similarity.
Best vector databases for RAG in 2026?: Pinecone/Weaviate (managed), Qdrant (self-hosted), pgvector (PostgreSQL teams).

Written by Shihab Shahriar Antor — AI Engineer & Founder of Shahriar Labs. Building LetX, QuantumSketch, and open-source AI agent skills.

RAG, Knowledge Graphs & Agent Memory Explained

RAG: What It Does Well and Where It Fails

Knowledge Graphs: When Relationships Matter

Agent Memory: Three Tiers

Frequently Asked Questions

Related Articles

How We Build AI That Actually Works: The Agency Stack

AI Agents Are Changing Software Dev in 2026