Codemem

A standalone Rust memory engine for AI coding assistants. Single binary, zero runtime deps.

Codemem stores what your AI assistant discovers -- files read, symbols searched, edits made -- so repositories don't need re-exploring across sessions. Optionally enriches the graph with compiler-grade cross-references via SCIP indexers.

Quick Start

Install

# Shell (macOS/Linux)
curl -fsSL https://raw.githubusercontent.com/cogniplex/codemem/main/install.sh | sh

# Homebrew
brew install cogniplex/tap/codemem

# Cargo
cargo install codemem

Or download a prebuilt binary from Releases.

Platform	Architecture	Binary
macOS	ARM64 (Apple Silicon)	`codemem-macos-arm64.tar.gz`
Linux	x86_64	`codemem-linux-amd64.tar.gz`
Linux	ARM64	`codemem-linux-arm64.tar.gz`

Initialize

cd your-project
codemem init

Downloads the local embedding model (~440MB, one-time), registers lifecycle hooks, and configures the MCP server for your AI assistant. Automatically detects Claude Code, Cursor, and Windsurf.

That's it

Codemem now automatically captures context, injects prior knowledge at session start, and provides 26 MCP tools to your assistant.

Map your codebase (optional)

Run the full analysis pipeline -- indexes your codebase with tree-sitter, enriches the graph with SCIP cross-references (if indexers are installed), computes PageRank, and detects architectural clusters:

codemem analyze

Then launch the code-mapper agent to do deep, agent-driven analysis -- it spawns a team of specialized agents that traverse the knowledge graph, discover patterns, and store architectural insights:

claude --agent code-mapper

See Index & Enrich Pipeline for what happens under the hood.

SCIP enrichment (optional, recommended)

SCIP (Source Code Intelligence Protocol) gives codemem compiler-grade cross-references -- every call, import, type reference, and override in your codebase, with zero false positives. codemem analyze auto-detects installed indexers and runs them automatically.

Install the indexer for your language:

Language	Indexer	Install
Rust	rust-analyzer	`rustup component add rust-analyzer`
TypeScript/JavaScript	scip-typescript	`npm install -g @sourcegraph/scip-typescript`
Python	scip-python	`npm install -g @sourcegraph/scip-python`
Go	scip-go	`go install github.com/sourcegraph/scip-go/cmd/scip-go@latest`
Java/Kotlin	scip-java	scip-java releases
C#	scip-dotnet	`dotnet tool install --global scip-dotnet`
Ruby	scip-ruby	`gem install scip-ruby`

Codemem detects languages from manifest files (Cargo.toml, package.json, pyproject.toml, go.mod, etc.) and checks PATH for the corresponding indexer. If found, SCIP edges are fused with ast-grep pattern edges -- when both sources agree on the same edge, their confidence scores sum (0.10 + 0.15 = 0.25), producing higher-fidelity graph connections.

Configure via ~/.codemem/config.toml:

[scip]
enabled = true                    # Master switch (default: true)
auto_detect_indexers = true       # Check PATH for available indexers
cache_index = true                # Cache .scip files between runs
cache_ttl_hours = 24              # Re-index if cache older than this
create_external_nodes = true      # Create ext: nodes for dependency symbols
store_docs_as_memories = true     # Attach hover docs as memories
hierarchical_containment = true   # Build nested containment tree
collapse_intra_class_edges = true # Fold intra-class calls into parent metadata

[scip.fan_out_limits]
module = 200     # Modules can be widely imported
class = 50       # Classes referenced moderately
function = 30    # Functions and methods less so
method = 30

# Override auto-detected indexer commands per language:
# [scip.indexers]
# rust = "rust-analyzer scip ."
# typescript = "scip-typescript index --infer-tsconfig"

No SCIP indexer installed? No problem -- codemem works fine without it. You just get ast-grep pattern edges (confidence 0.10) instead of compiler-grade ones.

Key Features

Graph-vector hybrid architecture -- HNSW vector search (768-dim) + petgraph knowledge graph (PageRank, Louvain community detection, betweenness centrality, BFS/DFS, SCC, topological sort, and more)
SCIP integration -- Compiler-grade cross-references via SCIP indexers (rust-analyzer, scip-typescript, scip-python, scip-go, scip-java, and more). Multi-layer edge fusion with ast-grep: when both sources agree, confidence scores sum. Auto-detects installed indexers, caches results, and cleans up stale nodes on re-index
26 MCP tools -- Memory CRUD, self-editing (refine/split/merge), graph traversal, code search, consolidation, impact analysis, session context, pattern detection over JSON-RPC
4 lifecycle hooks -- Automatic context injection (SessionStart), prompt capture (UserPromptSubmit), observation capture (PostToolUse), and session summaries (Stop)
8-component hybrid scoring -- Vector similarity, graph strength, BM25 token overlap, temporal alignment, tag matching, importance, confidence, and recency
Code-aware indexing -- tree-sitter structural extraction for 14 languages (Rust, TypeScript/JS/JSX, Python, Go, C/C++, Java, Ruby, C#, Kotlin, Swift, PHP, Scala, HCL/Terraform) with manifest parsing (Cargo.toml, package.json, go.mod, pyproject.toml)
Contextual embeddings -- Metadata and graph context enriched before embedding for higher recall precision
Pluggable embeddings -- Candle (local BERT, default), Ollama, or any OpenAI-compatible API
Cross-session intelligence -- Pattern detection, file hotspot tracking, decision chains, and session continuity
Memory consolidation -- 5 neuroscience-inspired cycles: Decay (power-law), Creative/REM (semantic KNN), Cluster (cosine + union-find), Summarize (LLM-powered), Forget
Self-editing memory -- Refine, split, and merge memories with full provenance tracking via temporal graph edges
Operational metrics -- Per-tool latency percentiles (p50/p95/p99), call counters, and gauges via codemem_status tool
Real-time file watching -- notify-based watcher with <50ms debounce and .gitignore support
Persistent config -- TOML-based configuration at ~/.codemem/config.toml
Production hardened -- Zero .unwrap() in production code, safe concurrency, versioned schema migrations

Benchmarks

Although codemem is designed for code exploration memory (not generic conversational recall), it scores competitively on standard memory benchmarks:

Benchmark	Score	Notes
LoCoMo (ACL 2024)	91.64%	vs 90.53% published SOTA — stricter conditions: recall limit 10, no evidence oracle, no embedding fallback
LongMemEval (ICLR 2025)	70%	vs 71.2% Zep, 82.4% oracle — recall limit 10, GPT-4o judge

Both benchmarks use stricter conditions than published baselines: recall limit of 10 (vs 50-100), no evidence oracle, no embedding fallback. Both were run with OpenAI text-embedding-3-small. With the built-in local BERT model (BAAI/bge-base-en-v1.5), LoCoMo scores 89.58% — a ~2% gap that graph expansion closes entirely (91.49% for both models in codemem-graph mode). Higher scores are achievable with better embedding models without any architectural changes.

See bench/locomo/ and bench/longmemeval/ for methodology, reproduction steps, and detailed breakdowns.

How It Works

graph LR
    A[AI Assistant] -->|SessionStart hook| B[codemem context]
    A -->|PostToolUse hooks| C[codemem ingest]
    A -->|Stop hook| E[codemem summarize]
    A -->|MCP tools| D[codemem serve]
    B -->|Inject context| A
    C --> F[Storage + Vector + Graph]
    D --> F
    F -->|Recall| A

Passively captures what your AI reads, searches, and edits via lifecycle hooks
Actively recalls relevant context via MCP tools with 8-component hybrid scoring
Injects context at session start so your assistant picks up where it left off

Hybrid scoring

Component	Weight
Vector similarity	25%
Graph strength (PageRank + betweenness + degree + cluster)	20%
BM25 token overlap	15%
Temporal	10%
Importance	10%
Confidence	10%
Tag matching	5%
Recency	5%

Weights are configurable via codemem config set scoring.<key> <value> and persist in ~/.codemem/config.toml.

Configuration

Embedding providers

By default, Codemem runs a local BERT model (no API key needed). To use a remote provider:

# Ollama (local server)
export CODEMEM_EMBED_PROVIDER=ollama

# OpenAI-compatible (works with Voyage AI, Together, Azure, etc.)
export CODEMEM_EMBED_PROVIDER=openai
export CODEMEM_EMBED_URL=https://api.voyageai.com/v1
export CODEMEM_EMBED_MODEL=voyage-3
export CODEMEM_EMBED_API_KEY=pa-...

Observation compression

Optionally compress raw tool observations via LLM before storage:

export CODEMEM_COMPRESS_PROVIDER=ollama   # or openai, anthropic

Persistent config

Scoring weights, vector/graph tuning, and storage settings persist in ~/.codemem/config.toml. Partial configs merge with defaults.

MCP Tools

26 tools organized by category. See MCP Tools Reference for full API documentation.

Category	Tools
Memory CRUD (7)	`store_memory`, `recall`, `delete_memory`, `associate_memories`, `refine_memory`, `split_memory`, `merge_memories`
Graph & Structure (6)	`graph_traverse`, `summary_tree`, `codemem_status`, `search_code`, `get_symbol_info`, `get_symbol_graph`
Graph Analysis (5)	`find_important_nodes`, `find_related_groups`, `get_cross_repo`, `get_node_memories`, `node_coverage`
Consolidation & Patterns (3)	`consolidate`, `detect_patterns`, `get_decision_chain`
Namespace (3)	`list_namespaces`, `namespace_stats`, `delete_namespace`
Session & Context (2)	`session_checkpoint`, `session_context`

CLI

codemem init          # Initialize project (model + hooks + MCP)
codemem search        # Search memories
codemem stats         # Database statistics
codemem serve         # Start MCP server (JSON-RPC stdio)
codemem index         # Index codebase with tree-sitter
codemem consolidate   # Run consolidation cycles
codemem analyze       # Full pipeline: index + enrich + PageRank + clusters
codemem watch         # Real-time file watcher
codemem export/import # Backup and restore (JSONL, JSON, CSV, Markdown)
codemem sessions      # Session management (list, start, end)
codemem doctor        # Health checks on installation
codemem config        # Get/set configuration values
codemem migrate       # Run pending schema migrations

See CLI Reference for full usage.

Performance

Operation	Target
HNSW search k=10 (100K vectors)	< 2ms
Embedding (single sentence)	< 50ms
Embedding (cache hit)	< 0.01ms
Graph BFS depth=2	< 1ms
Hook ingest (Read)	< 200ms

Documentation

Architecture -- System design, data flow diagrams, storage schema
Index & Enrich Pipeline -- Step-by-step data flow from source files to annotated graph
MCP Tools Reference -- All 26 tools with parameters and examples
CLI Reference -- All 19 commands
Comparison -- vs Mem0, Zep/Graphiti, Letta, claude-mem, and more

Building from Source

git clone https://github.com/cogniplex/codemem.git
cd codemem
cargo build --release          # Optimized binary at target/release/codemem
cargo test --workspace         # Run all tests
cargo bench                    # Criterion benchmarks

6-crate Cargo workspace. See CONTRIBUTING.md for development guidelines.

Research and Inspirations

Codemem builds on ideas from several research papers, blog posts, and open-source projects.

Papers

Paper	Venue	Key Contribution
HippoRAG	NeurIPS 2024	Neurobiologically-inspired long-term memory using LLMs + knowledge graphs + Personalized PageRank. Up to 20% improvement on multi-hop QA.
From RAG to Memory	ICML 2025	Non-parametric continual learning for LLMs (HippoRAG 2). 7% improvement in associative memory tasks.
A-MEM	2025	Zettelkasten-inspired agentic memory with dynamic indexing, linking, and memory evolution.
MemGPT	ICLR 2024	OS-inspired hierarchical memory tiers for LLMs -- self-editing memory via function calls.
MELODI	Google DeepMind 2024	Hierarchical short-term + long-term memory compression. 8x memory footprint reduction.
ReadAgent	Google DeepMind 2024	Human-inspired reading agent with episodic gist memories for 20x context extension.
LoCoMo	ACL 2024	Benchmark for evaluating very long-term conversational memory (300-turn, 9K-token conversations).
Mem0	2025	Production-ready AI agents with scalable long-term memory. 26% accuracy improvement over OpenAI Memory.
Zep	2025	Temporal knowledge graph architecture for agent memory with bi-temporal data model.
Memory in the Age of AI Agents	Survey 2024	Comprehensive taxonomy of agent memory: factual, experiential, working memory.
AriGraph	2024	Episodic + semantic memory in knowledge graphs for LLM agent exploration.

Blog posts and techniques

Contextual Retrieval (Anthropic, 2024) -- Prepending chunk-specific context before embedding reduces failed retrievals by 49%. Codemem adapts this as template-based contextual enrichment using metadata + graph relationships.
Contextual Embeddings Cookbook (Anthropic) -- Implementation guide for contextual embeddings with prompt caching.

Open-source projects

AutoMem -- Graph-vector hybrid memory achieving 90.53% on LoCoMo. Direct inspiration for Codemem's hybrid scoring and consolidation cycles.
claude-mem -- Persistent memory compression via Claude Agent SDK. Inspired lifecycle hooks and observation compression.
Mem0 -- Production memory layer for AI (47K+ stars). Informed memory type design.
Zep/Graphiti -- Temporal knowledge graph engine. Inspired graph persistence model.
Letta (MemGPT) -- Stateful AI agents with self-editing memory.
Cognee -- Knowledge graph memory via triplet extraction.
claude-context -- AST-aware code search via MCP (by Zilliz).

See docs/comparison.md for detailed feature comparisons.

License

Apache 2.0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Codemem

Quick Start

Install

Initialize

That's it

Map your codebase (optional)

SCIP enrichment (optional, recommended)

Key Features

Benchmarks

How It Works

Hybrid scoring

Configuration

Embedding providers

Observation compression

Persistent config

MCP Tools

CLI

Performance

Documentation

Building from Source

Research and Inspirations

License

About

Uh oh!

Releases 18

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 161 Commits
.github		.github
bench		bench
crates		crates
docs		docs
ui		ui
.gitignore		.gitignore
.release-please-manifest.json		.release-please-manifest.json
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
install.sh		install.sh
package.json		package.json
release-please-config.json		release-please-config.json

Folders and files

Latest commit

History

Repository files navigation

Codemem

Quick Start

Install

Initialize

That's it

Map your codebase (optional)

SCIP enrichment (optional, recommended)

Key Features

Benchmarks

How It Works

Hybrid scoring

Configuration

Embedding providers

Observation compression

Persistent config

MCP Tools

CLI

Performance

Documentation

Building from Source

Research and Inspirations

License

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 18

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages