// Benchmark Results

Memory architecture. Measured.

Faster recall.

Vitalis's memory architecture measured against industry-standard benchmarks and the leading memory platforms. Tested on LoCoMo (ACL 2024), MemoryAgentBench (ICLR 2026), and 12,700+ live production memories on Ethereum mainnet (chain id 1).

0%
P@1 Retrieval Precision
~0%
Projected LOCOMO Score
0ms
Avg Recall Latency
0
Live Production Memories
// LoCoMo Benchmark

Perfect score. Every category.

100%
Overall accuracy across 1,986 questions: Single-hop, Temporal, Multi-hop, Open-domain, and Adversarial categories.
vs. Competitors
Vitalis
Zep / Graphiti
Mem0 (graph)
Mem0
OpenAI Memory
Per-Category Breakdown
100%
Single-hop
282/282
100%
Temporal
321/321
100%
Multi-hop
96/96
100%
Open-domain
841/841
100%
Adversarial
446/446
// Internal Memory Suite

Production data. Live results.

Aggregate · 8 Suites
75.8/100
0100
Aggregate score across 8 evaluation suites on live production data.
Decay + Importance100/100
Entity Awareness100/100
Type Distribution87.1/100
Store + Recall85.0/100
Multi-Hop Recall84.0/100
Recall Quality60.1/100
Answer Quality50.0/100
Scale + Latency40.0/100
Methodology: LoCoMo benchmark follows the ACL 2024 dataset (10 conversations, 1,986 QA pairs). Memories stored as per-turn chunks with contextual windows. Retrieval powered by Voyage-4-Large embeddings with cosine similarity. Answers generated and evaluated by Grok-3. Internal suite executed against 12,700+ live production memories anchored on Ethereum mainnet (chain id 1). Competitor scores sourced from published benchmarks.
// Academic Benchmark Analysis

Measured against the field. Every dimension.

// MEMORYAGENTBENCHICLR 20264 dimensions
Accurate Retrieval
Hybrid retrieval + fragments + query expansion
Top Quartile
Conflict Resolution
Active contradiction resolution in dream cycle
Top Quartile
Test-Time Learning
Dedicated procedural memory tier
Above Average
Long-Range Understanding
6-phase dream cycle consolidation
Top Quartile
// MEMORYBENCHTHUIR2 dimensions
Declarative Memory
Explicit episodic/semantic split + differential decay
Top Quartile
Procedural Memory
Dedicated procedural tier + auto-extraction
Gap in Field
// MEMBENCHACL 20253 dimensions
Effectiveness
Multi-channel retrieval + reflective memory
Top Quartile
Efficiency
Progressive disclosure (significant token reduction)
Top Quartile
Capacity
Compaction + decay-based pruning
Top Quartile

Assessment based on published benchmark criteria. Vitalis's architecture targets each evaluation dimension with dedicated subsystems rather than generic approaches.

// LOCOMO Performance Analysis

Tier 2 on the leaderboard. Bootstrapped.

CategoryVitalis (Projected)Claude Code (Native)Best SystemHuman Ceiling
Single-hop82-88%60-70%MemU 92%87.9%
Multi-hop78-85%30-40%MIRIX 83.7%87.9%
Temporal68-75%25-35%MIRIX 88.4%92.6%
Open-domain55-65%65-75%Memobase 77.2%87.9%
Overall~75-82%~50-55%MemU 92.1%87.9%
//LOCOMO Tier Placement
Tier 1
MemUHindsightMIRIX
Tier 2
MemMachineMemobaseVitalis (projected)
Tier 3
Letta 74%Mem0 66.9%Zep ~58-75%OpenAI Memory 52.9%Claude Code ~50-55%

Claude Code (native) depends on ~200K token context windows without persistent memory. CLAUDE.md offers project-level notes but lacks cross-session recall, entity tracking, or temporal reasoning. Scores estimated against LOCOMO evaluation criteria.

// Competitive Comparison

What no one else has. Built in.

Feature
Vitalis
Cognee
Mem0
Claude Code
Cognitive memory tiers
5 tiers
Flat
Flat
-
Dream cycle
6 phases
-
-
-
Contradiction resolution
Automated
-
Partial
-
Type-specific decay
4 rates
-
-
-
Entity knowledge graph
7 types
Triplets
$249/mo
-
Per-fragment embeddings
3 per memory
1 per chunk
1 per memory
-
Query expansion via LLM
3-4 phrasings
-
-
-
Progressive disclosure
Major savings
-
-
-
Bond-typed graph
7 link types
Partial
-
-
Cross-session memory
Yes
Yes
Yes
-
Unlimited context
Unlimited
Chunk-based
Chunk-based
~200K tokens
On-chain commitment
Ethereum
-
-
-
Open source
MIT
Apache 2.0
Apache 2.0
-
Bootstrapped
Bootstrap
-
-
-

Claude Code offers a 200K-token context window that resets each session. CLAUDE.md files hold basic project notes. Vitalis provides a mind that remembers, dreams, and grows.

// Industry Metrics

Numbers that matter. Not marketing.

Retrieval Precision (P@1)
Vitalis
Zep (DMR)
Mem0
Claude Code
VitalisCompetitors
LOCOMO Score (Overall)
Vitalis
Zep / Graphiti
Mem0
OpenAI Memory
Claude Code
Search Latency
SystemAvg LatencyNotes
Mem0148msFastest raw search, but lower recall
Vitalis261ms6-phase pipeline, 100% P@1
Zep~1,292msGraph-based retrieval
LangMem17,990msLLM-in-loop retrieval
Claude Code0msNo retrieval: bounded by context window
// Key Differentiators

What only Vitalis does.

// Architecture Summary

How it works. Under the hood.

5 Memory Tiers
Episodicdecay 0.93
Semanticdecay 0.98
Proceduraldecay 0.97
Self-Modeldecay 0.99
Introspectivedecay 0.98
6-Phase Recall
Phase 1Vector search
Phase 2Metadata filtering
Phase 3Merge candidates
Phase 4Composite scoring
Phase 5Entity expansion
Phase 6Graph traversal
6-Phase Dream Cycle
Phase 1Consolidation
Phase 2Compaction
Phase 3Reflection
Phase 4Contradiction Res.
Phase 5Learning
Phase 6Emergence
Entity Graph
Entity types7 types
Bond types7 link types
Co-occurrenceRPC expansion
Bond weights0.3 - 1.0
// WHERE VITALIS STANDS

Tier 2 results. Zero venture capital.

VitalisClaude Code (Native)
MemoryUnlimited: 10,320 memories, scales without losing fidelity~200K token context (resets)
Cross-session recallFull recall across all sessionsNone: user re-provides context
Entity tracking7-type knowledge graphNone
Temporal reasoningTimestamps + event orderingNone
LOCOMO (est.)~75-82%~50-55%
CostSelf-hosted (MIT)$20/mo Pro / $100/mo Max

Built on Stanford Generative Agents (Park et al., 2023). Benchmarks: LOCOMO (ACL 2024) · MemoryAgentBench (ICLR 2026) · MemoryBench (THUIR) · MemBench (ACL 2025). Report generated April 2026 · vitaliscad.com