# Core Concepts

Architecture and how CortexaDB works
This guide explains the architecture and design principles behind CortexaDB.
## Overview
CortexaDB is an embedded database — it runs inside your application's process, not as a separate server. It stores all data in a single directory on disk and provides an in-memory query engine for fast retrieval.
The database is built around three pillars:
- Vector Search - Find semantically similar memories using embedding similarity
- Graph Relations - Connect memories with directed edges and traverse them
- Temporal Awareness - Filter and boost results based on when memories were created
## Architecture

```
┌──────────────────────────────────────────────────┐
│            Python API (PyO3 Bindings)            │
│  CortexaDB, Namespace, Embedder, chunk(), etc.   │
└────────────────────────┬─────────────────────────┘
                         │
┌────────────────────────▼─────────────────────────┐
│                 CortexaDB Facade                 │
│       High-level API (remember, ask, etc.)       │
└────────────────────────┬─────────────────────────┘
                         │
┌────────────────────────▼─────────────────────────┐
│                  CortexaDBStore                  │
│    Concurrency coordinator & durability layer    │
│  ┌────────────────┐  ┌────────────────────────┐  │
│  │   WriteState   │  │      ReadSnapshot      │  │
│  │    (Mutex)     │  │  (ArcSwap, lock-free)  │  │
│  └────────────────┘  └────────────────────────┘  │
└───────┬──────────────────┬───────────────┬───────┘
        │                  │               │
┌───────▼──────┐   ┌───────▼───────┐  ┌────▼───────────┐
│    Engine    │   │   Segments    │  │  Index Layer   │
│    (WAL)     │   │   (Storage)   │  │                │
│              │   │               │  │  VectorIndex   │
│   Command    │   │  MemoryEntry  │  │  HnswBackend   │
│   recording  │   │  persistence  │  │  GraphIndex    │
│              │   │               │  │  TemporalIndex │
│   Crash      │   │     CRC32     │  │                │
│   recovery   │   │   checksums   │  │  HybridQuery   │
└──────────────┘   └───────────────┘  └────────────────┘
        │
┌───────▼─────────────┐
│    State Machine    │
│  (In-memory state)  │
│  - Memory entries   │
│  - Graph edges      │
│  - Temporal index   │
└─────────────────────┘
```

## Key Components
### Facade

The `CortexaDB` facade is the primary entry point. It provides the high-level API (`remember`, `ask`, `connect`, etc.) and delegates to the store for durability and concurrency.
### Store

The `CortexaDBStore` coordinates concurrent access:

- Single Writer: A `Mutex<WriteState>` ensures writes are serialized and deterministic
- Concurrent Readers: An `ArcSwap<ReadSnapshot>` provides lock-free read access — readers never block writers and vice versa
- Background Sync: A dedicated thread handles disk fsync based on the configured sync policy
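This split can be sketched as a toy Python analogue (the `Store` and `Snapshot` classes here are illustrative, not CortexaDB's actual API): writers serialize on a lock and publish a fresh immutable snapshot with a single reference assignment, while readers only ever dereference the current snapshot.

```python
import threading

class Snapshot:
    """Immutable view of the database state; shared by all readers."""
    def __init__(self, entries):
        self.entries = dict(entries)  # copied once, never mutated afterwards

class Store:
    def __init__(self):
        self._write_lock = threading.Lock()  # serializes writers (Mutex<WriteState>)
        self._snapshot = Snapshot({})        # atomically swapped reference (ArcSwap)

    def insert(self, key, value):
        # Writers take the lock, build a fresh snapshot, then publish it
        # with a single atomic reference assignment.
        with self._write_lock:
            entries = dict(self._snapshot.entries)
            entries[key] = value
            self._snapshot = Snapshot(entries)

    def get(self, key):
        # Readers never take a lock: they read the snapshot reference once
        # and work against that immutable view.
        snap = self._snapshot
        return snap.entries.get(key)
```

Because readers copy nothing and hold no lock, a long-running read keeps working against the snapshot it started with, which mirrors the lock-free behavior described above.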
### State Machine
The in-memory state machine holds the current database state:
- All memory entries indexed by ID
- Graph edges (directed, per-namespace)
- Temporal index (BTreeMap of timestamp to memory IDs)
- Next ID counter
Every mutation goes through the state machine, ensuring consistency between disk and queries.
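As a rough illustration of that design (the command shape and field names below are invented for the sketch, not CortexaDB's wire format), a minimal state machine applies every mutation through a single `apply` method:

```python
class StateMachine:
    """Toy in-memory state: entries by ID, directed edges, temporal index."""
    def __init__(self):
        self.entries = {}   # id -> content
        self.edges = {}     # id -> list of (relation, target_id)
        self.by_time = {}   # timestamp -> [ids]  (stand-in for a BTreeMap)
        self.next_id = 1    # auto-incrementing ID counter

    def apply(self, command):
        # Every mutation flows through apply(), so replaying the same
        # command sequence always reproduces the same state.
        op = command["op"]
        if op == "insert":
            mid = self.next_id
            self.next_id += 1
            self.entries[mid] = command["content"]
            self.by_time.setdefault(command["ts"], []).append(mid)
            return mid
        if op == "connect":
            self.edges.setdefault(command["src"], []).append(
                (command["relation"], command["dst"]))
        elif op == "delete":
            self.entries.pop(command["id"], None)
```

Funneling mutations through one deterministic entry point is what lets WAL replay reconstruct an identical state after a restart.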
### Engine (WAL)
The Write-Ahead Log is the source of truth for durability. Every command (insert, delete, connect) is first appended to the WAL before updating the state machine. On startup, the WAL is replayed to reconstruct the state.
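The append-then-apply ordering can be sketched in a few lines of Python (a toy using a JSON-lines log held in memory, not CortexaDB's actual binary log format):

```python
import json

class WriteAheadLog:
    """Toy WAL: commands are serialized and appended before being applied."""
    def __init__(self):
        self.records = []  # stands in for an append-only file on disk

    def append(self, command):
        self.records.append(json.dumps(command))

    def replay(self, apply_fn):
        # On startup, every logged command is re-applied in order,
        # deterministically reconstructing the in-memory state.
        for line in self.records:
            apply_fn(json.loads(line))

def make_apply(state):
    def apply_command(cmd):
        if cmd["op"] == "insert":
            state[cmd["id"]] = cmd["content"]
        elif cmd["op"] == "delete":
            state.pop(cmd["id"], None)
    return apply_command

wal = WriteAheadLog()
live = {}
apply_live = make_apply(live)
for cmd in [{"op": "insert", "id": 1, "content": "hello"},
            {"op": "insert", "id": 2, "content": "world"},
            {"op": "delete", "id": 2}]:
    wal.append(cmd)   # durable record first...
    apply_live(cmd)   # ...then mutate in-memory state

# Simulated restart: replay the log into a fresh state.
recovered = {}
wal.replay(make_apply(recovered))
```

The key invariant is the ordering: a command is never visible in memory unless it has already been logged, so a crash between the two steps loses nothing that was acknowledged.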
### Segments
Large memory payloads are stored in append-only segment files. Each segment is capped at 10MB before rotating to a new file. Segments use CRC32 checksums for integrity verification.
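Checksummed record framing can be illustrated with a small sketch (the `[length][crc32][payload]` layout here is an assumption for illustration, not CortexaDB's actual on-disk format):

```python
import zlib

SEGMENT_CAP = 10 * 1024 * 1024  # 10MB cap before rotating, per the docs above

def frame_record(payload: bytes) -> bytes:
    """Frame a payload as [length][crc32][payload] for an append-only segment."""
    crc = zlib.crc32(payload) & 0xFFFFFFFF
    return len(payload).to_bytes(4, "big") + crc.to_bytes(4, "big") + payload

def read_record(buf: bytes, offset: int):
    """Read one framed record at offset, verifying its checksum."""
    n = int.from_bytes(buf[offset:offset + 4], "big")
    crc = int.from_bytes(buf[offset + 4:offset + 8], "big")
    payload = buf[offset + 8:offset + 8 + n]
    if zlib.crc32(payload) & 0xFFFFFFFF != crc:
        raise ValueError("segment record failed CRC32 check (corruption)")
    return payload, offset + 8 + n
```

On read, a checksum mismatch signals a torn or corrupted record rather than silently returning bad data.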
### Index Layer
The index layer provides fast retrieval through multiple backends:
- `VectorIndex` - Cosine similarity search (exact or HNSW)
- `GraphIndex` - BFS/DFS traversal of memory connections
- `TemporalIndex` - Time-range filtering
These are combined by the hybrid query engine for multi-signal retrieval.
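As a reference point for what the exact (non-HNSW) vector path computes, here is a brute-force cosine-similarity scan over a toy index (`exact_search` and its signature are illustrative):

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

def exact_search(query, index, k=2):
    """Brute-force scan: score every stored embedding, return top-k IDs."""
    scored = sorted(index.items(),
                    key=lambda kv: cosine(query, kv[1]),
                    reverse=True)
    return [mid for mid, _ in scored[:k]]
```

HNSW trades this exact O(n) scan for an approximate graph walk that is much faster at scale.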
## Data Model

### Memory Entry
A memory is the fundamental unit of storage:
| Field | Type | Description |
|---|---|---|
| `id` | `u64` | Auto-incrementing unique identifier |
| `namespace` | `String` | Isolation scope (default: `"default"`) |
| `content` | `bytes` | Raw content (typically UTF-8 text) |
| `embedding` | `Vec<f32>?` | Optional vector embedding |
| `metadata` | `Dict[str, str]` | Key-value metadata pairs |
| `created_at` | `u64` | Unix timestamp (seconds) |
| `importance` | `f32` | User-defined importance score |
### Graph Edges
Edges are directed relationships between memories:
`Memory A --[relates_to]--> Memory B`

- Edges are namespaced — you cannot create edges across namespaces
- Each memory can have multiple outgoing edges
- Used by the query engine for graph expansion during hybrid search
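Graph expansion during hybrid search can be sketched as a plain BFS over an adjacency map (a simplification; `expand` and `max_depth` are names invented for this sketch):

```python
from collections import deque

def expand(edges, seeds, max_depth=2):
    """BFS from seed memory IDs over directed edges, up to max_depth hops."""
    seen = set(seeds)
    frontier = deque((s, 0) for s in seeds)
    while frontier:
        node, depth = frontier.popleft()
        if depth == max_depth:
            continue  # don't traverse beyond the hop limit
        for target in edges.get(node, []):
            if target not in seen:
                seen.add(target)
                frontier.append((target, depth + 1))
    return seen
```

Because edges are directed, only outgoing connections are followed; the hop limit keeps expansion from pulling in the whole graph.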
### Hit (Query Result)

`Hit(id=42, score=0.87)`

A query result containing the memory ID and a relevance score (0.0 to 1.0).
## Write Path

- Command is constructed (e.g., `InsertMemory`)
- Command is appended to the WAL (with CRC32 checksum)
- Memory payload is written to the current segment file
- State machine is updated in-memory
- Vector index is updated (if embedding is present)
- Read snapshot is atomically swapped for readers
- Disk fsync happens based on sync policy
## Read Path
- Query embedding is compared against the vector index
- Top candidates are fetched (exact scan or HNSW)
- Optional graph expansion via BFS
- Optional temporal filtering
- Scores are combined with configurable weights
- Top-k results are returned as `Hit` objects
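The final scoring step can be sketched as a linear blend (the weight values and the `combine`/`top_k` helpers here are illustrative; the actual weights are configurable):

```python
def combine(vector_score, graph_score, temporal_score,
            weights=(0.7, 0.2, 0.1)):
    """Blend per-signal scores into a single relevance value."""
    wv, wg, wt = weights
    return wv * vector_score + wg * graph_score + wt * temporal_score

def top_k(candidates, k, weights=(0.7, 0.2, 0.1)):
    """candidates maps memory ID -> (vector, graph, temporal) scores."""
    scored = [(mid, combine(*s, weights=weights))
              for mid, s in candidates.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]
```

With a blend like this, a memory that is only moderately similar to the query can still outrank a closer match if it is strongly connected in the graph or temporally relevant.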
## Crash Recovery
On startup, CortexaDB recovers through:
- Load checkpoint (if one exists) — fast binary snapshot of the state machine
- Replay WAL — apply any commands written after the checkpoint
- Rebuild segment index — scan segment files to build the offset index
- Repair mismatches — sync missing vectors from the state machine to the HNSW index
This ensures zero data loss for any committed write, even after a crash.
## Next Steps
- Storage Engine - Deep dive into WAL, segments, and checkpoints
- Query Engine - How hybrid search works
- Configuration - Tune CortexaDB for your use case
