Platform Thesis
MatrixArk turns LLM context into a production serving layer.
Production LLM apps need live context infrastructure, not another prompt template.
MatrixArk lets vertical AI companies send simple domain events and raw user questions;
MatrixArk handles context planning, schema validation, storage routing, freshness,
replay, and token-budgeted context-pack assembly.
TemporalStore is the default serving engine: time-aware memory, temporal KV, latest KV,
low-latency serving, replay, freshness, cache, and persistence. MatrixDB and MatrixKV
are complements: add MatrixDB for Redis-compatible hot state at scale, and add MatrixKV
only for low-volume transactional truth. The Rust version of TemporalStore is planned
to be open sourced in July 2026.
Time + Speed
Default serving engine for time-aware memory, temporal KV, latest KV, low-latency fetch, prompt replay, freshness, and long sequences. Planned Rust open source in July 2026.
- Cover most LLM context management use cases directly.
- Use multi-layer cache plus persistent storage.
- Serve fresh context and latest values in one path.
Open TemporalStore
Serverless DB
Complementary Redis-compatible, multi-tenant KV database for hot sessions, profile KV, LMCache metadata, scans, exports, and database-style operations.
- Support Redis migration and familiar APIs.
- Scale to tens of millions of QPS with tenant isolation.
- Serve large profile, summary, cache, scan, and export workloads.
Open MatrixDB
Truth + transactional
Complementary low-volume transactional KV for strong consistency, permissions, approvals, committed actions, and trusted control state.
- Usually not required for context management.
- Use for ownership, leases, approvals, and actions.
- Keep strong consistency separate from serving paths.
Open MatrixKV
Enterprise
Enterprise on-premise deployment by default for controlled data and context.
We expect enterprise teams to run MatrixArk inside their own network boundary first,
with Cursor-like platforms calling a local MatrixArk context endpoint for serving and policy decisions.
The goal is simple: keep user prompts, domain context, and context signals in the customer
infrastructure while still getting production-grade temporal context and replay.
Deployment model
- Private cloud and air-gapped clusters for regulated customers.
- Hybrid mode for selective cloud inference with on-prem context serving.
- Single CLI/deployment package with isolated tenant namespaces and optional service mesh.
Security and governance
- Tenant-level RBAC/ABAC, SAML/SSO, and policy bundles.
- Customer-managed keys for encryption at rest and TLS-enforced transit.
- Immutable audit log for query, ingest, decision, and replay operations.
Data control
- Context packs, memory rows, and replay records stay on customer infrastructure.
- Optional external retrieval stacks (VectorDB/S3/object store) can be wired in locally.
- Data retention and deletion policies are applied at tenant and workflow level.
Enterprise setup flow:
install MatrixArk in your VPC/VNET, connect your identity and observability stack,
register your domain schema once, then register Cursor-compatible MCP/context hooks for
before-tool-call and after-response enrichment.
Blogs
Product notes on why LLM context needs real infrastructure.
Time-aware context is still underused in LLM engineering. Read how TemporalStore makes
memory temporal, fresh, replayable, and cache-aware; how MatrixDB and MatrixKV complement
it only when database KV or strong consistency is needed; and why MatrixArk is different
from retrieval, framework memory, caches, and generic databases.
Flagship Use Case
How TemporalStore can power Milvus-backed hierarchy, bounded online queries, ingestion loops, and replayable context packs for vertical AI harnesses.
Time-Aware Context
How time validity, stale-memory blocking, prompt replay, and stable context reuse can save tokens and improve agent quality.
TemporalStore
The core product story: time-aware memory lets agents know what changed, what expired, what is still open, and what to replay.
Target Customers
How vertical AI teams can ship reliable domain agents with durable memory, prompt freshness, replay, and context governance.
Open Source TemporalStore
Use TemporalStore when the core need is context serving: temporal KV, latest KV, replay, freshness, and persistent memory.
Full Stack
When context becomes a platform: TemporalStore for serving, external VectorDB/S3 retrieval, MatrixDB for database KV, and MatrixKV for low-volume strong consistency.