Production Memory Platform
AtlasRAG turns retrieval into durable, governed memory for AI agents.
AtlasRAG combines a high-speed C++ vector store with a secure Node gateway so teams can ingest knowledge, retrieve context, and continuously improve memory quality in production without building custom control loops.
This gives product teams a stable memory foundation for customer assistants, internal copilots, and workflow agents, with strong tenant boundaries, operational controls, and measurable runtime behavior.
Full stack placement
What AtlasRAG solves for client programs
Answer drift over time
Durable memory and recall controls keep agent outputs grounded in approved, reusable context.
Governance and isolation pressure
Tenant boundaries, visibility modes, ACLs, and role checks make data access explicit and auditable.
Cost growth from unmanaged memory
AMV-L lifecycle policies and TTL sweeps help bound memory growth and reduce unnecessary token use.
Opaque production behavior
Gateway telemetry, metrics, and job status endpoints create observability for operators and stakeholders.
Unified ingest, search, and answer APIs
AtlasRAG provides one operational surface for ingestion, retrieval, and answer generation, so implementation teams can ship memory-backed experiences without stitching multiple services together.
- /v1/docs and /v1/docs/url for indexing.
- /v1/search and /v1/ask for retrieval and answer workflows.
- Requests are scoped by tenant and collection to prevent cross-environment leakage.
- Idempotency keys on write operations reduce duplicate ingestion risk in distributed clients.
Durable memory model, not just transient context
Beyond chunks, AtlasRAG stores typed memory objects so agents can preserve and reuse knowledge with explicit meaning, provenance, and lifecycle controls.
- Six memory types: artifact, semantic, procedural, episodic, conversation, and summary.
- /v1/memory/write, /v1/memory/recall, and /v1/memory/reflect.
- Memory scoring signals from /v1/feedback and /v1/memory/event.
- Async reflection jobs generate derived knowledge while preserving links to source artifacts.
Enterprise governance and access control
Security and governance are built into the request path and data model, so client deployments can enforce policy without custom wrappers.
- JWT and service-token auth with role-based endpoint controls.
- Visibility modes: tenant, private, and acl.
- SSO support for Google, Azure, and Okta with tenant-level auth mode controls.
- Principal-aware recall constraints protect private and ACL-scoped memory access.
AMV-L keeps memory quality high over time
AtlasRAG applies Adaptive Memory Value + Lifecycle (AMV-L) to evaluate memory utility continuously and automate retention decisions based on measured behavior.
- Lifecycle actions promote, compact, retain, or delete by policy thresholds.
- TTL sweep remains the hard retention boundary for expired memory.
- Async job processing keeps reflection and lifecycle operations non-blocking.
- Value decay and redundancy sweeps reduce stale or duplicate memories in long-running deployments.
Operationally ready for developer teams
Integration and operations are designed for real delivery teams, from pilots through production scale-up.
- OpenAPI schema, Swagger UI, and official Node SDK for integration.
- Idempotency support for write/index/reflect endpoints and idempotent job reruns.
- Metrics, telemetry, and structured request logs for production visibility.
- Admin endpoints support service tokens, usage reporting, and tenant policy management.
Runtime architecture
Gateway API Layer
Handles auth, tenancy, ACL policy, idempotency, ingestion, retrieval, ask, and memory endpoints.
Persistent Data Plane
C++ TCP vector index stores embeddings while Postgres stores chunk text, memory items, links, and jobs.
Lifecycle & Telemetry Plane
Scheduled jobs run TTL, AMV-L lifecycle tasks, and telemetry snapshots for ongoing optimization.