Deployment Guide

Choose your deployment mode and get started in minutes.

Deployment Modes

Choose the deployment that fits your privacy and operational needs. All modes support ONE LLM Mode — a toggle that lets your MCP client's own LLM handle memory extraction, eliminating extra LLM API calls.

Setup Guide

Step-by-step instructions for each deployment mode. Pick one and get started in minutes.

Fully managed on Neuromem Cloud. Memories are stored in hosted PostgreSQL + pgvector with automatic scaling, backups, and zero maintenance.

1

Sign up and get an API key

Go to neuromem.cloud, sign up, create an Agent, and generate an API Key.

2

Add to your MCP client

claude mcp add --scope user --transport http neuromem \
  https://api.neuromem.cloud/mcp/ \
  --header "Authorization: Bearer nm_sk_..."
*

ONE LLM Mode: Per-agent toggle in Dashboard > Agent settings. When on, your MCP client's LLM handles extraction — no server LLM needed. When off, the server's LLM handles extraction.