Simple pricing, by usage.
Free to get started with. Pay only for what you use, with your own spend limits.
Free
For builders tinkering, prototypes and sideprojects.
- $5/mo of usage built in
- Hermes Plugin
- Supermemory MCP
- Community support
Pro
For small teams and plugin power-users.
- ~$20/mo of usage built in
- Unlimited storage
- Unlimited users
- Auto top-up available – Buy additional usage
- Google Drive, Notion & OneDrive connectors
- 2 teammates included
- OpenClaw, Claude Code and other plugins
- Email support
Max
More headroom for developers who need it.
- ~$130/mo of usage built in (6× Pro)
- Unlimited storage
- Unlimited users
- Gmail connector
- Granola connector
- Auto top-up available
- OpenClaw, Claude Code and other plugins
- Priority support
Scale
For teams running production workloads.
- ~$600/mo of usage built in
- Unlimited storage
- Unlimited users
- Up to 10 teammates
- All connectors (Gmail, GitHub, S3, Web Crawler + Pro)
- Auto top-up + spend caps
- Priority support
- SOC 2 · HIPAA BAA
- Self-hosted option
Used by the best teams
Context infrastructure for your agents
Memory
Memory graph per user. Auto profiles and fact hierarchies so agents learn in real-time.
- Plain text $0.005
- Rich content $0.010
2× cheaper than next-best, with better quality. Powered by our own model.
SuperRAG
Multimodal Extraction -> Contextual Chunking -> Retrieval for agents. No embeddings or vectors required.
- Text mode $0.001
- Rich mode $0.002
'Rich' = images, PDFs, audio, video. SOTA extraction, always tracked.
Search and Traversal
(Honestly?) insanely cheap semantic search and graph traversal against your content.
- Hybrid search — RAG+Memory in one call
- Graph traversal across linked memories
- Configurable filters and re-ranking
Sub-300ms p50. Built for agent loops
Operations
Additional operations for API calls
- Re-ranking
- Aggregation
- Query rewriting
- Other operations
Composable building blocks for richer queries.
Billed in SM tokens — unique content we ingest, deduplicated at the byte level. Repeats cost nothing: a 100% prompt-cache discount, baked into every plan.
Startup & Research Program
Qualifying early-stage startups and academic research projects get production-grade memory infrastructure, free. Ship the agent, we'll cover the context.
$1,000 in free credits
Dedicated support · 6 months to build · all features unlocked
Frequently asked questions
How does the credit balance work?
Every plan comes with a monthly dollar balance. Each API call — storing memories, searching, indexing — draws from that balance at the rates listed above. When the balance runs out, you can top up or auto top-up to keep going. No surprise bills at the end of the month.
What's an SM token?
SM tokens are the unique tokens Supermemory actually ingests and embeds — repeats and unchanged content don't get billed again. Effectively a 100% discount on what a normal prompt cache would re-charge you for. Plain text is $0.005 per 1K SM tokens; rich content (PDFs, audio, video) is $0.010 per 1K because it needs heavier extraction.
What happens if I re-store the same content?
Nothing — you're not billed again. Supermemory deduplicates at the token level, so re-uploading a doc, syncing a connector, or pushing the same conversation history won't redraw from your balance. Only net-new content counts as SM tokens. This is why production agents that loop over the same context end up an order of magnitude cheaper here than with a typical vector DB.
Do unused credits roll over?
Subscription credits reset monthly. Top-up credits you purchase in advance never expire — they sit in your balance until you use them.
What happens if I exceed my balance?
On Free, you'll be paused — pay-as-you-go isn't available on Free, so you'll need to upgrade to Pro or Scale to keep going. On Pro and Scale, auto top-up kicks in to keep your app running. You can set hard spend caps on Scale to prevent runaway usage.
Can I self-host?
Self-hosted deployments are available on Scale and Enterprise. Enterprise additionally supports fully air-gapped deployments (LLM inference may be the only outbound dependency - Unless GPUs are available!).
Do you offer startup or academic research credits?
Yes — qualifying early-stage startups and academic research teams get $1,000 in credits, dedicated support, and 6 months to build. Apply via the Startup Program link above.