LanceDB Pro

memory-lancedb-pro 

 Enhanced LanceDB memory plugin for OpenClaw — community reference guide 

 Overview 

 memory-lancedb-pro is a community-developed, production-grade long-term memory plugin for OpenClaw. It replaces the built-in memory-lancedb plugin with a significantly more capable retrieval pipeline, designed for agents that need persistent, high-quality memory across sessions without manual tagging or configuration overhead. 

 The core problem it solves: standard OpenClaw agents have no memory between sessions. Every conversation starts from zero. memory-lancedb-pro automatically captures what matters from each session and retrieves relevant context in future ones. 

 Primary upstream repo: CortexReach/memory-lancedb-pro . Several community forks exist (win4r, McBorisson, fryeggs, kvc0769) with varying additions such as Volcengine multimodal embeddings or unified Claude Code/Claude Desktop support. 

 OpenClaw 2026.3+ compatibility: The CortexReach fork has been updated to use before_prompt_build hooks, replacing the deprecated before_agent_start hook. If you are on 2026.3.24 or later, use this fork. Run openclaw doctor --fix after upgrading. 

 

 Feature Comparison 

 

 

 

 Feature 

 Built-in memory-lancedb 

 memory-lancedb-pro 

 

 

 

 

 Vector search 

 ✓ 

 ✓ 

 

 

 BM25 full-text search 

 ✗ 

 ✓ 

 

 

 Hybrid fusion (Vector + BM25) 

 ✗ 

 ✓ configurable weights 

 

 

 Cross-encoder reranking 

 ✗ 

 ✓ Jina, SiliconFlow, Pinecone, etc. 

 

 

 Recency / time decay scoring 

 ✗ 

 ✓ 

 

 

 MMR diversity filtering 

 ✗ 

 ✓ 

 

 

 Multi-scope isolation 

 ✗ 

 ✓ global / agent / project / user 

 

 

 Smart LLM extraction 

 ✗ 

 ✓ optional, uses any OpenAI-compatible LLM 

 

 

 Management CLI 

 ✗ 

 ✓ list / search / stats / delete / export / import 

 

 

 Auto-capture on session end 

 ✓ basic 

 ✓ with deduplication, up to 3 per turn 

 

 

 Auto-recall before prompt 

 ✓ basic 

 ✓ adaptive — skips trivial/short queries 

 

 

 Noise filtering 

 ✗ 

 ✓ 

 

 

 Migration tool from built-in plugin 

 — 

 ✓ 

 

 

 

 

 Retrieval Pipeline 

 Queries pass through a multi-stage pipeline before results are injected into the agent prompt: 

 

 Embed query — using the configured OpenAI-compatible embedding provider 

 Parallel search — vector ANN search (cosine distance) + BM25 full-text search run simultaneously 

 Hybrid fusion — vector score used as base; BM25 hits receive a configurable weighted boost 

 Rerank — optional cross-encoder reranking via external API (60% cross-encoder score + 40% fused score) 

 Lifecycle decay scoring — recency boost, time decay, importance weight, length normalisation 

 Filter — hard minimum score, noise filter, MMR diversity deduplication 

 Inject — surviving memories injected as <relevant-memories> context block 

 

 If the reranker API fails, the pipeline degrades gracefully to cosine similarity reranking. 

 

 Installation 

 1. Clone into your OpenClaw workspace 

 cd ~/.openclaw/workspace

git clone https://github.com/CortexReach/memory-lancedb-pro.git plugins/memory-lancedb-pro

cd plugins/memory-lancedb-pro

npm install 

 Common mistake: Cloning the repo somewhere other than your workspace and then using a relative path in plugins.load.paths . Relative paths are resolved from the workspace root. Use an absolute path if cloning elsewhere. 

 2. Disable the built-in memory plugin 

 Only one memory plugin can be active at a time. If you previously used memory-lancedb , disable it before enabling this plugin. 

 3. Add to openclaw.json 

 {

 "plugins": {

 "load": {

 "paths": ["plugins/memory-lancedb-pro"]

 },

 "entries": {

 "memory-lancedb-pro": {

 "enabled": true,

 "config": {

 "embedding": {

 "apiKey": "${JINA_API_KEY}",

 "model": "jina-embeddings-v5-text-small",

 "baseURL": "https://api.jina.ai/v1",

 "dimensions": 1024,

 "taskQuery": "retrieval.query",

 "taskPassage": "retrieval.passage",

 "normalized": true

 }

 }

 }

 },

 "slots": {

 "memory": "memory-lancedb-pro"

 }

 }

} 

 Config changes require a gateway restart. With config watch enabled (default), this happens automatically. 

 

 Key Configuration Options 

 

 

 

 Option 

 Default 

 Notes 

 

 

 

 

 autoCapture 

 true 

 Capture memories at session end 

 

 

 autoRecall 

 true 

 Inject memories before prompt build 

 

 

 smartExtraction 

 true 

 Use LLM to classify memories instead of regex 

 

 

 extractMinMessages 

 3 

 Minimum messages before extraction runs 

 

 

 captureAssistant 

 true 

 Set false to only capture user messages 

 

 

 retrieval.mode 

 hybrid 

 vector , bm25 , or hybrid 

 

 

 retrieval.vectorWeight 

 0.7 

 Weight for vector scores in hybrid fusion 

 

 

 retrieval.bm25Weight 

 0.3 

 Weight for BM25 scores in hybrid fusion 

 

 

 rerank.enabled 

 false 

 Enable cross-encoder reranking 

 

 

 rerank.candidatePoolSize 

 12 

 Candidates passed to reranker 

 

 

 rerank.minScore 

 0.6 

 Soft minimum score post-rerank 

 

 

 rerank.hardMinScore 

 0.62 

 Hard cutoff — below this is always dropped 

 

 

 sessionMemory.enabled 

 true 

 Store session summaries on /new 

 

 

 autoRecall.minPromptLength 

 15 (EN) / 6 (CJK) 

 Skip recall for very short queries 

 

 

 

 

 Management CLI 

 The plugin ships with a CLI for direct memory management: 

 openclaw memory-pro list # list stored memories

openclaw memory-pro search <query> # semantic/keyword search

openclaw memory-pro stats # storage stats

openclaw memory-pro delete <id> # delete a specific memory

openclaw memory-pro export # export all memories

openclaw memory-pro import <file> # import memories 

 

 Agent Tool Definitions 

 When loaded, the plugin registers these tools for the agent to use directly: 

 

 memory_recall — retrieve relevant memories for a query 

 memory_store — explicitly store a memory 

 memory_forget — delete a memory by ID or query 

 memory_update — update an existing memory 

 

 Plus additional management tools exposed via the CLI commands above. 

 

 Multi-Scope Isolation 

 Memories can be scoped to control access between agents and users: 

 

 global — shared across all agents 

 agent:<id> — isolated to a specific agent 

 project:<id> — shared within a project 

 user:<id> — per-user isolation (useful for multi-user bots) 

 custom:<name> — arbitrary named scope 

 

 

 Telegram Setup 

 If running OpenClaw with Telegram, the easiest way to configure the plugin is via the bot directly. Send the following to your main bot: 

 Help me connect this memory plugin with the most user-friendly configuration:

https://github.com/CortexReach/memory-lancedb-pro

Requirements:

1. Set it as the only active memory plugin

2. Use Jina for embedding and reranker

3. Use gpt-4o-mini for the smart-extraction LLM

... (continue with your preferences) 

 

 Important Notes 

 jiti cache: After modifying any .ts file in the plugin, you must clear the jiti cache before restarting the gateway, or OpenClaw will load stale compiled code: rm -rf /tmp/jiti/ && openclaw gateway restart 

 Memory quality guidelines: Never store raw conversation summaries, large blobs, or duplicates. Prefer structured, atomic facts with keywords. On any tool failure or repeated error, call memory_recall with relevant keywords before retrying — the fix may already be stored. 

 Spaced repetition: Frequently recalled memories decay more slowly, similar to spaced-repetition learning systems. 

 

 Notable Community Forks 

 

 

 

 Fork 

 Notable additions 

 

 

 

 

 CortexReach/memory-lancedb-pro 

 Primary upstream. Updated for OpenClaw 2026.3+ hook architecture. 

 

 

 win4r/memory-lancedb-pro 

 Widely referenced in docs; standard feature set. 

 

 

 fryeggs/memory-lancedb-pro 

 Unified edition — extends to Claude Code, Codex CLI, and Claude Desktop via shared LanceDB backend. 

 

 

 kvc0769/memory-lancedb-pro 

 Adds Volcengine multimodal embedding support. 

 

 

 McBorisson/memory-lancedb-pro 

 Uses RRF fusion (vs. weighted boost in other forks); includes JSONL distillation pipeline. 

 

 

 

 

 Generated March 2026. Sources: CortexReach/memory-lancedb-pro, openclaw/openclaw docs, LanceDB blog.