Building a Memory Provider Plugin
Memory provider plugins give Hermes Agent persistent, cross-session knowledge beyond the built-in MEMORY.md and USER.md. This guide covers how to build one.
Directory Structure
Each memory provider lives in plugins/memory/<name>/:
plugins/memory/my-provider/
├── __init__.py # MemoryProvider implementation + register() entry point
├── plugin.yaml # Metadata (name, description, hooks)
└── README.md # Setup instructions, config reference, tools
The MemoryProvider ABC
Your plugin implements the MemoryProvider abstract base class from agent/memory_provider.py:
from agent.memory_provider import MemoryProvider
class MyMemoryProvider(MemoryProvider):
@property
def name(self) -> str:
return "my-provider"
def is_available(self) -> bool:
"""Check if this provider can activate. NO network calls."""
return bool(os.environ.get("MY_API_KEY"))
def initialize(self, session_id: str, **kwargs) -> None:
"""Called once at agent startup.
kwargs always includes:
hermes_home (str): Active HERMES_HOME path. Use for storage.
"""
self._api_key = os.environ.get("MY_API_KEY", "")
self._session_id = session_id
# ... implement remaining methods
Required Methods
Core Lifecycle
| Method | When Called | Must Implement? |
|---|---|---|
name (property) | Always | Yes |
is_available() | Agent init, before activation | Yes — no network calls |
initialize(session_id, **kwargs) | Agent startup | Yes |
get_tool_schemas() | After init, for tool injection | Yes |
handle_tool_call(name, args) | When agent uses your tools | Yes (if you have tools) |
Config
| Method | Purpose | Must Implement? |
|---|---|---|
get_config_schema() | Declare config fields for hermes memory setup | Yes |
save_config(values, hermes_home) | Write non-secret config to native location | Yes (unless env-var-only) |
Optional Hooks
| Method | When Called | Use Case |
|---|---|---|
system_prompt_block() | System prompt assembly | Static provider info |
prefetch(query) | Before each API call | Return recalled context |
queue_prefetch(query) | After each turn | Pre-warm for next turn |
sync_turn(user, assistant) | After each completed turn | Persist conversation |
on_session_end(messages) | Conversation ends | Final extraction/flush |
on_pre_compress(messages) | Before context compression | Save insights before discard |
on_memory_write(action, target, content) | Built-in memory writes | Mirror to your backend |
shutdown() | Process exit | Clean up connections |
Config Schema
get_config_schema() returns a list of field descriptors used by hermes memory setup:
def get_config_schema(self):
return [
{
"key": "api_key",
"description": "My Provider API key",
"secret": True, # → written to .env
"required": True,
"env_var": "MY_API_KEY", # explicit env var name
"url": "https://my-provider.com/keys", # where to get it
},
{
"key": "region",
"description": "Server region",
"default": "us-east",
"choices": ["us-east", "eu-west", "ap-south"],
},
{
"key": "project",
"description": "Project identifier",
"default": "hermes",
},
]
Fields with secret: True and env_var go to .env. Non-secret fields are passed to save_config().
Save Config
def save_config(self, values: dict, hermes_home: str) -> None:
"""Write non-secret config to your native location."""
import json
from pathlib import Path
config_path = Path(hermes_home) / "my-provider.json"
config_path.write_text(json.dumps(values, indent=2))
For env-var-only providers, leave the default no-op.
Plugin Entry Point
def register(ctx) -> None:
"""Called by the memory plugin discovery system."""
ctx.register_memory_provider(MyMemoryProvider())
plugin.yaml
name: my-provider
version: 1.0.0
description: "Short description of what this provider does."
hooks:
- on_session_end # list hooks you implement
Threading Contract
sync_turn() MUST be non-blocking. If your backend has latency (API calls, LLM processing), run the work in a daemon thread:
def sync_turn(self, user_content, assistant_content):
def _sync():
try:
self._api.ingest(user_content, assistant_content)
except Exception as e:
logger.warning("Sync failed: %s", e)
if self._sync_thread and self._sync_thread.is_alive():
self._sync_thread.join(timeout=5.0)
self._sync_thread = threading.Thread(target=_sync, daemon=True)
self._sync_thread.start()
Profile Isolation
All storage paths must use the hermes_home kwarg from initialize(), not hardcoded ~/.hermes:
# CORRECT — profile-scoped
from hermes_constants import get_hermes_home
data_dir = get_hermes_home() / "my-provider"
# WRONG — shared across all profiles
data_dir = Path("~/.hermes/my-provider").expanduser()
Testing
See tests/agent/test_memory_plugin_e2e.py for the complete E2E testing pattern using a real SQLite provider.
from agent.memory_manager import MemoryManager
mgr = MemoryManager()
mgr.add_provider(my_provider)
mgr.initialize_all(session_id="test-1", platform="cli")
# Test tool routing
result = mgr.handle_tool_call("my_tool", {"action": "add", "content": "test"})
# Test lifecycle
mgr.sync_all("user msg", "assistant msg")
mgr.on_session_end([])
mgr.shutdown_all()
Single Provider Rule
Only one external memory provider can be active at a time. If a user tries to register a second, the MemoryManager rejects it with a warning. This prevents tool schema bloat and conflicting backends.