Memory Transformer is a compact, practical library for storing and retrieving textual memories directly inside small neural modules — no external vector DB required. It brings transformer-inspired encoders together with trainable memory slots so applications can write, query, and manage memories using pure PyTorch objects.
- Store memory signals in neural weights for tight, offline retrieval.
- Lightweight: designed to run locally without cloud embedding services.
- Flexible: supports a char-level transformer encoder (`MemoryTransformer`) and a conv-based neural memory (`HierarchicalMemoryModel`) for different trade-offs between speed and accuracy.
- Neurogenesis: automatically grows memory capacity by creating new neurons when the memory bank fills up.
- Write memories by optimizing trainable memory slots to match an encoded representation of the text.
- Query by cosine similarity with optional token-overlap boosts and synaptic-strength weighting (a scoring sketch follows this list).
- Save and load full model state (weights + human-readable metadata) via PyTorch serialization.
- Simple REST API wrapper included (`mem_t/mem0_server.py`) for quick service deployment.
- Compatible with the mem0 API, making it easy to migrate from mem0-based projects.
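To make the scoring rule above concrete, here is a minimal sketch of cosine similarity combined with a token-overlap boost and a synaptic-strength weight. The function name, helper logic, and weighting constant are illustrative assumptions, not mem_t's actual internals.

```python
# Illustrative sketch of the query scoring described above; the helper
# name and the overlap weight are assumptions, not mem_t API.
import torch
import torch.nn.functional as F

def score_memories(query_vec, slot_vecs, query_text, memory_texts, strengths,
                   overlap_weight=0.1):
    """Rank memory slots: cosine similarity, plus a small boost for
    shared tokens, scaled by each slot's synaptic strength."""
    # Cosine similarity between the query vector and every slot vector.
    cos = F.cosine_similarity(query_vec.unsqueeze(0), slot_vecs, dim=1)

    # Token-overlap boost: fraction of query tokens present in each memory text.
    q_tokens = set(query_text.lower().split())
    overlaps = torch.tensor([
        len(q_tokens & set(t.lower().split())) / max(len(q_tokens), 1)
        for t in memory_texts
    ])

    # Synaptic-strength weighting: frequently reinforced slots rank higher.
    return (cos + overlap_weight * overlaps) * strengths
```

In practice, `strengths` would track how often each slot is written or retrieved, so well-used memories surface first.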
- Create and activate a virtual environment (recommended):

```bash
python -m venv .venv
source .venv/bin/activate
```

- Install dependencies (PyTorch must be installed according to your platform/GPU):

```bash
pip install -r requirements.txt
```

- Run the bundled server (example):

```bash
export MEM0_BASE_URL=http://0.0.0.0:8123
python -m mem_t.mem0_server
# then POST /v1/memories/ and /v1/memories/search/ to add/search memories
```
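Once the server is running, you can exercise the two endpoints above from Python. The payload fields below (`messages`, `user_id`, `query`) are assumptions based on the mem0-style API; check the handlers in `mem_t/mem0_server.py` for the exact schema.

```python
# Minimal client sketch for the bundled server; the payload fields are
# assumed from the mem0-style API, so verify them against mem_t/mem0_server.py.
import requests

BASE = "http://0.0.0.0:8123"

# Add a memory.
requests.post(f"{BASE}/v1/memories/", json={
    "messages": [{"role": "user", "content": "Alice enjoys coffee and hiking"}],
    "user_id": "alice",
})

# Search memories.
resp = requests.post(f"{BASE}/v1/memories/search/", json={
    "query": "coffee",
    "user_id": "alice",
})
print(resp.json())
```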
- Memory Transformer (transformer-based encoder + slot bank):

```python
from mem_t.memo_tra_model import MemoryTransformer

mt = MemoryTransformer(max_slots=1024)
key = mt.add_memory("Alice enjoys coffee and hiking", user_id="alice")
results = mt.query("coffee", user_id="alice", top_k=5)
for score, item in results:
    print(score, item.text)
```

- Neural Hierarchical Memory (conv encoder + memory bank):
```python
from mem_t.neuro_mem import HierarchicalMemoryModel

hmm = HierarchicalMemoryModel(max_slots=2048)
key = hmm.add_memory("Remember to call Bob tomorrow", user_id="me")
print(hmm.query("call Bob", user_id="me", top_k=3))
```

- Local-first: run fully offline without external embedding providers.
- Interpretable: textual memory items are stored alongside the learned vectors for inspection and debugging.
- Minimal ops: writing is implemented as a short gradient loop to sculpt a slot vector, keeping the system simple and robust.
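To make that concrete, the sketch below shows the general shape of such a write: freeze the encoded text as a target and run a few gradient steps on one trainable slot. The encoder stand-in, loss, and hyperparameters are illustrative assumptions, not mem_t's exact implementation.

```python
# Illustrative write-as-optimization sketch; the encoder, loss, and
# hyperparameters are assumptions, not mem_t's exact internals.
import torch

def write_memory(encode, slot, text, steps=20, lr=0.1):
    """Sculpt a trainable slot vector to match the encoded text."""
    target = encode(text).detach()          # frozen target representation
    opt = torch.optim.Adam([slot], lr=lr)
    for _ in range(steps):
        opt.zero_grad()
        loss = torch.nn.functional.mse_loss(slot, target)
        loss.backward()
        opt.step()
    return slot

# Usage: the slot must be a leaf tensor with requires_grad=True;
# the lambda is a stand-in for a real text encoder.
slot = torch.zeros(128, requires_grad=True)
write_memory(lambda t: torch.randn(128), slot, "Alice enjoys coffee")
```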
- Personal assistants that keep short/long-term user memories locally.
- Research prototypes exploring learned memory storage and neuro-inspired mechanisms like replay and capacity growth (a capacity-growth sketch follows this list).
- Embedded systems and privacy-sensitive applications where cloud embeddings or vector DBs are undesirable.
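As one example of the capacity-growth idea mentioned above, a neurogenesis step can simply append fresh trainable slots once the bank is full. The sketch below is a minimal illustration under that assumption; it is not mem_t's actual growth policy.

```python
# Minimal neurogenesis sketch: grow the slot bank when it fills up.
# The growth policy (doubling via grow_by) is an assumption for illustration.
import torch

class SlotBank(torch.nn.Module):
    def __init__(self, n_slots=4, dim=128):
        super().__init__()
        self.slots = torch.nn.Parameter(torch.randn(n_slots, dim) * 0.01)
        self.used = 0

    def add(self, vec):
        # Neurogenesis: create new "neurons" when every slot is occupied.
        if self.used == self.slots.shape[0]:
            self._grow(grow_by=self.slots.shape[0])
        with torch.no_grad():
            self.slots[self.used] = vec
        self.used += 1

    def _grow(self, grow_by):
        # Append freshly initialized trainable slots to the bank.
        fresh = torch.randn(grow_by, self.slots.shape[1]) * 0.01
        self.slots = torch.nn.Parameter(torch.cat([self.slots.data, fresh]))
```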
If you have questions or suggestions, open an issue.
Enjoy building with Memory Transformer — small neural memories, big impact.