# luki-memory-service

Open-source unified memory layer for AI agents: ELR ingestion, vector search, and session/KV memory.

`luki-memory-service` provides the persistence and retrieval layer for AI agents and applications. It unifies:
- Long‑term semantic memory (vector DB of ELR snippets, activity results, documents)
- Structured/KV memory (facts, preferences, flags, consents)
- Ephemeral session memory (short-term chat summaries)
- Ingestion pipelines (chunking, embedding, enrichment, redaction)
It exposes a clean API (gRPC/HTTP) so any service (agent, reporting, engagement) can store, search, and update user memory safely.
## Open-Source Scope

This repository contains the core open-source architecture for a memory service system. However, certain components contain proprietary business logic and have been sanitized for public release:

- **ELR Domain Knowledge** – specific Electronic Life Record schemas and processing logic contain proprietary healthcare and care domain expertise
- **ChromaDB Collections** – vector embeddings of proprietary knowledge bases and domain-specific content have been removed
- **Business Logic** – certain ingestion pipelines and knowledge processing contain proprietary algorithms for healthcare/care applications
The core memory service architecture, API design, and general-purpose components remain fully functional and open-source.
## Features

- **ELR Ingestion Pipeline** – chunking, metadata tagging, sensitivity labels, embedding generation.
- **Vector Retrieval** – KNN / hybrid search; filter by tags, time, consent scopes.
- **KV Store** – fast access to key facts (favorite music, mobility level) and agent state.
- **Session Summaries** – rolling conversation summaries and last-message buffers.
- **Redaction & Access Control** – strip PII, enforce per-field consent and role-based access.
- **Audit & Versioning** – immutable log of writes; soft delete and restore.
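The redaction feature can be illustrated with a minimal, stdlib-only sketch. This is not the shipped `redact.py` module; the patterns and placeholder tokens below are assumptions for illustration only:

```python
import re

# Hypothetical PII patterns; the real redact.py may use different
# patterns, placeholder tokens, and policies.
PII_PATTERNS = {
    "EMAIL": re.compile(r"\b[\w.+-]+@[\w-]+\.[\w.-]+\b"),
    "PHONE": re.compile(r"\+?\d(?:[\s-]?\d){6,13}"),
}

def redact(text: str) -> str:
    """Replace matched PII spans with [TYPE] placeholders."""
    for label, pattern in PII_PATTERNS.items():
        text = pattern.sub(f"[{label}]", text)
    return text

print(redact("Reach Alice at alice@example.com or +44 20 7946 0958."))
# → Reach Alice at [EMAIL] or [PHONE].
```

Running redaction before embedding (as this service does) means the placeholders, not the raw PII, end up in the vector store.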
## Tech Stack

- **Vector DB**: ChromaDB (default) or FAISS; adapters for managed services (Pinecone, Qdrant)
- **KV / Document Store**: PostgreSQL + pgvector; optional Redis/MongoDB backends
- **Embeddings**: sentence-transformers (local) or model-as-a-service via an internal endpoint
- **API Layer**: FastAPI (HTTP+JSON) and optional gRPC
- **Schemas**: pydantic models with JSON Schema export
- **ETL / Workers**: Celery / Dramatiq for async ingestion jobs
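A hypothetical environment fragment tying these choices together (the variable names below are illustrative assumptions, not the service's actual configuration keys — see `env.example` and `config.py` for the real ones):

```bash
# Illustrative configuration only; actual keys live in env.example / config.py
DATABASE_URL=postgresql://luki:luki@localhost:5432/luki_memory
VECTOR_BACKEND=chromadb          # or: faiss, pinecone, qdrant
CHROMA_PERSIST_DIR=./chroma_db
EMBEDDING_MODEL=sentence-transformers/all-MiniLM-L6-v2
REDIS_URL=redis://localhost:6379/0
```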
## Repository Structure

```text
luki-memory-service/
├── README.md
├── pyproject.toml
├── requirements.txt
├── requirements-railway.txt      # Railway deployment dependencies
├── runtime.txt                   # Python version specification
├── .env                          # environment variables (gitignored)
├── env.example                   # environment template
├── .railwayignore                # Railway deployment exclusions
├── .dockerignore                 # Docker build exclusions
├── Dockerfile                    # container build configuration
├── railway.toml                  # Railway deployment configuration
├── railway.json                  # Railway service configuration
├── nixpacks.toml                 # Nixpacks build configuration
├── Procfile                      # process definitions
├── MANIFEST.in                   # package manifest
├── download_models.py            # model download utility
├── data/                         # data storage directory
├── chroma_db/                    # ChromaDB vector store
├── luki_memory/
│   ├── __init__.py
│   ├── config.py                 # env, DB URLs, embedding model choice
│   ├── schemas/
│   │   ├── __init__.py
│   │   ├── elr.py                # ELR item, consent, sensitivity enums
│   │   └── query.py              # search requests/responses
│   ├── ingestion/
│   │   ├── __init__.py
│   │   ├── chunker.py            # text/media chunking
│   │   ├── embedder.py           # embedding calls
│   │   ├── embedding_integration.py  # embedding pipeline integration
│   │   ├── elr_ingestion.py      # ELR processing pipeline
│   │   ├── pipeline.py           # orchestration
│   │   └── redact.py             # PII/sensitive-field removal
│   ├── storage/
│   │   ├── __init__.py
│   │   ├── vector_store.py       # Chroma/FAISS adapters
│   │   ├── kv_store.py           # Postgres/Redis adapters
│   │   ├── session_store.py      # short-term memory
│   │   ├── elr_store.py          # ELR-specific storage
│   │   └── [additional storage adapters]
│   ├── api/
│   │   ├── __init__.py
│   │   ├── app.py                # FastAPI application setup
│   │   ├── config.py             # API configuration
│   │   ├── http.py               # FastAPI routes
│   │   ├── main.py               # main API entry point
│   │   ├── models.py             # API data models
│   │   ├── middleware.py         # API middleware
│   │   ├── dependencies.py       # dependency injection
│   │   ├── exceptions.py         # exception handlers
│   │   └── endpoints/
│   │       ├── __init__.py
│   │       ├── ingestion.py      # data ingestion endpoints
│   │       ├── search.py         # search endpoints
│   │       ├── users.py          # user management endpoints
│   │       ├── health.py         # health check endpoints
│   │       ├── kv.py             # key-value endpoints
│   │       ├── elr.py            # ELR-specific endpoints
│   │       └── [additional endpoint modules]
│   ├── auth/
│   │   ├── __init__.py
│   │   └── rbac.py               # role-based access checks
│   ├── audit/
│   │   ├── __init__.py
│   │   └── logger.py             # audit logging
│   ├── integrations/
│   │   ├── __init__.py
│   │   └── [integration modules]
│   └── utils/
│       └── ids.py                # ID generation, hashing, etc.
├── scripts/
│   ├── README.md                 # scripts documentation
│   ├── run_dev.sh                # development server
│   └── run_api_server.py         # API server runner
└── tests/
    ├── __init__.py
    ├── api/                      # API integration tests
    ├── unit/                     # unit tests
    └── validation/               # validation test documentation
```
## Quick Start

```bash
git clone [email protected]:REMELife/luki-memory-service.git
cd luki-memory-service
python -m venv venv && source venv/bin/activate
pip install -r requirements.txt

# Download required models
python download_models.py

# Start local services (example: docker-compose)
docker compose up -d   # launches Postgres + Chroma containers (if using)

# Run the API server
uvicorn luki_memory.api.main:app --reload --port 8002
```
## Usage Examples

### Ingest ELR Text

```python
import requests

payload = {
    "user_id": "user_123",
    "text": "Alice loves gardening and jazz. Wedding in 1975 in Madrid.",
    "tags": ["interests", "life_event"],
    "sensitivity": "normal",
}
r = requests.post(
    "http://localhost:8002/v1/elr/ingest_text",
    json=payload,
    headers={"Authorization": "Bearer devtoken"},
)
print(r.json())
```
### Semantic Search

```python
import requests

q = {
    "user_id": "user_123",
    "query": "music she enjoys",
    "k": 3,
    "filters": {"tags": ["interests"]},
}
r = requests.post(
    "http://localhost:8002/v1/elr/search",
    json=q,
    headers={"Authorization": "Bearer devtoken"},
)
for hit in r.json()["results"]:
    print(hit["score"], hit["text"])
```
### KV Store

```python
import requests

# Set favorite_artist
requests.post(
    "http://localhost:8002/v1/kv/set",
    json={"user_id": "user_123", "key": "favorite_artist", "value": "Miles Davis"},
    headers={"Authorization": "Bearer devtoken"},
)

# Get favorite_artist
g = requests.get(
    "http://localhost:8002/v1/kv/get",
    params={"user_id": "user_123", "key": "favorite_artist"},
    headers={"Authorization": "Bearer devtoken"},
)
print(g.json()["value"])
```
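The calls above can be bundled into a small client helper. The sketch below is not a shipped SDK: the endpoint paths match the examples above, and everything else (class name, method signatures, defaults) is an assumption:

```python
import json
import urllib.request

class MemoryClient:
    """Minimal illustrative stdlib-only wrapper around the HTTP API above."""

    def __init__(self, base_url: str, token: str):
        self.base_url = base_url.rstrip("/")
        self.headers = {
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        }

    def _post(self, path: str, body: dict) -> dict:
        req = urllib.request.Request(
            f"{self.base_url}{path}",
            data=json.dumps(body).encode(),
            headers=self.headers,
            method="POST",
        )
        with urllib.request.urlopen(req) as resp:
            return json.load(resp)

    def ingest_text(self, user_id, text, tags=None, sensitivity="normal"):
        return self._post("/v1/elr/ingest_text",
                          {"user_id": user_id, "text": text,
                           "tags": tags or [], "sensitivity": sensitivity})

    def search(self, user_id, query, k=3, filters=None):
        return self._post("/v1/elr/search",
                          {"user_id": user_id, "query": query,
                           "k": k, "filters": filters or {}})

    def kv_set(self, user_id, key, value):
        return self._post("/v1/kv/set",
                          {"user_id": user_id, "key": key, "value": value})

client = MemoryClient("http://localhost:8002", "devtoken")
# client.kv_set("user_123", "favorite_artist", "Miles Davis")  # needs a running server
```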
## Security & Data Management

- All endpoints require a service token (checked in `auth/rbac.py`).
- Each ELR item carries a `consent_scope` and `sensitivity`; queries must specify role/need.
- Redaction runs before embedding to avoid leaking PII into vectors.
- Every write/delete is audited; a version hash is kept for integrity checks.
- Nightly snapshots of Postgres and vector store indexes go to an encrypted S3 bucket.
- `migrations/` contains Alembic scripts; never change old migrations, add new ones.
- `export_user_memory(user_id)` supports GDPR export/delete flows.
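The "version hash for integrity checks" can be sketched as a content hash over a canonical JSON encoding. The real audit logger's record format is not shown in this repository summary, so the function below is an assumption:

```python
import hashlib
import json

def version_hash(record: dict) -> str:
    """Deterministic SHA-256 over a canonical JSON encoding of the record."""
    canonical = json.dumps(record, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()

a = version_hash({"user_id": "user_123", "text": "Alice loves jazz"})
b = version_hash({"text": "Alice loves jazz", "user_id": "user_123"})
assert a == b  # key order does not affect the hash
```

Sorting keys and fixing separators makes the hash stable across serializers, so a replayed audit log can be verified field-for-field.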
## Observability

- Prometheus endpoint at `/metrics` for query counts, latency, and failures.
- Trace IDs are logged to correlate with agent requests.
- Alerts fire on ingestion failures, high latency, and index corruption.
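Query counts, latency, and failures of the kind exported at `/metrics` can be captured with a simple decorator. This stdlib-only sketch stands in for the actual middleware, which presumably uses a Prometheus client library:

```python
import time
from collections import defaultdict

# In-process metric registry (illustrative; Prometheus would scrape these)
METRICS = {"count": defaultdict(int),
           "latency_ms": defaultdict(list),
           "failures": defaultdict(int)}

def instrumented(name):
    """Record call count, latency, and failures for the wrapped function."""
    def wrap(fn):
        def inner(*args, **kwargs):
            start = time.perf_counter()
            try:
                return fn(*args, **kwargs)
            except Exception:
                METRICS["failures"][name] += 1
                raise
            finally:
                METRICS["count"][name] += 1
                METRICS["latency_ms"][name].append(
                    (time.perf_counter() - start) * 1000)
        return inner
    return wrap

@instrumented("elr_search")
def fake_search(query):
    return [query]

fake_search("music")
print(METRICS["count"]["elr_search"])  # → 1
```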
## Roadmap

- Hybrid retrieval (BM25 + vector fusion)
- Multimedia embeddings (audio/image snippets)
- Federated ingestion mode (Flower / PySyft integration)
- Differential privacy noise for cohort analytics
- TTL policies for session memory & stale KV entries
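One common way to implement the planned BM25 + vector fusion is reciprocal rank fusion (RRF). Whether this project will use RRF is not stated, so the sketch below is purely illustrative:

```python
from collections import defaultdict

def rrf_fuse(rankings, k=60):
    """Reciprocal rank fusion: score(d) = sum over lists of 1 / (k + rank)."""
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)

bm25 = ["d3", "d1", "d2"]   # lexical ranking
vec = ["d1", "d2", "d4"]    # vector ranking
print(rrf_fuse([bm25, vec]))  # → ['d1', 'd2', 'd3', 'd4']
```

RRF needs only ranks, not comparable scores, which sidesteps the problem of normalizing BM25 scores against cosine similarities.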
## Contributing

- Branch naming: `feat/memory-...`, `fix/ingest-...`
- Add unit tests for new storage adapters or consent logic
- Never commit real user data; use synthetic test data only
- PRs require review and passing CI checks
- Follow privacy-by-design principles for all new features
## License

MIT License
Copyright © 2025 LUKi Memory Service Contributors
Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
*Memory with meaning. Privacy with power.*