Metadata-Version: 2.3
Name: memuri
Version: 0.1.5
Summary: Memuri is a self‑hosted memory infrastructure for AI applications, offering a unified, pluggable SDK to persist and retrieve conversational context with sub‑second latency. It enables any AI agent or assistant to remember key information across sessions—so your apps never forget what matters, while retaining full flexibility to swap storage backends, embedding models, and retrieval strategies.
License: MIT
Author: Ankhbayasgalan anhbaysgalan1@gmail.com
Requires-Python: >=3.10,<4.0
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Programming Language :: Python :: 3.13
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Requires-Dist: aioredis (>=2.0.1,<3.0.0)
Requires-Dist: asyncpg (==0.30.0)
Requires-Dist: celery (==5.5.2)
Requires-Dist: faiss-cpu (==1.11.0)
Requires-Dist: google-genai (==1.16.1)
Requires-Dist: hnswlib (>=0.7.0,<0.8.0)
Requires-Dist: hydra-core (==1.3.2)
Requires-Dist: mkdocs (==1.6.1)
Requires-Dist: mkdocstrings (==0.29.1)
Requires-Dist: openai (==1.79.0)
Requires-Dist: opentelemetry-sdk (==1.33.1)
Requires-Dist: orjson (>=3.9.10,<4.0.0)
Requires-Dist: pgvector (==0.4.1)
Requires-Dist: prometheus-client (==0.22.0)
Requires-Dist: pydantic (==2.11.4)
Requires-Dist: pydantic-settings (>=2.1.0,<3.0.0)
Requires-Dist: pymilvus (>=2.3.3,<3.0.0)
Requires-Dist: qdrant-client (>=1.7.0,<2.0.0)
Requires-Dist: redis (==6.1.0)
Requires-Dist: sentence-transformers (>=2.2.2,<3.0.0)
Requires-Dist: sqlalchemy (==2.0.41)
Requires-Dist: uvloop (==0.21.0)
Project-URL: Repository, https://github.com/username/memuri
Description-Content-Type: text/markdown

# Memuri - High-Performance Memory SDK

Memuri is a pip-installable SDK for high-performance, pluggable conversational memory services. Designed for local, self-hosted deployment with sub-second latency and production-grade practices.

## Key Features

- **Self-hosted**: No external dependencies or licenses required
- **High Performance**: Sub-100ms memory operations with Faiss and Redis caching
- **Pluggable Architecture**: Swap vector databases, task queues, and LLMs without code changes
- **Production-grade**: Integrated OpenTelemetry traces, health checks, and robust CI/CD
- **Memory Categories**: Organize context by category (PERSONAL, TASK, QUESTION, etc.)
- **Feedback Loop**: Adaptive classification based on user feedback
- **Reranking**: Advanced cross-encoder reranking for more relevant results
- **Flexible Configuration**: Easy configuration with environment variables, dictionary-based config, or direct settings
- **Multiple Embedding Providers**: Support for OpenAI, Google Gemini, Azure, and Sentence Transformers

## Installation

You can install Memuri directly from PyPI:

```bash
pip install memuri
```

Or using Poetry:

```bash
poetry add memuri
```

## Getting Started

### Basic Usage

```python
from memuri import Memuri

# Initialize memory with default settings
memory = Memuri()

# Add a memory item
memory.add_memory(content="John's favorite color is blue", category="PERSONAL")

# Search for relevant memories
results = memory.search_memory("What does John like?")
print(results)
```

### Using Config Dictionary

```python
import os
from memuri import Memuri

# Set API key in environment variable
os.environ["OPENAI_API_KEY"] = "your_api_key"

# Create config with specific provider settings
config = {
    "embedder": {
        "provider": "openai",
        "config": {
            "model": "text-embedding-3-small"
        }
    }
}

# Initialize with config
memory = Memuri.from_config(config)
```

### Storing Chat Conversations

```python
# Add a conversation as memory
messages = [
    {"role": "user", "content": "I'm planning to watch a movie tonight. Any recommendations?"},
    {"role": "assistant", "content": "How about sci-fi? I recommend Interstellar."},
    {"role": "user", "content": "I love sci-fi movies!"}
]

# Store the conversation with user ID
await memory.add(messages, user_id="john")
```

## Architecture

Memuri is designed around a layered memory system:

1. **Short-Term Memory**: HNSW In-Memory Index + Redis LRU Cache
2. **Long-Term Memory**: Pluggable Vector Stores (pgvector, Milvus, Qdrant, Redis Vector)
3. **Memory Triggers**: Category classifiers and rule engine for contextual decisions
4. **Feedback Loop**: Continuous adaptation based on user interactions

## Performance Testing

Memuri includes a comprehensive latency testing framework to measure performance metrics:

### Using the Latency Test Script

```bash
# Run all tests with default settings
cd src/memuri/tests
./run_latency_tests.py

# Run only add memory tests with 20 iterations
./run_latency_tests.py --test-type add --iterations 20

# Run only search tests with custom database URL
./run_latency_tests.py --test-type search --db-url postgresql://user:pass@localhost:5432/mydb

# Set log level to DEBUG for more detailed output
./run_latency_tests.py --log-level DEBUG

# Don't save results to a file
./run_latency_tests.py --no-save
```

### Test Output

The test runner will output detailed statistics for each operation:

```
=== Add Memory Benchmark Results ===
Total operations: 10
Mean latency: 831.96ms
Median latency: 990.54ms
Min latency: 322.88ms
Max latency: 1361.35ms
95th percentile: 1273.73ms
Standard deviation: 387.65ms
==================================

=== Search Memory Benchmark Results ===
Total operations: 10
Mean latency: 817.86ms
Median latency: 804.20ms
Min latency: 410.45ms
Max latency: 1388.41ms
95th percentile: 1283.28ms
Standard deviation: 359.60ms
==================================

```

### Programmatic Usage

You can also use the LatencyBenchmark class in your own code:

```python
import asyncio
from memuri import Memuri
from memuri.tests.test_latency import LatencyBenchmark

async def measure_performance():
    client = Memuri()
    benchmark = LatencyBenchmark("Custom Test")
    
    # Measure add_memory operation
    for i in range(5):
        await benchmark.measure_operation(
            client.add_memory,
            content=f"Test memory {i}",
            category="FACT"
        )
    
    # Print stats
    benchmark.print_stats()
    
    # Get stats as dictionary
    stats = benchmark.get_stats()
    print(f"95th percentile latency: {stats['p95']:.2f}ms")

asyncio.run(measure_performance())
```

## Development and Deployment

### Publishing to PyPI

The project includes a script to handle publishing to PyPI with automatic version management:

```bash
# From project root
./scripts/upload_to_pypi.sh
```

This script will:
1. Automatically increment version numbers (patch, minor, or major)
2. Run tests before building
3. Build and upload the package to PyPI
4. Create git tags and commits for the release

## Next Steps

- Check the [Quick Start](usage/quickstart.md) guide
- Learn about [Configuration](usage/configuration.md) options
- Explore [API Reference](api-reference/index.md) 
- Learn advanced patterns with [Cookbooks](cookbooks/index.md)
- View [Examples](examples/index.md) for complete solutions

## Updates and Improvements

### Latest Updates (v0.2.0)

We've enhanced Memuri with several key improvements:

1. **Enhanced Category System**
   - Hierarchical memory categorization with main categories and subcategories
   - 15 main categories and 45+ subcategories for precise memory organization
   - Backward compatibility with legacy categories

2. **Improved OpenAI Embedding Support**
   - Dynamic client configuration with proper error handling
   - Support for all OpenAI embedding models including text-embedding-3-small/large
   - Advanced configuration options (proxies, Azure integration, custom parameters)

3. **Feedback System Enhancement**
   - Track and analyze feedback for both categories and subcategories
   - Improved classifier training based on user feedback
   - Parent-child relationship awareness in memory classification

4. **Configuration Flexibility**
   - Enhanced `from_config` method for cleaner, more intuitive setup
   - Direct parameter passing to embedding and LLM services
   - Better factory patterns for service initialization

Check the documentation for details on how to use these new features.
