Metadata-Version: 2.4
Name: openai-embeddings-model
Version: 0.5.1
Summary: OpenAIEmbeddingsModel
License: MIT
License-File: LICENSE
Author: Allen Chou
Author-email: f1470891079@gmail.com
Requires-Python: >=3.11,<4
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Programming Language :: Python :: 3.13
Classifier: Programming Language :: Python :: 3.14
Requires-Dist: diskcache
Requires-Dist: numpy
Requires-Dist: openai
Requires-Dist: str_or_none
Requires-Dist: tiktoken
Description-Content-Type: text/markdown

# OpenAI Embeddings Model

A high-performance Python library for generating embeddings using OpenAI's API and other OpenAI-compatible providers, with intelligent caching, batch processing, and similarity search.

## Installation

```bash
pip install openai-embeddings-model
```

## Quick Start

```python
import openai
from openai_embeddings_model import OpenAIEmbeddingsModel, ModelSettings

client = openai.OpenAI(api_key="your-api-key")
model = OpenAIEmbeddingsModel(model="text-embedding-3-small", openai_client=client)

response = model.get_embeddings(
    input=["Hello, world!", "How are you?"],
    model_settings=ModelSettings(dimensions=512)
)

embeddings = response.to_numpy()  # NumPy array
print(f"Shape: {embeddings.shape}, Tokens: {response.usage.total_tokens}")
```

### Async

```python
import asyncio, openai
from openai_embeddings_model import AsyncOpenAIEmbeddingsModel, ModelSettings

async def main():
    client = openai.AsyncOpenAI(api_key="your-api-key")
    model = AsyncOpenAIEmbeddingsModel(model="text-embedding-3-small", openai_client=client)
    response = await model.get_embeddings(
        input=["Hello, world!"],
        model_settings=ModelSettings(dimensions=512)
    )
    print(response.to_numpy().shape)

asyncio.run(main())
```

## Similarity Search

Find the most relevant documents for a query — results are sorted by relevance score:

```python
query = "What is the capital of France?"
documents = [
    "The capital of Germany is Berlin.",
    "The capital of France is Paris.",
    "The capital of Italy is Rome.",
]

response = model.get_similarity(query, documents, model_settings=ModelSettings(dimensions=512))

for result in response.results:
    print(f"[{result.index}] score={result.relevance_score:.4f}  {documents[result.index]}")
```

Also available as `await model.get_similarity(...)` on `AsyncOpenAIEmbeddingsModel`.

## Supported Providers

| Provider         | Example model                                      |
|------------------|----------------------------------------------------|
| **OpenAI**       | `text-embedding-3-small`, `text-embedding-3-large` |
| **Azure OpenAI** | `text-embedding-3-small` via AzureOpenAI client    |
| **Gemini**       | `text-embedding-004`                               |
| **Voyage AI**    | `voyage-3`, `voyage-3-lite`                        |
| **Self-hosted**  | `nomic-embed-text` via Ollama / LocalAI            |

### Voyage AI

```python
import openai
from openai_embeddings_model import OpenAIEmbeddingsModel, ModelSettings

client = openai.OpenAI(
    base_url="https://api.voyageai.com/v1",
    api_key="your-voyage-api-key"
)
model = OpenAIEmbeddingsModel(model="voyage-3-lite", openai_client=client)
response = model.get_embeddings(input=["Hello"], model_settings=ModelSettings(dimensions=512))
```

## Caching

```python
import diskcache
from openai_embeddings_model import get_default_cache

# Default cache
cache = get_default_cache()

# Custom location
cache = diskcache.Cache('/path/to/cache')

model = OpenAIEmbeddingsModel(
    model="text-embedding-3-small",
    openai_client=client,
    cache=cache
)
```

Cache hits are tracked in `response.usage.cache_hits` and never re-billed.

## API Reference

### Classes

| Class                        | Description                                                |
|------------------------------|------------------------------------------------------------|
| `OpenAIEmbeddingsModel`      | Synchronous model                                          |
| `AsyncOpenAIEmbeddingsModel` | Async model (dedicated `ThreadPoolExecutor` for cache I/O) |

### Methods

| Method                                                            | Returns                    |
|-------------------------------------------------------------------|----------------------------|
| `get_embeddings(input, model_settings)`                           | `ModelResponse`            |
| `get_embeddings_generator(input, model_settings, chunk_size=100)` | `Generator[ModelResponse]` |
| `get_similarity(query, documents, model_settings)`                | `SimilarityResponse`       |

### ModelSettings

| Parameter    | Type            | Default | Description               |
|--------------|-----------------|---------|---------------------------|
| `dimensions` | `int \| None`   | `None`  | Custom output dimensions  |
| `timeout`    | `float \| None` | `None`  | Request timeout (seconds) |

### Responses

**`ModelResponse`**

- `to_numpy()` → `NDArray[np.float32]`
- `to_python()` → `List[List[float]]`
- `usage.input_tokens`, `usage.total_tokens`, `usage.cache_hits`

**`SimilarityResponse`**

- `results: list[SimilarityResult]` — sorted by `relevance_score` descending
- `usage` — same as `ModelResponse`

**`SimilarityResult`**

- `index: int` — original document index
- `relevance_score: float`

## Requirements

- Python 3.11+
- OpenAI API key (or compatible provider)

## License

MIT — Allen Chou &lt;<f1470891079@gmail.com>&gt;

