Metadata-Version: 2.4
Name: openai-agents-context-compaction
Version: 0.2.0
Summary: Context compaction for the OpenAI Agents SDK Runner loop
Project-URL: Homepage, https://github.com/damianoneill/openai-agents-context-compaction
Project-URL: Repository, https://github.com/damianoneill/openai-agents-context-compaction
Author: Damian ONeill
License: MIT
License-File: LICENSE
Keywords: agents,compaction,compression,context,memory,openai
Classifier: Development Status :: 3 - Alpha
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Programming Language :: Python :: 3.13
Requires-Python: >=3.11
Requires-Dist: openai-agents>=0.1.0
Provides-Extra: dev
Requires-Dist: commitizen>=3.0.0; extra == 'dev'
Requires-Dist: mypy>=1.19.0; extra == 'dev'
Requires-Dist: pre-commit>=3.0.0; extra == 'dev'
Requires-Dist: pytest-asyncio>=0.21.0; extra == 'dev'
Requires-Dist: pytest-cov>=4.0.0; extra == 'dev'
Requires-Dist: pytest>=7.0.0; extra == 'dev'
Requires-Dist: ruff>=0.14.0; extra == 'dev'
Description-Content-Type: text/markdown

# openai-agents-context-compaction

[![PyPI version](https://img.shields.io/pypi/v/openai-agents-context-compaction.svg)](https://pypi.org/project/openai-agents-context-compaction/)
[![CI](https://github.com/damianoneill/openai-agents-context-compaction/actions/workflows/ci.yml/badge.svg)](https://github.com/damianoneill/openai-agents-context-compaction/actions/workflows/ci.yml)
[![Compatibility](https://github.com/damianoneill/openai-agents-context-compaction/actions/workflows/compatibility.yml/badge.svg)](https://github.com/damianoneill/openai-agents-context-compaction/actions/workflows/compatibility.yml)
[![Status](https://img.shields.io/badge/status-alpha-yellow.svg)](https://github.com/damianoneill/openai-agents-context-compaction)

Context compaction support for the [OpenAI Agents SDK](https://github.com/openai/openai-agents-python), enabling intelligent management of conversation history token size for multi-turn agent interactions.

> **Note:** This package was created in response to [openai/openai-agents-python#2244](https://github.com/openai/openai-agents-python/issues/2244) to provide runner-level context compaction capabilities.

---

## Problem

As agent conversations grow longer, token counts sent to the LLM increase, leading to:

- **Higher costs** – LLM pricing is token-based
- **Degraded reasoning** – Models can lose focus with very long contexts
- **Context window limits** – Models have maximum token limits
- **Latency increases** – Longer prompts take more time to process

---

## Solution

This package extends the existing OpenAI Agents SDK `Session` protocol with **local compaction strategies**, making it **provider-agnostic** and independent of the OpenAI Responses API compaction endpoint.

> ⚠️ **Early-stage alpha:** The current release implements a minimal sliding window approach. Future releases will add token-aware, LLM-based, and pluggable strategies (see [Roadmap](#roadmap)).

---

## Installation

```bash
pip install openai-agents-context-compaction
```

---

## Usage

Wrap any existing session with `LocalCompactionSession` to add automatic compaction support:

```python
from agents import Agent, Runner, SQLiteSession
from openai_agents_context_compaction import LocalCompactionSession

# Create your agent
agent = Agent(name="Assistant", instructions="You are a helpful assistant.")

# Wrap an existing session with compaction support
underlying = SQLiteSession("conversation_123")
session = LocalCompactionSession(underlying, window_size=30)

# Use normally - compaction happens automatically when needed
result = await Runner.run(agent, "Hello!", session=session)
```

- `window_size` controls how many items to keep in the sliding window (`None`, the default, means no compaction)
- Compaction is **boundary-aware** – preserves function call pairs atomically
- **`limit` parameter**: Also boundary-aware — not a simple tail slice; function call pairs are kept atomic even when using `limit`

### Technical Note

The OpenAI Agents SDK stores session data in [Responses API format](https://platform.openai.com/docs/api-reference/responses). Tool calls appear as separate `function_call` and `function_call_output` items matched by `call_id`. This package handles this transparently.

### Performance Considerations

For very large sessions (thousands of items), compaction runs on every `get_items()` call. Results are cached and reused within the same agent turn (invalidated on writes), so repeated calls with the same parameters are O(1). The compaction algorithm itself is O(n) where n is the total session size. If performance becomes a concern:

- Consider periodic session pruning at the storage layer
- Use a reasonable `window_size` that balances context retention with processing cost

---

## Roadmap

| Feature                       | Status         |
| ----------------------------- | -------------- |
| Sliding window compaction     | ✅ Implemented |
| Token-based limits            | 🟡 Planned     |
| LLM-based summarization       | 🟡 Planned     |
| Write-time compaction         | 🟡 Planned     |
| Pluggable compaction policies | 🟡 Planned     |

---

## Write-time Compaction Caveats

| Aspect             | Read-time (current)          | Write-time            |
| ------------------ | ---------------------------- | --------------------- |
| Full history       | ✅ Preserved for audit/debug | ❌ Lost forever       |
| Change window_size | ✅ Retroactive               | ❌ Requires re-import |
| Storage            | ❌ Unbounded growth          | ✅ Bounded            |
| Read cost          | ❌ Compaction on every read  | ✅ Cheap reads        |

**Write-time compaction caveat:**
Function call pairs arrive in stages:

1. `add_items([function_call])` — incomplete (no output yet)
2. `add_items([function_call_output])` — now complete

If compaction runs at write-time, incomplete pairs may be dropped, breaking session integrity.

**Recommendation:** Use read-time compaction (current default) to guarantee atomic function call pairs unless storage size is critical. Write-time compaction requires redesign of `add_items` to handle incomplete pairs safely.

---

## Future Considerations

The following ideas are documented for future reference. Build them when there's a concrete need:

| Feature                        | Add when...                                                  |
| ------------------------------ | ------------------------------------------------------------ |
| **Time-based window**          | Sessions span days and old context becomes stale             |
| **Importance scoring**         | Tool outputs vary wildly in value                            |
| **Pluggable policy interface** | Multiple policies need swapping                              |
| **Role-based prioritization**  | Certain messages must never be evicted                       |
| **Hybrid window**              | Last N items + always keep M most recent function call pairs |

A pluggable policy interface would look like:

```python
class CompactionPolicy(Protocol):
    def compact(self, items: list[TResponseInputItem]) -> list[TResponseInputItem]:
        """Return compacted items. Must preserve function call pair atomicity."""
        ...
```

---

## Compatibility

Tested weekly against the latest OpenAI Agents SDK to ensure compatibility.

---

## License

MIT
