Metadata-Version: 2.4
Name: grover
Version: 0.0.13
Summary: The agentic filesystem. Safe file operations, knowledge graphs, and semantic search — unified for AI agents.
Project-URL: Homepage, https://github.com/ClayGendron/grover
Project-URL: Repository, https://github.com/ClayGendron/grover
Project-URL: Issues, https://github.com/ClayGendron/grover/issues
Author: Clay Gendron
License-Expression: Apache-2.0
License-File: LICENSE
Classifier: Development Status :: 3 - Alpha
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: Apache Software License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.12
Classifier: Programming Language :: Python :: 3.13
Classifier: Programming Language :: Python :: 3.14
Classifier: Typing :: Typed
Requires-Python: >=3.12
Requires-Dist: aiosqlite>=0.20
Requires-Dist: pydantic>=2.0
Requires-Dist: rustworkx>=0.17
Requires-Dist: sqlalchemy[asyncio]>=2.0
Requires-Dist: sqlmodel>=0.0.31
Requires-Dist: unidiff>=0.7
Provides-Extra: all
Requires-Dist: asyncpg>=0.29; extra == 'all'
Requires-Dist: databricks-vectorsearch>=0.40; extra == 'all'
Requires-Dist: deepagents>=0.4; extra == 'all'
Requires-Dist: langchain-core>=0.3; extra == 'all'
Requires-Dist: langgraph>=0.2; extra == 'all'
Requires-Dist: openai>=1.0; extra == 'all'
Requires-Dist: pinecone[asyncio]>=5.0; extra == 'all'
Requires-Dist: tree-sitter-go>=0.21; extra == 'all'
Requires-Dist: tree-sitter-javascript>=0.21; extra == 'all'
Requires-Dist: tree-sitter-typescript>=0.21; extra == 'all'
Requires-Dist: tree-sitter>=0.22; extra == 'all'
Requires-Dist: usearch>=2.0; extra == 'all'
Provides-Extra: databricks
Requires-Dist: databricks-vectorsearch>=0.40; extra == 'databricks'
Provides-Extra: deepagents
Requires-Dist: deepagents>=0.4; extra == 'deepagents'
Provides-Extra: langchain
Requires-Dist: langchain-core>=0.3; extra == 'langchain'
Provides-Extra: langgraph
Requires-Dist: langchain-core>=0.3; extra == 'langgraph'
Requires-Dist: langgraph>=0.2; extra == 'langgraph'
Provides-Extra: mssql
Requires-Dist: aioodbc>=0.5; extra == 'mssql'
Provides-Extra: openai
Requires-Dist: openai>=1.0; extra == 'openai'
Provides-Extra: pinecone
Requires-Dist: pinecone[asyncio]>=5.0; extra == 'pinecone'
Provides-Extra: postgres
Requires-Dist: asyncpg>=0.29; extra == 'postgres'
Provides-Extra: search
Requires-Dist: usearch>=2.0; extra == 'search'
Provides-Extra: treesitter
Requires-Dist: tree-sitter-go>=0.21; extra == 'treesitter'
Requires-Dist: tree-sitter-javascript>=0.21; extra == 'treesitter'
Requires-Dist: tree-sitter-typescript>=0.21; extra == 'treesitter'
Requires-Dist: tree-sitter>=0.22; extra == 'treesitter'
Description-Content-Type: text/markdown

# `grover`: The Agentic File System

<p align="center">
  <a href="https://pypi.org/project/grover/"><img src="https://img.shields.io/pypi/v/grover" alt="PyPI version"></a>
  <a href="https://pypi.org/project/grover/"><img src="https://img.shields.io/pypi/pyversions/grover" alt="Python"></a>
  <a href="https://github.com/ClayGendron/grover/actions/workflows/test.yml"><img src="https://github.com/ClayGendron/grover/actions/workflows/test.yml/badge.svg" alt="Tests"></a>
  <a href="https://github.com/ClayGendron/grover/blob/main/LICENSE"><img src="https://img.shields.io/github/license/ClayGendron/grover" alt="License"></a>
  <a href="https://codecov.io/gh/ClayGendron/grover"><img src="https://codecov.io/gh/ClayGendron/grover/branch/main/graph/badge.svg" alt="Coverage"></a>
</p>

```bash
pip install grover
```

`grover` is an in-process file system that mounts data from multiple sources to enable agentic search and operations through a Unix-like interface.

```python
from grover import Grover, LocalFileSystem, DatabaseFileSystem

g = Grover()

localfs = LocalFileSystem()
dbfs = DatabaseFileSystem(engine_url="sqlite+aiosqlite:///knowledge.db")

g.add_mount('/workspace', localfs)
g.add_mount('/enterprise', dbfs)

g.cli('write /workspace/auth.py "def login(user, password): return authenticate(user, password)"')
g.cli('read /enterprise/security-policy.md')
g.cli('search "how does user login work?" --k 10')
g.cli('grep "authenticate" | pagerank | top 15')

g.close()
```

> Every CLI command maps directly to a Python method. 
>
> - `g.cli('write ...')` calls `g.write()`
> - `g.cli('search ...')` calls `g.semantic_search()`
> - pipelines like `grep | pagerank | top 15` chain results through `g.grep()` → `g.pagerank(candidates)` → `result.top(15)`.

Unix has been a foundational technology in computing for over 50 years because of its enduring core design principles: a uniform namespace, small composable tools, and portability. `grover` builds on these principles to design the platform for building agent context and performing agentic actions.

- **Agent-First Design:** `grover` is built around having the main *user* be a large language model running in a loop over a long time horizon. Building for LLMs means that operations within the file system are versioned and reversible, tools are discoverable *files* loaded into context when needed instead of by default, and every operation can be expressed through a composable CLI — the interface LLMs are increasingly trained to use.
- **Everything is a File:** Everything within `grover` is addressable by path and conforms to standard data types. This single abstraction enables composable operations and predictable data within `grover`.
- **Small, Composable, and On-Demand Tools:** Building a new tool for every use case should be the exception, not the norm. All the capabilities of `grover` can be accessed and expressed through a CLI which frees up context to build more performant and predictable agents. Specialized tools and MCPs can be assigned their own file paths in `grover` for ultimate flexibility without the cost of filling up context.
- **BYOI (Bring Your Own Infrastructure):** `grover` has a database-first design and can run in-process with your application or as an MCP server. No new design patterns or infrastructure required — `grover` runs where you need it and works with your existing AI applications.

> `grover` is in alpha, so we are actively building towards this vision. Please test it out and provide your feedback!

## The `GroverFileSystem`

The main class of this library is `GroverFileSystem`. It handles mounting and routing across storage backends and defines the public API surface for `grover`. The API combines familiar file system operations with search, graph traversal, and ranking. All public methods return the same composable result type, so one method's output can be used as input to the next.

| Category | Methods |
|----------|---------|
| **CRUD** | `read`, `write`, `edit`, `delete`, `move`, `copy`, `mkdir`, `mkconn` |
| **Navigation** | `ls`, `tree`, `stat` |
| **Pattern Search** | `glob`, `grep` |
| **Retrieval** | `semantic_search`, `lexical_search`, `vector_search` |
| **Graph Traversal** | `predecessors`, `successors`, `ancestors`, `descendants`, `neighborhood`, `meeting_subgraph`, `min_meeting_subgraph` |
| **Graph Ranking** | `pagerank`, `betweenness_centrality`, `closeness_centrality`, `degree_centrality`, `in_degree_centrality`, `out_degree_centrality`, `hits` |
| **Query Engine** | `run_query`, `cli` |
| **Lifecycle** | `add_mount`, `remove_mount` |

### Core Components

1. **File System.** A versioned, chunkable, permission-aware, database-backed file system for text and documents. All operations are reversible and protected against data loss.
2. **Retrieval.** Pluggable vector search and BM25 lexical search enable semantic and keyword retrieval across the file system. Embedding and indexing happen automatically on write.
3. **Graph.** Connections between files are first-class objects. Graph algorithms like PageRank, centrality, and subgraph extraction operate on the same paths as every other operation.

## How It Works

Everything in `grover` is addressable by path. Files, chunks, versions, and connections all live in a single namespace:

```
/workspace/
├── auth.py                                           File
│   ├── .chunks/
│   │   ├── login                                     Chunk (function)
│   │   └── AuthService                               Chunk (class)
│   ├── .versions/
│   │   ├── 1                                         Version (snapshot)
│   │   └── 2                                         Version (diff)
│   └── .connections/
│       └── imports/
│           └── workspace/utils.py                    Connection (dependency)
├── utils.py                                          File
└── main.py                                           File
/enterprise/
├── onboarding.md                                     File
└── security-policy.md                                File
```

Metadata directories (`.chunks/`, `.versions/`, `.connections/`) follow the Unix dotfile convention — hidden by default, always accessible by explicit path. `ls` shows files. `ls -a` reveals metadata. Search returns files by default. Metadata is opt-in.

### Composable Results

Every operation returns a `GroverResult` with typed `Candidate` objects. Results support set algebra, so different retrieval strategies can be combined without LLM re-interpretation:

```python
# Intersection — Python files that match a semantic query
semantic = g.semantic_search("authentication")
python_files = g.glob("/workspace/**/*.py")
candidates = semantic & python_files

# Union — expand to graph neighbors
expanded = candidates | g.neighborhood(candidates)

# Re-rank by centrality
ranked = g.pagerank(candidates=expanded)
```

Or the same thing through the CLI:

```python
print(g.cli('search "authentication" | glob "/workspace/**/*.py" | nbr | pagerank'))
```

> `grover` also provides `GroverAsync` as the async facade, which is the preferred path for application servers and long-running agents. The sync `Grover` wrapper shown in these examples is a convenience layer for scripts, notebooks, and data pipelines.

## Installation

Requires Python 3.12+.

```bash
pip install grover                # core (SQLite, rustworkx, BM25)
pip install grover[openai]        # OpenAI embeddings
pip install grover[langchain]     # LangChain embedding provider
pip install grover[postgres]      # PostgreSQL backend
pip install grover[mssql]         # MSSQL backend
pip install grover[pinecone]      # Pinecone vector store
pip install grover[databricks]    # Databricks Vector Search
pip install grover[search]        # usearch (local vector search)
pip install grover[treesitter]    # JS/TS/Go code analyzers
pip install grover[deepagents]    # deepagents integration
pip install grover[langgraph]     # LangGraph persistent store
pip install grover[all]           # everything
```

## Status and Roadmap

`grover` is in alpha. The core file system, CLI query engine, graph algorithms, and BM25 lexical search are implemented and tested (1,779 tests, 99% coverage).

**What's coming next:**

- **MCP single-tool interface** — expose `grover` as one MCP tool with progressive discovery via `--help`
- **Shell entrypoint** — run `grover 'grep "auth" | pagerank | top 15'` directly from the terminal
- **`.api/` control plane** — live API pass-through for external services (Jira, Slack, GitHub) alongside synced data in the same namespace
- **LocalFileSystem** — mount local directories with files on disk and metadata in SQLite
- **More analyzers** — Markdown, PDF, email, Slack, Jira, CSV/JSON (code analyzers for Python, JS/TS, Go exist in v1)
- **Automatic embedding on write** — background indexing for semantic search without manual setup

## Contributing

Contributions are welcome. See [CONTRIBUTING.md](CONTRIBUTING.md).

## License

Apache 2.0. See [LICENSE](LICENSE).
