Metadata-Version: 2.4
Name: zeta-mlx-cli
Version: 0.3.2
Summary: CLI for Zeta MLX - OpenAI-compatible LLM/Embedding on Apple Silicon
Keywords: mlx,llm,cli,apple-silicon,openai
Author: ZetaLab
Author-email: zeta9044@gmail.com
Requires-Python: >=3.10,<3.13
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: MacOS
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Requires-Dist: rich (>=13.7,<14.0)
Requires-Dist: typer[all] (>=0.16,<0.17)
Requires-Dist: zeta-mlx-core (>=0.3.2,<0.4.0)
Requires-Dist: zeta-mlx-embedding (>=0.3.2,<0.4.0)
Requires-Dist: zeta-mlx-inference (>=0.3.2,<0.4.0)
Project-URL: Homepage, https://github.com/zeta9044/zeta-mlx
Project-URL: Repository, https://github.com/zeta9044/zeta-mlx
Description-Content-Type: text/markdown

# zeta-mlx-cli

CLI for Zeta MLX - OpenAI-compatible LLM/Embedding inference on Apple Silicon.

## Installation

```bash
pip install zeta-mlx-cli
```

## Commands

```bash
# LLM Server (port 9044)
zeta-mlx llm start          # Start server
zeta-mlx llm start --daemon  # Start in background
zeta-mlx llm status         # Check status
zeta-mlx llm stop           # Stop server

# Embedding Server (port 9045)
zeta-mlx embedding start
zeta-mlx embedding status
zeta-mlx embedding stop

# Interactive chat
zeta-mlx chat

# Model management
zeta-mlx models list
```

## API Usage

```bash
# Chat completion
curl -X POST http://localhost:9044/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"model": "qwen3-8b", "messages": [{"role": "user", "content": "Hello!"}]}'

# Tokenize
curl -X POST http://localhost:9044/tokenize \
  -d '{"prompt": "Hello, world!"}'
```

## Links

- [GitHub](https://github.com/zeta9044/zeta-mlx)
- [Documentation](https://github.com/zeta9044/zeta-mlx#readme)

