Metadata-Version: 2.4
Name: nexmind
Version: 0.1.0
Summary: NexMind AI — Clean ML training data with AI-powered agents
Author-email: NexMind AI <support@getnexmind.com>
License: MIT
Project-URL: Homepage, https://getnexmind.com
Project-URL: Documentation, https://api.getnexmind.com/docs
Project-URL: Repository, https://github.com/Anil175/dataclean-swarm
Keywords: data-cleaning,ml,ai,training-data,data-quality,pii-detection
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
Requires-Python: >=3.8
Description-Content-Type: text/markdown
Requires-Dist: httpx>=0.24
Requires-Dist: pandas>=1.5

# NexMind AI — DataClean Swarm SDK

Clean your ML training data with AI-powered agents.

## Install

```bash
pip install nexmind
```

## Quick Start

```python
from nexmind import NexMindClient
import pandas as pd

# Initialize
client = NexMindClient(api_key="dcs_your_key")

# Clean a DataFrame
df = pd.read_csv("dirty_data.csv")
clean_df, report = client.clean(df)

print(f"Quality: {report['quality_score_before']:.0%} -> {report['quality_score_after']:.0%}")
print(f"Rows removed: {report['rows_removed']}")
print(f"Rows repaired: {report['rows_repaired']}")
```
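
Hardcoding the key works for a first try, but in shared code it is usually safer to read it from the environment. A minimal sketch, assuming a variable named `NEXMIND_API_KEY` and a helper called `load_api_key`; both names are chosen here for illustration and are not part of the SDK:

```python
import os


def load_api_key(env_var: str = "NEXMIND_API_KEY") -> str:
    """Return the API key stored in an environment variable.

    The variable name is an assumption for this example, not an SDK convention.
    """
    key = os.environ.get(env_var, "").strip()
    if not key:
        raise RuntimeError(f"Set {env_var} before creating the client")
    return key
```

The result can then be passed to the client as `NexMindClient(api_key=load_api_key())`.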

## Choose Agents

```python
# Run specific agents only
clean_df, report = client.clean(
    df,
    agents=["schema", "dedup", "repair"],  # skip anomaly + labeling
)
```
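
A typo in an agent name is easy to make, and this README does not say how the client reacts to one. As a hedged sketch, a small local check against the five agent names listed in this README can catch mistakes before any request is sent. `KNOWN_AGENTS` and `check_agents` are hypothetical names, not SDK functions:

```python
from typing import Iterable, List

# Agent names as listed in this README.
KNOWN_AGENTS = ("schema", "anomaly", "dedup", "repair", "labeling")


def check_agents(agents: Iterable[str]) -> List[str]:
    """Return the agents as a list, or raise ValueError on an unknown name."""
    selected = list(agents)
    unknown = [name for name in selected if name not in KNOWN_AGENTS]
    if unknown:
        raise ValueError(f"Unknown agent(s): {unknown}; expected one of {KNOWN_AGENTS}")
    return selected
```

It could be used as `client.clean(df, agents=check_agents(["schema", "dedup", "repair"]))`.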

## Async Jobs

```python
# Submit and don't wait
job = client.submit_job(
    name="Nightly data clean",
    source={"type": "s3", "bucket": "raw", "key": "data.csv"},
    output={"type": "s3", "bucket": "clean", "key": "clean.parquet"},
)

# Check later
job.refresh()
print(job.status)  # "running" | "completed" | "failed"

# Or wait
job.wait()
print(job.report)
```
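
Whether `job.wait()` accepts a timeout is not stated here. If a bounded wait is needed, one option is to poll manually. The sketch below relies only on `job.refresh()` and `job.status` as shown above, and treats `"completed"` and `"failed"` as the terminal states. `wait_for_job` and its `timeout` and `interval` parameters are hypothetical, not part of the SDK:

```python
import time


def wait_for_job(job, timeout: float = 600.0, interval: float = 5.0) -> str:
    """Poll job.refresh() until job.status is terminal or the timeout passes.

    Relies only on the refresh() method and status attribute shown above.
    """
    deadline = time.monotonic() + timeout
    while True:
        job.refresh()
        if job.status in ("completed", "failed"):
            return job.status
        if time.monotonic() >= deadline:
            raise TimeoutError(f"Job still {job.status!r} after {timeout} seconds")
        time.sleep(interval)
```

The function returns the final status, so the caller can decide whether to read `job.report` or handle a failure.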

## 5 AI Agents

| Agent | What it does |
|-------|-------------|
| **schema** | Fix column types, names, formats |
| **anomaly** | Detect outliers (Z-score, IQR, Isolation Forest) |
| **dedup** | Remove exact + fuzzy duplicates |
| **repair** | Fill missing values, fix typos, standardize |
| **labeling** | Detect PII, classify columns |
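
To give a feel for one of the techniques named for the **anomaly** agent, here is the IQR rule in plain Python. This is an illustration of the general method using the standard library, not the agent's actual implementation; `iqr_outliers` and the multiplier `k = 1.5` are assumptions for the example:

```python
import statistics
from typing import List, Sequence


def iqr_outliers(values: Sequence[float], k: float = 1.5) -> List[float]:
    """Return values outside [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]


ages = [10, 12, 11, 13, 12, 11, 95]
print(iqr_outliers(ages))  # [95]
```

Here Q1 is 11 and Q3 is 13, so the accepted range is 8 to 16 and only 95 falls outside it.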

## Get Your API Key

1. Sign up at [app.getnexmind.com](https://app.getnexmind.com)
2. Go to the API Keys page
3. Create a key

## Links

- [Dashboard](https://app.getnexmind.com)
- [API Docs](https://api.getnexmind.com/docs)
- [Website](https://getnexmind.com)
