Metadata-Version: 2.4
Name: mlboardkit
Version: 0.1.2
Summary: Utilities for data processing, model training, and analysis.
Author-email: Sohan <soh.venkatesh@gmail.com>
License: Apache-2.0
Project-URL: Homepage, https://github.com/sohv/mlboardkit
Project-URL: Issues, https://github.com/sohv/mlboardkit/issues
Keywords: machine-learning,data,nlp,ml,utilities,toolkit
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: Apache Software License
Classifier: Operating System :: OS Independent
Requires-Python: >=3.9
Description-Content-Type: text/markdown
Requires-Dist: pandas
Requires-Dist: numpy
Requires-Dist: pyyaml
Requires-Dist: scikit-learn
Requires-Dist: psutil
Requires-Dist: imbalanced-learn
Requires-Dist: nltk
Requires-Dist: torch
Requires-Dist: tensorflow
Requires-Dist: pyarrow
Requires-Dist: openpyxl
Requires-Dist: matplotlib
Requires-Dist: requests
Requires-Dist: seaborn
Requires-Dist: ftfy
Requires-Dist: langdetect
Requires-Dist: joblib
Requires-Dist: schedule
Requires-Dist: transformers
Requires-Dist: datasets
Requires-Dist: build>=1.3.0
Requires-Dist: twine>=6.2.0
Provides-Extra: dev
Requires-Dist: pytest; extra == "dev"
Requires-Dist: ruff; extra == "dev"
Requires-Dist: mypy; extra == "dev"

# MLBoardKit

<div align="center">
  <img src="./logo.svg" width="120" alt="MLBoardKit Logo" />
</div>

A Python library that provides utilities for streamlined data processing, model training, and analysis tasks in machine learning workflows.

**mlboardkit** offers easy CLI commands and Python interfaces for dataset quality checks, format conversion, metric computation, plot generation, and model training — with support for popular frameworks and minimal setup.

[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
[![GitHub stars](https://img.shields.io/github/stars/adithya-s-k/mlboardkit?style=social)](https://github.com/sohv/mlboardkit)
[![PyPI version](https://img.shields.io/pypi/v/mlboardkit.svg)](https://pypi.org/project/mlboardkit/)



## Install

```bash
# from source (editable)
pip install -e .

# from PyPI (published)
pip install mlboardkit
```

## Quick start

```python
# After installing mlboardkit, import via the mlboardkit namespace
from mlboardkit.data_utils.dataset_processor import main as dataset_processor_main
from mlboardkit.analysis_tools.metrics_utils import classification_report

report = classification_report([1,0,1], [1,0,0])
```

CLI via python -m:
```bash
python -m mlboardkit.data_utils.dataset_processor quality-check dataset.csv --report report.json
python -m mlboardkit.data_utils.data_converter convert input.json output.csv --format csv
python -m mlboardkit.analysis_tools.plot_metrics training_log.json --plot-type training --output curves.png
python -m mlboardkit.model_utils.train_model --model-name bert-base-uncased --train-file train.jsonl --epochs 3
```

Python requirement: 3.9+

Full usage and CLI examples are in `usage.md`. Here is a [demo notebook](https://colab.research.google.com/drive/1Z7ltGDY89NFUT3Vyzl71nWls2y0DsjLb?usp=sharing) that demonstrates the usage of this library in a ML project.

