Metadata-Version: 2.1
Name: Lutil
Version: 0.1.8
Summary: Assist small-scale machine learning.
Home-page: https://github.com/Vopaaz/learning-utility
Author: Vopaaz
Author-email: liyifan945@163.com
License: UNKNOWN
Platform: UNKNOWN
Classifier: Programming Language :: Python :: 3.6
Classifier: Programming Language :: Python :: 3.7
Classifier: License :: OSI Approved :: GNU General Public License v3 (GPLv3)
Description-Content-Type: text/markdown
Requires-Dist: pandas
Requires-Dist: chardet
Requires-Dist: numpy
Requires-Dist: joblib

# learning-utility

**Assist small-scale machine learning.**

learning-utility is a package of utilities for small-scale machine
learning tasks with scikit-learn.

![image](https://www.travis-ci.org/Vopaaz/learning-utility.svg?branch=master)
![image](https://codecov.io/gh/Vopaaz/learning-utility/branch/master/graph/badge.svg)
![PyPI - Python Version](https://img.shields.io/pypi/pyversions/Lutil)
[![Downloads](https://pepy.tech/badge/lutil)](https://pepy.tech/project/lutil)
![PyPI](https://img.shields.io/pypi/v/Lutil)

## Installation

```bash
pip install Lutil
```

## Key Features

### Cache Intermediate Results

`InlineCheckpoint` can cache the computation result in the first call.
Since then, if nothing has changed, it retrieves the cache and skips
computation.

Suppose you have such a .py file.

```python
from Lutil.checkpoints import InlineCheckpoint

a, b = 1, 2
with InlineCheckpoint(watch=["a", "b"], produce=["c"]):
   print("Heavy computation.")
   c = a + b

print(c)
```

Run the script, you will get:

```text
Heavy computation.
3
```

Run this script again, the with-statement will be skipped. You will get:

```text
3
```

Once a value among `watch` changes or the code inside the with-statement
changes, re-calculation takes place to ensure the correct output.

### Save Prediction Result According to the Given Format

Lots of machine learning competitions require a .csv file in a given format.
Most of them provide an example file.

In example.csv:

```text
id, pred
1, 0.25
2, 0.45
3, 0.56
```

Run:

```python
>>> import numpy as np
>>> from Lutil.dataIO import AutoSaver

>>> result = np.array([0.2, 0.4, 0.1, 0.5])
       # Typical output of a scikit-learn predictor

>>> ac = AutoSaver(save_dir="somedir", example_path="path/to/example.csv")
>>> ac.save(result, "some_name.csv")
```

Then in your somedir/some_name.csv:

```text
id, pred
1, 0.2
2, 0.4
3, 0.1
4, 0.5
```

It also works if the `result` is a pandas DataFrame, Series, 2-dim numpy array, etc.
Also, the encoding, seperator, header, index of the example.csv will all be recognized.


