Metadata-Version: 2.1
Name: tesstrain
Version: 0.1.1
Summary: Training utils for Tesseract
Home-page: https://github.com/stefan6419846/tesstrain_package
Author: Tesseract contributors
License: Apache Software License 2.0
Keywords: Tesseract,tesseract-ocr,OCR,optical character recognition
Platform: UNKNOWN
Classifier: Development Status :: 5 - Production/Stable
Classifier: Intended Audience :: Developers
Classifier: Topic :: Scientific/Engineering :: Image Recognition
Classifier: License :: OSI Approved :: Apache Software License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3 :: Only
Classifier: Programming Language :: Python :: 3.6
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Requires-Python: >=3.6
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: tqdm

# tesstrain.py

Utilities for working with Tesseract >= 4 using artificial training data.

## About

This repository contains a standalone fork of the official/upstream code at https://github.com/tesseract-ocr/tesstrain/tree/main/src to allow easier packaging for PyPI.

## Installation

This package requires the Tesseract training tools to be available on your system. Additionally, a supported Python version (at least 3.6) is required for running.

You can install this package from PyPI:

```bash
python -m pip install tesstrain
```

Alternatively, you may use `pip install .` to install the package from a source checkout.

## Running

* Use the terminal interface to directly interact with the tools: `python -m tesstrain --help`.
* Call it from your own code using the high-level interface `tesstrain.run()`.

## License

This package is subject to the terms of the Apache-2.0 license.


