Metadata-Version: 2.1
Name: reconner
Version: 0.9.0
Summary: ReconNER, Debug annotated Named Entity Recognition (NER) data for inconsitencies and get insights on improving the quality of your data.
Home-page: https://github.com/kabirkhan/recon
Author: Kabir Khan
Author-email: kabirkhan1137@outlook.com
Requires-Python: >=3.6
Description-Content-Type: text/markdown
Classifier: Intended Audience :: Information Technology
Classifier: Intended Audience :: System Administrators
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python
Classifier: Topic :: Software Development :: Libraries :: Application Frameworks
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Classifier: Topic :: Software Development :: Libraries
Classifier: Topic :: Software Development
Classifier: Typing :: Typed
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Developers
Classifier: Programming Language :: Python :: 3 :: Only
Classifier: Programming Language :: Python :: 3.6
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Classifier: License :: OSI Approved :: MIT License
Requires-Dist: spacy >= 3.0.0, <3.1.0
Requires-Dist: scipy
Requires-Dist: snorkel == 0.9.6
Requires-Dist: typer >= 0.0.8
Requires-Dist: colorama ; extra == "all"
Requires-Dist: click-completion ; extra == "all"
Requires-Dist: jupyterlab >= 1.2.0, <1.3 ; extra == "dev"
Requires-Dist: ipywidgets ; extra == "dev"
Requires-Dist: autoflake >=1.3.1,<2.0.0 ; extra == "dev"
Requires-Dist: flake8 >=3.8.3,<4.0.0 ; extra == "dev"
Requires-Dist: mkdocs >=1.1.2,<2.0.0 ; extra == "doc"
Requires-Dist: mkdocs-material >=5.4.0,<6.0.0 ; extra == "doc"
Requires-Dist: markdown-include >=0.5.1,<0.6.0 ; extra == "doc"
Requires-Dist: mkdocstrings ; extra == "doc"
Requires-Dist: pytest == 5.3.0 ; extra == "test"
Requires-Dist: pytest-cov ; extra == "test"
Requires-Dist: coverage ; extra == "test"
Requires-Dist: pytest-xdist ; extra == "test"
Requires-Dist: pytest-sugar ; extra == "test"
Requires-Dist: mypy ; extra == "test"
Requires-Dist: black ; extra == "test"
Requires-Dist: isort ; extra == "test"
Project-URL: Documentation, https://kabirkhan.github.io/recon
Provides-Extra: all
Provides-Extra: dev
Provides-Extra: doc
Provides-Extra: test

<p align="center">
  <a href="https://kabirkhan.github.io/recon"><img src="https://kabirkhan.github.io/recon/img/recon.svg" alt="Recon"></a>
</p>
<p align="center">
    <em>Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsitencies and get insights on improving the quality of your data.</em>
</p>
<p align="center">
<a href="https://pypi.org/project/reconner" target="_blank">
    <img src="https://img.shields.io/pypi/v/reconner?style=for-the-badge" alt="PyPi Package version">
</a>
<a href="https://dev.azure.com/kabirkhan1137/ReconNER/_build?definitionId=4" target="_blank">
    <img alt="Azure Pipelines Build badge" src="https://img.shields.io/azure-devops/build/kabirkhan1137/ReconNER/kabirkhan.recon?style=for-the-badge">
</a>
<a href="https://codecov.io/gh/kabirkhan/recon" rel="nofollow">
  <img alt="Codecov badge" src="https://img.shields.io/codecov/c/gh/kabirkhan/recon?style=for-the-badge" style="max-width:100%;">
</a>

<a href="https://pypi.org/project/reconner" target="_blank">
    <img src="https://img.shields.io/pypi/l/reconner?style=for-the-badge" alt="PyPi Package license">
</a>
</p>

---

**Documentation**: <a href="https://kabirkhan.github.io/recon" target="_blank">https://kabirkhan.github.io/recon</a>

**Source Code**: <a href="https://github.com/kabirkhan/recon" target="_blank">https://github.com/kabirkhan/recon</a>

---

Recon is a library to help you fix your annotated NER data and identify examples that are hardest for your model to predict so you can strategically prioritize the examples you annotate.

The key features are:

* **Data Validation and Cleanup**: Easily Validate the format of your NER data. Filter overlapping Entity Annotations, fix missing properties.
* **Model Insights**: Analyze how well your model does on your Dataset. Identify the top errors your model is making so you can prioritize data collection and correction strategically.
* **Dataset Management**: Recon provides a `Dataset` class to manage the train/dev/test split of your data and apply the same functions across all splits in your data + a concatenation of all examples. Operate inplace to consistently transform your data.
* **Serializable Dataset**: Serialize and Deserialize your data to and from JSON to the Recon type system.
* **Type Hints**: Comprehensive Typing system based on Python 3.6+ Type Hints

## Requirements

Python 3.6+

`Recon` is built on a few comprehensive, high-performing packages.

* <a href="https://spacy.io" class="external-link" target="_blank">spaCy</a>
* <a href="https://pydantic-docs.helpmanual.io/" class="external-link" target="_blank">Pydantic (Type system and JSON Serialization)</a>
* <a href="https://typer.tiangolo.com" class="external-link" target="_blank">Typer (CLI)</a>.


## Installation

<div class="termy">

```console
$ pip install reconner
---> 100%
Successfully installed reconner
```

</div>

## License

This project is licensed under the terms of the MIT license.

