Metadata-Version: 2.1
Name: cleanliness
Version: 0.1.1
Summary: Basic cleaning of text
Home-page: https://github.com/sorenlind/cleanliness/
Author: Soren Lind Kristiansen
Author-email: sorenlind@mac.com
License: UNKNOWN
Keywords: nlp text cleaning
Platform: UNKNOWN
Classifier: Development Status :: 3 - Alpha
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: MIT License
Classifier: Natural Language :: Danish
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 2.7
Classifier: Programming Language :: Python :: 3.6
Classifier: Topic :: Text Processing :: Linguistic
Requires-Dist: regex
Provides-Extra: dev
Requires-Dist: pylint; extra == 'dev'
Requires-Dist: pycodestyle; extra == 'dev'
Requires-Dist: pydocstyle; extra == 'dev'
Requires-Dist: yapf; extra == 'dev'
Requires-Dist: pytest; extra == 'dev'
Requires-Dist: tox; extra == 'dev'
Provides-Extra: notebooks
Requires-Dist: jupyter; extra == 'notebooks'
Provides-Extra: test
Requires-Dist: pytest; extra == 'test'
Requires-Dist: tox; extra == 'test'

# 🛀 Cleanliness

Cleanliness is a package for doing (very) basic cleaning of text. It can be used for preprocessing input to various
natural language processing tasks.

## Install

Install using from PyPI as follows:

```python
pip install cleanliness
```

Alternatively, you can download the source from GitHub, then install using pip (from the repo directory):

```python
pip install -e .
```

## Usage

```python
from cleanliness import normalize_whitespace
clean_text = normalize_whitespace("This\ntext\t\contains  odd\n\t whitespace.")
```


