Metadata-Version: 2.2
Name: nlpclean
Version: 1.3.2
Summary: Utilities for cleaning up text corpus
Home-page: https://github.com/shkarupa-alex/nlpclean
Author: Shkarupa Alex
Author-email: shkarupa.alex@gmail.com
License: MIT
Classifier: Development Status :: 5 - Production/Stable
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Education
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Topic :: Scientific/Engineering
Classifier: Topic :: Software Development
Classifier: Topic :: Software Development :: Libraries
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Requires-Python: >=3.7.0
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: beautifulsoup4>=4.13.1
Requires-Dist: fasttext>=0.9.3
Requires-Dist: tqdm>=4.67.1
Requires-Dist: rbloom>=1.5.2
Requires-Dist: ftfy>=6.1.1
Requires-Dist: lxml_html_clean>=0.4.1
Requires-Dist: iso639-lang>=2.6.0
Requires-Dist: trafilatura>=2.0.0
Requires-Dist: markdownify>=0.14.1
Dynamic: author
Dynamic: author-email
Dynamic: classifier
Dynamic: description
Dynamic: description-content-type
Dynamic: home-page
Dynamic: license
Dynamic: requires-dist
Dynamic: requires-python
Dynamic: summary

# nlpclean

Bunch of tools for corpus cleaning.
