Metadata-Version: 2.1
Name: retexto
Version: 1.6.1
Summary: Fast text processing
Home-page: https://github.com/Edux87/retexto
Author: Edgar Castañeda
Author-email: edaniel15@gmail.com
License: UNKNOWN
Description: =========================================================
        reTexto
        =========================================================
        Fast text processing for python
        
        Run
        ===
        
            cd /[project_path]
            docker build -t retexto .
            docker run -v $(pwd):/retexto:rw -it retexto bash
        
        Basic Use
        ===========
        
        ::
        
            if __name__ == '__main__':
            s = '@Edux87, i need this www.google.com | https://github.com <br> \
            <strong>UserName: çarlos </strong> \
            i\'m from Perú 😛 \
            #Friends #Text jajajajaja so fffunny  \
            loooveee thiiis 😌😎 \
            @florenciaflor19 Si!!! sé vo… 🐷JUANA🐷 \
            smile! haha jejeje jojojo jujuju jijijijajaja 😂'
            text = ReTexto(s)
        
            s = text.remove_html() \
                .remove_mentions() \
                .remove_tags() \
                .remove_smiles(by='SMILING') \
                .convert_specials() \
                .convert_emoji() \
                .remove_nochars(preserve_tilde=True) \
                .remove_url() \
                .remove_duplicate(r='a-jp-z') \
                .remove_duplicate_vowels() \
                .remove_duplicate_consonants() \
                .remove_punctuation() \
                .remove_multispaces() \
                .lower() \
                .remove_stopwords() \
                .split_words(uniques=True)
            print(s)
            ['username', 'from', 'love', 'i', 'ned', 'funy', 'juana', 'vo', 'this', 'si', 'im', 'se', 'peru', 'smile', 'so', 'smiling', 'carlos']
        
Platform: linux
Platform: windows
Classifier: Development Status :: 5 - Production/Stable
Classifier: Intended Audience :: Developers
Classifier: Programming Language :: Python
Classifier: Programming Language :: Python :: 2.6
Classifier: Programming Language :: Python :: 2.7
Classifier: Programming Language :: Python :: 3
Classifier: Environment :: Console
Classifier: Operating System :: OS Independent
Classifier: License :: OSI Approved :: Apache Software License
Classifier: Topic :: Text Editors :: Text Processing
Classifier: Topic :: Text Editors :: Word Processors
Description-Content-Type: text/x-rst
