Metadata-Version: 2.1
Name: maleo
Version: 0.0.3
Summary: Wrapper library for text data cleansing and preprocessing
Home-page: https://github.com/jakartaresearch/maleo
Author: Ruben Stefanus
Author-email: researchjair@gmail.com
License: MIT
Download-URL: https://github.com/jakartaresearch/maleo/archive/v0.0.3.tar.gz
Description: # Maleo
        Wrapper library for text cleansing and preprocessing in NLP
        
        ## Overview of features
        - Scanner: get insights about your text dataset (e.g., number of characters, words, emojis)
        - Remove hyperlinks, punctuation, stopwords, emoticons, etc.
        - Extract hashtags and prices from text
        - Convert emails, phone numbers, and dates to `<TAG>`
        - Convert Indonesian slang to formal words
        - Convert emojis to words
        - Convert words to numbers
        
        ## Installation
        ```
        pip install maleo
        ```
        
        ## Getting Started
        ```python
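        import pandas as pd  # assumption: `df` is a pandas DataFrame
        from maleo.wizard import Wizard
        
        # Illustrative data with a 'text' column
        df = pd.DataFrame({'text': ['Halo dunia 😀', 'gpp kok, santai aja']})
        
        wiz = Wizard()
        
        wiz.scanner(df, 'text')        # summarize the 'text' column
        wiz.emoji_to_word(df.text)     # convert emojis to words
        wiz.slang_to_formal(df.text)   # convert Indonesian slang to formal words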
        ```
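        
        To give a rough idea of the kind of summary a scanner-style step reports, here is a minimal sketch in plain Python. It is not Maleo's implementation: `text_stats` and its output keys are hypothetical names chosen for illustration, and the real `scanner` may report different or additional statistics.
        
        ```python
        def text_stats(texts):
            """Return per-text character and word counts (illustrative only)."""
            return [
                {"n_chars": len(t), "n_words": len(t.split())}
                for t in texts
            ]
        
        # Hypothetical sample texts
        stats = text_stats(["halo dunia", "maleo is a bird"])
        ```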
        
        ## Instance Attributes
        ```
        ['scanner',
         'rm_multiple_space',
         'rm_link',
         'rm_punc',
         'rm_char',
         'rm_html',
         'rm_non_ascii',
         'rm_stopword',
         'rm_emoticon',
         'word_to_number',
         'get_hashtag',
         'get_price',
         'email_to_tag',
         'date_to_tag',
         'phone_num_to_tag',
         'slang_to_formal',
         'emoji_to_word']
        ```
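        
        As a rough illustration of what tag conversion and hashtag extraction mean conceptually, the sketch below uses only Python's standard `re` module. It does not show how `email_to_tag` or `get_hashtag` are implemented in Maleo: the patterns, the function names `replace_email` and `extract_hashtags`, and the `<EMAIL>` tag string are all assumptions made for this example.
        
        ```python
        import re
        
        # Deliberately simple patterns for illustration; robust email and
        # hashtag matching needs more care. These are not Maleo's patterns.
        EMAIL_RE = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")
        HASHTAG_RE = re.compile(r"#(\w+)")
        
        def replace_email(text, tag="<EMAIL>"):
            """Replace every email-like substring with a placeholder tag."""
            return EMAIL_RE.sub(tag, text)
        
        def extract_hashtags(text):
            """Return hashtag words without the leading '#'."""
            return HASHTAG_RE.findall(text)
        
        sample = "Contact me at user@example.com #nlp #textprocessing"
        masked = replace_email(sample)
        tags = extract_hashtags(sample)
        ```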
        
        ## Contributor
        - Ruben Stefanus
        
Keywords: nlp,text-processing,machine-learning
Platform: UNKNOWN
Classifier: Development Status :: 3 - Alpha
Classifier: Intended Audience :: Developers
Classifier: Topic :: Software Development :: Build Tools
Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
Classifier: Topic :: Text Processing
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.6
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Description-Content-Type: text/markdown
