Metadata-Version: 2.4
Name: socialpretext
Version: 0.1.0
Summary: A lightweight utility for preprocessing and cleaning social media text (slang, emojis, hashtags, etc.).
Author-email: Akash Goyal <akashpgoyal@gmail.com>
License: MIT
Project-URL: Homepage, https://github.com/akashgoyalll/socialpretext
Project-URL: Issues, https://github.com/akashgoyalll/socialpretext/issues
Keywords: nlp,text cleaning,preprocessing,slang,emoji,social media,socialpretext
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Classifier: Topic :: Text Processing :: Linguistic
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Science/Research
Classifier: Natural Language :: English
Requires-Python: >=3.8
Description-Content-Type: text/markdown
Requires-Dist: emoji>=2.0.0

# SocialPreText

A simple, lightweight Python utility for preprocessing and cleaning social media text.

`socialpretext` cleans raw text by expanding slang, expanding contractions, removing emojis, URLs, hashtags, and more, making it ready for NLP models and analysis.

## Features

* Expand common slang (e.g., `idk` -> `I do not know`)
* Expand English contractions (e.g., `don't` -> `do not`)
* Remove or "demojize" emojis (e.g., `😊` -> `:smiling_face:`)
* Remove URLs
* Remove user mentions (`@mentions`)
* Remove hashtags (`#hashtags`)
* Normalize whitespace

## Installation

```bash
pip install socialpretext
