Metadata-Version: 2.4
Name: autoclean-ds
Version: 0.1.0
Summary: Automated data cleaning library for pandas DataFrames
Home-page: https://github.com/jd878-gif/autoclean
Author: Jeet Dave
Author-email: Jeet Dave <jd.878@njit.edu>
Requires-Python: >=3.9
Description-Content-Type: text/markdown
License-File: license
Requires-Dist: pandas>=1.5
Requires-Dist: numpy>=1.21
Dynamic: author
Dynamic: home-page
Dynamic: license-file
Dynamic: requires-python

<!-- Day-3 -->
<!-- git add .
git commit -m "Day 3: Added remove_duplicates, drop_empty_columns, detect_constant_columns, updated pipeline"
git push -u origin day3-cleaning -->

# AutoClean 🧹

**AutoClean** is an automated data cleaning library for pandas DataFrames.  
It intelligently detects column types, handles missing values, removes outliers, 
drops useless columns, and generates a clear before/after report.

Built for **data scientists**, **ML engineers**, and **students** who want
clean data in one line.

---

## ✨ Features

- Automatic column standardization
- Intelligent data type detection (numerical, categorical, datetime)
- Missing value handling with smart strategies
- Outlier detection & removal (IQR-based)
- Constant & empty column removal
- Before/After cleaning report
- Configurable & extensible pipeline

---

## 🚀 Installation (Local / Dev)

```bash
pip install .

