Metadata-Version: 2.1
Name: datarefiner
Version: 0.1.0
Summary: DataRefiner: An Advanced Toolkit for Data Transformation and Processing
Home-page: https://github.com/thieu1995/datarefiner
Author: Thieu
Author-email: nguyenthieu2102@gmail.com
License: GPLv3
Project-URL: Documentation, https://datarefiner.readthedocs.io/
Project-URL: Source Code, https://github.com/thieu1995/datarefiner
Project-URL: Bug Tracker, https://github.com/thieu1995/datarefiner/issues
Project-URL: Change Log, https://github.com/thieu1995/datarefiner/blob/master/ChangeLog.md
Project-URL: Forum, https://t.me/+fRVCJGuGJg1mNDg1
Keywords: Data refining,Data tidying,Data munging,Data grooming,Data optimization,Data formatting,Data structuringData filtering,Data cleansing,Data standardization,Data enrichment,Feature engineering,Data wrangling,Data transformation,Data scaling,Data normalizationData preprocessing,Data preparation,Data analysis
Classifier: Development Status :: 5 - Production/Stable
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Education
Classifier: Intended Audience :: Information Technology
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: GNU General Public License v3 (GPLv3)
Classifier: Natural Language :: English
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Topic :: System :: Benchmark
Classifier: Topic :: Scientific/Engineering
Classifier: Topic :: Scientific/Engineering :: Mathematics
Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
Classifier: Topic :: Scientific/Engineering :: Information Analysis
Classifier: Topic :: Scientific/Engineering :: Visualization
Classifier: Topic :: Scientific/Engineering :: Bio-Informatics
Classifier: Topic :: Software Development :: Build Tools
Classifier: Topic :: Software Development :: Libraries
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Classifier: Topic :: Utilities
Requires-Python: >=3.8
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: numpy >=1.17.1
Requires-Dist: scipy >=1.7.1
Requires-Dist: scikit-learn >=1.0.2
Requires-Dist: pandas >=1.3.5
Requires-Dist: permetrics >=2.0.0
Provides-Extra: dev
Requires-Dist: pytest >=7.0 ; extra == 'dev'
Requires-Dist: pytest-cov ==4.0.0 ; extra == 'dev'
Requires-Dist: flake8 >=4.0.1 ; extra == 'dev'


# DataRefiner: An Advanced Toolkit for Data Transformation and Processing

[![GitHub release](https://img.shields.io/badge/release-0.1.0-yellow.svg)](https://github.com/thieu1995/DataRefiner/releases)
[![Wheel](https://img.shields.io/pypi/wheel/gensim.svg)](https://pypi.python.org/pypi/DataRefiner) 
[![PyPI version](https://badge.fury.io/py/DataRefiner.svg)](https://badge.fury.io/py/DataRefiner)
![PyPI - Python Version](https://img.shields.io/pypi/pyversions/DataRefiner.svg)
![PyPI - Status](https://img.shields.io/pypi/status/DataRefiner.svg)
![PyPI - Downloads](https://img.shields.io/pypi/dm/DataRefiner.svg)
[![Downloads](https://static.pepy.tech/badge/DataRefiner)](https://pepy.tech/project/DataRefiner)
[![Tests & Publishes to PyPI](https://github.com/thieu1995/DataRefiner/actions/workflows/publish-package.yaml/badge.svg)](https://github.com/thieu1995/DataRefiner/actions/workflows/publish-package.yaml)
[![Documentation Status](https://readthedocs.org/projects/DataRefiner/badge/?version=latest)](https://DataRefiner.readthedocs.io/en/latest/?badge=latest)
[![Chat](https://img.shields.io/badge/Chat-on%20Telegram-blue)](https://t.me/+fRVCJGuGJg1mNDg1)
[![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.12820732.svg)](https://doi.org/10.5281/zenodo.12820732)
[![License: GPL v3](https://img.shields.io/badge/License-GPLv3-blue.svg)](https://www.gnu.org/licenses/gpl-3.0)

---

`DataRefiner` is a cutting-edge Python library designed to streamline and enhance the data transformation and 
processing workflow. Whether you're dealing with complex datasets in machine learning, data analysis, or any 
data-driven domain, DataRefiner provides an extensive suite of tools to ensure your data is clean, well-structured, 
and ready for insightful analysis.

Key features of DataRefiner include:

- **Advanced Data Transformation:** Perform sophisticated data manipulation tasks with ease, including normalization, scaling, encoding, and more.
- **Customizable Processing Pipelines:** Create flexible and reusable data processing pipelines tailored to your specific needs.
- **Integration with Popular Libraries:** Seamlessly integrate with libraries such as scikit-learn, pandas, and NumPy to enhance your existing data processing workflows.
- **User-Friendly API:** Enjoy an intuitive and easy-to-use interface that simplifies complex data transformation tasks.

DataRefiner is designed for data scientists, machine learning engineers, and researchers who demand precision and 
efficiency in their data preparation processes. Empower your data projects with DataRefiner and unlock the full potential of your data.


| **DataRefiner**     | **An Advanced Toolkit for Data Transformation and Processing**   |
|---------------------|------------------------------------------------------------------|
| **Free software**   | GNU General Public License (GPL) V3 license                      |
| **Documentation**   | https://datarefiner.readthedocs.io                               | 
| **Python versions** | \>= 3.8.x                                                        |  
| **Dependencies**    | numpy, scipy, scikit-learn, pandas, permetrics                   |



# Usage

* Install the [current PyPI release](https://pypi.python.org/pypi/datarefiner):
```sh 
$ pip install datarefiner
```

After installation, you can check DataRefiner version:

```sh
$ python
>>> import datarefiner
>>> datarefiner.__version__
```

Please go check out the [examples](/examples) folder. You'll be surprised by what this library can do for your data.
You can also read the [documentation](https://datarefiner.readthedocs.io/) for more detailed installation 
instructions, explanations, and examples.

# Citation Request

If you use this library for your project, please cite us with:

```bibtex
@software{thieu_2024_12820732,
  author       = {Nguyen Van Thieu},
  title        = {DataRefiner: An Advanced Toolkit for Data Transformation and Processing},
  month        = jul,
  year         = 2024,
  publisher    = {Zenodo},
  doi          = {10.5281/zenodo.12820731},
  url          = {https://github.com/thieu1995/DataRefiner}
}
```


# Official Links (Get support for questions and answers)

* [Official source code repository](https://github.com/thieu1995/DataRefiner)
* [Official document](https://datarefiner.readthedocs.io/)
* [Download releases](https://pypi.org/project/datarefiner/) 
* [Issue tracker](https://github.com/thieu1995/DataRefiner/issues) 
* [Notable changes log](/ChangeLog.md)
* [Official discussion group](https://t.me/+fRVCJGuGJg1mNDg1)
