Metadata-Version: 2.1
Name: pd_explain
Version: 1.0.0
Summary: Create explanation to dataframe
Project-URL: Git, https://github.com/analysis-bots/pd-explain
Description-Content-Type: text/markdown
License-File: LICENSE
License-File: NOTICE
Requires-Dist: wheel
Requires-Dist: pandas >=1.4.2
Requires-Dist: numpy >=1.20.3
Requires-Dist: python-dotenv
Requires-Dist: singleton-decorator
Requires-Dist: matplotlib
Requires-Dist: fedex-generator >=0.0.7


# PD-EXPLAIN

PD-EXPLAIN is a Python library that wraps Pandas, allowing users to obtain multiple type of query explanations over Pandas DataFrames. 
PD-EXPLAIN is under active development, currently featuring deviation-based explanations (for filter, join, and set operations), and explanations for high-variance group-by-and-aggregate operations. Both explainers utilizes the [FEDEX](https://www.vldb.org/pvldb/vol15/p3854-gilad.pdf) system.

* We will soon support outlier explanations based on the [SCORPION](https://sirrice.github.io/files/papers/scorpion-vldb13.pdf) systems, and Boolean-query explanations based on [this paper](https://arxiv.org/abs/2112.08874).






## Installation

Install pd-explain with pip or by git ssh

```bash
  pip install pd-explain
  
  pip install git+ssh://git@github.com/analysis-bots/pd-explain.git
```

For cloning this project use
```bash
git clone git@github.com:analysis-bots/pd-explain.git

cd pd_explain

pip install -r requirements.txt
```

## Demo

[Demo Spotify example](https://github.com/analysis-bots/pd-explain/blob/main/Examples/Notebooks/PD-explain%20DEMO.ipynb)


![Demo Spotify example notebook - click to view](./assets/pdexplain_demo.gif)

## Documentation

[Documentation](https://stirring-medovik-ba9b36.netlify.app/src/pd_explain.html)


## Citation Information
TBD



## Authors

- [@Eden Isakov](https://github.com/edenIsakov)
- [@analysis-bots](https://github.com/analysis-bots)

