Metadata-Version: 2.4
Name: pd_explain
Version: 1.1.0
Summary: Create explanation to dataframe
Project-URL: Git, https://github.com/analysis-bots/pd-explain
Description-Content-Type: text/markdown
License-File: LICENSE
License-File: NOTICE
Requires-Dist: wheel
Requires-Dist: pandas>=2.2.3
Requires-Dist: numpy>=2.1.3
Requires-Dist: python-dotenv
Requires-Dist: singleton-decorator>=1.0.0
Requires-Dist: matplotlib>=3.9.0
Requires-Dist: fedex-generator>=1.0.6
Requires-Dist: cluster-explorer>=1.0.2
Requires-Dist: external-explainers>=1.0.1
Requires-Dist: python-dotenv~=1.0.0
Requires-Dist: openai~=1.66.0
Requires-Dist: ipywidgets>=8.1.0
Requires-Dist: together>=1.4.6
Requires-Dist: openai>=1.66.5
Requires-Dist: dill>=0.3.8
Dynamic: description
Dynamic: description-content-type
Dynamic: license-file
Dynamic: project-url
Dynamic: requires-dist
Dynamic: summary


# PD-EXPLAIN

PD-EXPLAIN is a Python library that wraps Pandas, allowing users to obtain multiple types of query explanations over Pandas DataFrames.
PD-EXPLAIN is under active development. It currently features two kinds of interestingness-based explanations: deviation-based explanations (for filter, join, and set operations) and explanations for high-variance group-by-and-aggregate operations. Both explainers use the [FEDEX](https://www.vldb.org/pvldb/vol15/p3854-gilad.pdf) system.
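
To give a feel for what "deviation-based" means, here is a minimal conceptual sketch in plain Python. It does not use the pd-explain API and is not the FEDEX algorithm: the helper names, the toy data, and the choice of total variation distance as the deviation measure are all assumptions made for illustration. The idea is only that a column whose value distribution shifts the most after a filter is a likely candidate for explaining that filter.

```python
from collections import Counter


def value_distribution(rows, column):
    """Return the relative frequency of each value of `column` in `rows`."""
    counts = Counter(row[column] for row in rows)
    total = sum(counts.values())
    return {value: n / total for value, n in counts.items()}


def total_variation(p, q):
    """Total variation distance between two discrete distributions."""
    keys = set(p) | set(q)
    return 0.5 * sum(abs(p.get(k, 0.0) - q.get(k, 0.0)) for k in keys)


def rank_columns_by_deviation(before, after, columns):
    """Rank columns by how much their distribution shifted (largest first)."""
    scores = {
        c: total_variation(value_distribution(before, c), value_distribution(after, c))
        for c in columns
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical toy data, not taken from the demo notebook.
songs = [
    {"genre": "pop", "decade": "2010s", "popularity": 80},
    {"genre": "pop", "decade": "2000s", "popularity": 70},
    {"genre": "rock", "decade": "2010s", "popularity": 40},
    {"genre": "rock", "decade": "2000s", "popularity": 30},
]

# The "query" being explained: keep only popular songs.
popular = [s for s in songs if s["popularity"] > 60]

ranking = rank_columns_by_deviation(songs, popular, ["genre", "decade"])
print(ranking)
```

In this toy data the filter keeps only pop songs, so `genre` shifts while `decade` keeps the same distribution, and `genre` ranks first.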

The system also supports aggregate outlier explanations, based on the [SCORPION](https://sirrice.github.io/files/papers/scorpion-vldb13.pdf) system, and will soon fully support Boolean-query explanations based on [this paper](https://arxiv.org/abs/2112.08874).
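
The intuition behind aggregate outlier explanations can be sketched as follows. This is a simplified illustration in plain Python, not the pd-explain API and not the SCORPION algorithm, whose influence measure and predicate search are more involved. The `influence` helper, the sensor data, and the candidate predicates are hypothetical. The sketch scores each candidate predicate by how far an unexpectedly high mean drops once the rows matching that predicate are removed.

```python
from statistics import mean


def influence(rows, predicate, value_key):
    """Change in the mean of `value_key` when rows matching `predicate` are removed."""
    kept = [r for r in rows if not predicate(r)]
    # Removing nothing or everything tells us nothing about the aggregate.
    if not kept or len(kept) == len(rows):
        return 0.0
    return mean(r[value_key] for r in rows) - mean(r[value_key] for r in kept)


# Hypothetical readings whose mean temperature looks too high.
readings = [
    {"sensor": "a", "temp": 20.0},
    {"sensor": "b", "temp": 21.0},
    {"sensor": "c", "temp": 95.0},
    {"sensor": "a", "temp": 22.0},
]

# One candidate predicate per sensor: "sensor == <name>".
candidates = {s: (lambda r, s=s: r["sensor"] == s) for s in ("a", "b", "c")}

scores = {name: influence(readings, pred, "temp") for name, pred in candidates.items()}
best = max(scores, key=scores.get)
print(best, scores[best])
```

A positive score means removing the matching rows lowers the mean, so the predicate with the largest score is the strongest candidate explanation for the high aggregate.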

PD-EXPLAIN was demonstrated at VLDB '24.




## Installation

Install pd-explain with pip, either from the package index or directly from the Git repository over SSH:

```bash
# Install the released package
pip install pd-explain

# Or install from the GitHub repository over SSH
pip install git+ssh://git@github.com/analysis-bots/pd-explain.git
```

To clone the project and install its dependencies, use:
```bash
git clone git@github.com:analysis-bots/pd-explain.git

cd pd-explain

pip install -r requirements.txt
```
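
After either install route, one way to confirm that the package is visible to your interpreter is to query its installed metadata. The sketch below uses only the standard library. The `installed_version` helper is a hypothetical name introduced here, and the distribution name `pd_explain` is taken from the package metadata above.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(distribution):
    """Return the installed version of `distribution`, or None if it is absent."""
    try:
        return version(distribution)
    except PackageNotFoundError:
        return None


found = installed_version("pd_explain")
print(f"pd_explain: {found or 'not installed'}")
```

If this prints `not installed`, check that pip ran against the same Python environment you are using to run the script.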

## Demo

[Demo Spotify example](https://github.com/analysis-bots/pd-explain/blob/main/src/demo.ipynb)


![Demo Spotify example notebook - click to view](./assets/pdexplain_demo.gif)

## Documentation

[Documentation](https://pd-explain-documentation.readthedocs.io/en/main/index.html)


## Citation Information
TBD



## Authors

- [@Eden Isakov](https://github.com/edenIsakov)
- [@Itay Elyashiv](https://github.com/ItayELY)
- [@analysis-bots](https://github.com/analysis-bots)

