Metadata-Version: 2.1
Name: kedro-datasets
Version: 1.4.2
Summary: Kedro-Datasets is where you can find all of Kedro's data connectors.
Author: Kedro
License: Apache Software License (Apache 2.0)
Project-URL: Source, https://github.com/kedro-org/kedro-plugins/tree/main/kedro-datasets
Project-URL: Documentation, https://docs.kedro.org
Project-URL: Tracker, https://github.com/kedro-org/kedro-plugins/issues
Requires-Python: <3.11,>=3.7
Description-Content-Type: text/markdown
Requires-Dist: kedro (>=0.16)
Provides-Extra: all
Requires-Dist: Pillow (~=9.0) ; extra == 'all'
Requires-Dist: PyYAML (<7.0,>=4.2) ; extra == 'all'
Requires-Dist: SQLAlchemy (<3.0,>=1.4) ; extra == 'all'
Requires-Dist: biopython (~=1.73) ; extra == 'all'
Requires-Dist: compress-pickle[lz4] (~=2.1.0) ; extra == 'all'
Requires-Dist: dask[complete] (~=2021.10) ; extra == 'all'
Requires-Dist: delta-spark (<3.0,>=1.0) ; extra == 'all'
Requires-Dist: delta-spark (~=1.2.1) ; extra == 'all'
Requires-Dist: geopandas (<1.0,>=0.6.0) ; extra == 'all'
Requires-Dist: hdfs (<3.0,>=2.5.8) ; extra == 'all'
Requires-Dist: holoviews (~=1.13.0) ; extra == 'all'
Requires-Dist: lxml (~=4.6) ; extra == 'all'
Requires-Dist: matplotlib (<4.0,>=3.0.3) ; extra == 'all'
Requires-Dist: networkx (~=2.4) ; extra == 'all'
Requires-Dist: opencv-python (~=4.5.5.64) ; extra == 'all'
Requires-Dist: openpyxl (<4.0,>=3.0.6) ; extra == 'all'
Requires-Dist: pandas-gbq (<0.18.0,>=0.12.0) ; extra == 'all'
Requires-Dist: pandas (<3.0,>=1.3) ; extra == 'all'
Requires-Dist: plotly (<6.0,>=4.8.0) ; extra == 'all'
Requires-Dist: polars (~=0.17.0) ; extra == 'all'
Requires-Dist: pyarrow (>=6.0) ; extra == 'all'
Requires-Dist: pyarrow (~=8.0) ; extra == 'all'
Requires-Dist: pyodbc (~=4.0) ; extra == 'all'
Requires-Dist: pyproj (~=3.0) ; extra == 'all'
Requires-Dist: pyspark (<4.0,>=2.2) ; extra == 'all'
Requires-Dist: redis (~=4.1) ; extra == 'all'
Requires-Dist: requests (~=2.20) ; extra == 'all'
Requires-Dist: s3fs (<0.5,>=0.3.0) ; extra == 'all'
Requires-Dist: scikit-learn (~=1.0.2) ; extra == 'all'
Requires-Dist: scipy (~=1.7.3) ; extra == 'all'
Requires-Dist: snowflake-snowpark-python (~=1.0.0) ; extra == 'all'
Requires-Dist: triad (<1.0,>=0.6.7) ; extra == 'all'
Requires-Dist: tensorflow (~=2.0) ; (platform_system != "Darwin" or platform_machine != "arm64") and extra == 'all'
Requires-Dist: tables (~=3.6) ; (platform_system != "Windows") and extra == 'all'
Requires-Dist: tensorflow-macos (~=2.0) ; (platform_system == "Darwin" and platform_machine == "arm64") and extra == 'all'
Requires-Dist: tables (~=3.6.0) ; (platform_system == "Windows") and extra == 'all'
Provides-Extra: api
Requires-Dist: requests (~=2.20) ; extra == 'api'
Provides-Extra: api.apidataset
Requires-Dist: requests (~=2.20) ; extra == 'api.apidataset'
Provides-Extra: biosequence
Requires-Dist: biopython (~=1.73) ; extra == 'biosequence'
Provides-Extra: biosequence.biosequencedataset
Requires-Dist: biopython (~=1.73) ; extra == 'biosequence.biosequencedataset'
Provides-Extra: dask
Requires-Dist: dask[complete] (~=2021.10) ; extra == 'dask'
Requires-Dist: triad (<1.0,>=0.6.7) ; extra == 'dask'
Provides-Extra: dask.parquetdataset
Requires-Dist: dask[complete] (~=2021.10) ; extra == 'dask.parquetdataset'
Requires-Dist: triad (<1.0,>=0.6.7) ; extra == 'dask.parquetdataset'
Provides-Extra: databricks
Requires-Dist: delta-spark (~=1.2.1) ; extra == 'databricks'
Requires-Dist: pandas (<3.0,>=1.3) ; extra == 'databricks'
Requires-Dist: pyspark (<4.0,>=2.2) ; extra == 'databricks'
Provides-Extra: databricks.managedtabledataset
Requires-Dist: pyspark (<4.0,>=2.2) ; extra == 'databricks.managedtabledataset'
Requires-Dist: pandas (<3.0,>=1.3) ; extra == 'databricks.managedtabledataset'
Requires-Dist: delta-spark (~=1.2.1) ; extra == 'databricks.managedtabledataset'
Provides-Extra: docs
Requires-Dist: docutils (==0.16) ; extra == 'docs'
Requires-Dist: sphinx (~=5.3.0) ; extra == 'docs'
Requires-Dist: sphinx-rtd-theme (==1.2.0) ; extra == 'docs'
Requires-Dist: sphinx-autodoc-typehints (==1.20.2) ; extra == 'docs'
Requires-Dist: sphinx-copybutton (==0.3.1) ; extra == 'docs'
Requires-Dist: sphinx-notfound-page ; extra == 'docs'
Requires-Dist: ipykernel (<7.0,>=5.3) ; extra == 'docs'
Requires-Dist: sphinxcontrib-mermaid (~=0.7.1) ; extra == 'docs'
Requires-Dist: myst-parser (~=1.0.0) ; extra == 'docs'
Requires-Dist: Jinja2 (<3.1.0) ; extra == 'docs'
Provides-Extra: geopandas
Requires-Dist: geopandas (<1.0,>=0.6.0) ; extra == 'geopandas'
Requires-Dist: pyproj (~=3.0) ; extra == 'geopandas'
Provides-Extra: geopandas.geojsondataset
Requires-Dist: geopandas (<1.0,>=0.6.0) ; extra == 'geopandas.geojsondataset'
Requires-Dist: pyproj (~=3.0) ; extra == 'geopandas.geojsondataset'
Provides-Extra: holoviews
Requires-Dist: holoviews (~=1.13.0) ; extra == 'holoviews'
Provides-Extra: holoviews.holoviewswriter
Requires-Dist: holoviews (~=1.13.0) ; extra == 'holoviews.holoviewswriter'
Provides-Extra: matplotlib
Requires-Dist: matplotlib (<4.0,>=3.0.3) ; extra == 'matplotlib'
Provides-Extra: matplotlib.matplotlibwriter
Requires-Dist: matplotlib (<4.0,>=3.0.3) ; extra == 'matplotlib.matplotlibwriter'
Provides-Extra: networkx
Requires-Dist: networkx (~=2.4) ; extra == 'networkx'
Provides-Extra: networkx.networkxdataset
Requires-Dist: networkx (~=2.4) ; extra == 'networkx.networkxdataset'
Provides-Extra: pandas
Requires-Dist: SQLAlchemy (<3.0,>=1.4) ; extra == 'pandas'
Requires-Dist: lxml (~=4.6) ; extra == 'pandas'
Requires-Dist: openpyxl (<4.0,>=3.0.6) ; extra == 'pandas'
Requires-Dist: pandas-gbq (<0.18.0,>=0.12.0) ; extra == 'pandas'
Requires-Dist: pandas (<3.0,>=1.3) ; extra == 'pandas'
Requires-Dist: pyarrow (>=6.0) ; extra == 'pandas'
Requires-Dist: pyodbc (~=4.0) ; extra == 'pandas'
Provides-Extra: pandas.csvdataset
Requires-Dist: pandas (<3.0,>=1.3) ; extra == 'pandas.csvdataset'
Provides-Extra: pandas.exceldataset
Requires-Dist: pandas (<3.0,>=1.3) ; extra == 'pandas.exceldataset'
Requires-Dist: openpyxl (<4.0,>=3.0.6) ; extra == 'pandas.exceldataset'
Provides-Extra: pandas.featherdataset
Requires-Dist: pandas (<3.0,>=1.3) ; extra == 'pandas.featherdataset'
Provides-Extra: pandas.gbqquerydataset
Requires-Dist: pandas (<3.0,>=1.3) ; extra == 'pandas.gbqquerydataset'
Requires-Dist: pandas-gbq (<0.18.0,>=0.12.0) ; extra == 'pandas.gbqquerydataset'
Provides-Extra: pandas.gbqtabledataset
Requires-Dist: pandas (<3.0,>=1.3) ; extra == 'pandas.gbqtabledataset'
Requires-Dist: pandas-gbq (<0.18.0,>=0.12.0) ; extra == 'pandas.gbqtabledataset'
Provides-Extra: pandas.genericdataset
Requires-Dist: pandas (<3.0,>=1.3) ; extra == 'pandas.genericdataset'
Provides-Extra: pandas.hdfdataset
Requires-Dist: pandas (<3.0,>=1.3) ; extra == 'pandas.hdfdataset'
Requires-Dist: tables (~=3.6) ; (platform_system != "Windows") and extra == 'pandas.hdfdataset'
Requires-Dist: tables (~=3.6.0) ; (platform_system == "Windows") and extra == 'pandas.hdfdataset'
Provides-Extra: pandas.jsondataset
Requires-Dist: pandas (<3.0,>=1.3) ; extra == 'pandas.jsondataset'
Provides-Extra: pandas.parquetdataset
Requires-Dist: pandas (<3.0,>=1.3) ; extra == 'pandas.parquetdataset'
Requires-Dist: pyarrow (>=6.0) ; extra == 'pandas.parquetdataset'
Provides-Extra: pandas.sqlquerydataset
Requires-Dist: pandas (<3.0,>=1.3) ; extra == 'pandas.sqlquerydataset'
Requires-Dist: SQLAlchemy (<3.0,>=1.4) ; extra == 'pandas.sqlquerydataset'
Requires-Dist: pyodbc (~=4.0) ; extra == 'pandas.sqlquerydataset'
Provides-Extra: pandas.sqltabledataset
Requires-Dist: pandas (<3.0,>=1.3) ; extra == 'pandas.sqltabledataset'
Requires-Dist: SQLAlchemy (<3.0,>=1.4) ; extra == 'pandas.sqltabledataset'
Provides-Extra: pandas.xmldataset
Requires-Dist: pandas (<3.0,>=1.3) ; extra == 'pandas.xmldataset'
Requires-Dist: lxml (~=4.6) ; extra == 'pandas.xmldataset'
Requires-Dist: tables (~=3.6) ; (platform_system != "Windows") and extra == 'pandas'
Requires-Dist: tables (~=3.6.0) ; (platform_system == "Windows") and extra == 'pandas'
Provides-Extra: pickle
Requires-Dist: compress-pickle[lz4] (~=2.1.0) ; extra == 'pickle'
Provides-Extra: pickle.pickledataset
Requires-Dist: compress-pickle[lz4] (~=2.1.0) ; extra == 'pickle.pickledataset'
Provides-Extra: pillow
Requires-Dist: Pillow (~=9.0) ; extra == 'pillow'
Provides-Extra: pillow.imagedataset
Requires-Dist: Pillow (~=9.0) ; extra == 'pillow.imagedataset'
Provides-Extra: plotly
Requires-Dist: pandas (<3.0,>=1.3) ; extra == 'plotly'
Requires-Dist: plotly (<6.0,>=4.8.0) ; extra == 'plotly'
Provides-Extra: plotly.jsondataset
Requires-Dist: plotly (<6.0,>=4.8.0) ; extra == 'plotly.jsondataset'
Provides-Extra: plotly.plotlydataset
Requires-Dist: pandas (<3.0,>=1.3) ; extra == 'plotly.plotlydataset'
Requires-Dist: plotly (<6.0,>=4.8.0) ; extra == 'plotly.plotlydataset'
Provides-Extra: polars
Requires-Dist: polars (~=0.17.0) ; extra == 'polars'
Provides-Extra: polars.csvdataset
Requires-Dist: polars (~=0.17.0) ; extra == 'polars.csvdataset'
Provides-Extra: redis
Requires-Dist: redis (~=4.1) ; extra == 'redis'
Provides-Extra: snowflake
Requires-Dist: pyarrow (~=8.0) ; extra == 'snowflake'
Requires-Dist: snowflake-snowpark-python (~=1.0.0) ; extra == 'snowflake'
Provides-Extra: snowflake.snowparktabledataset
Requires-Dist: snowflake-snowpark-python (~=1.0.0) ; extra == 'snowflake.snowparktabledataset'
Requires-Dist: pyarrow (~=8.0) ; extra == 'snowflake.snowparktabledataset'
Provides-Extra: spark
Requires-Dist: delta-spark (<3.0,>=1.0) ; extra == 'spark'
Requires-Dist: hdfs (<3.0,>=2.5.8) ; extra == 'spark'
Requires-Dist: pyspark (<4.0,>=2.2) ; extra == 'spark'
Requires-Dist: s3fs (<0.5,>=0.3.0) ; extra == 'spark'
Provides-Extra: spark.deltatabledataset
Requires-Dist: pyspark (<4.0,>=2.2) ; extra == 'spark.deltatabledataset'
Requires-Dist: hdfs (<3.0,>=2.5.8) ; extra == 'spark.deltatabledataset'
Requires-Dist: s3fs (<0.5,>=0.3.0) ; extra == 'spark.deltatabledataset'
Requires-Dist: delta-spark (<3.0,>=1.0) ; extra == 'spark.deltatabledataset'
Provides-Extra: spark.sparkdataset
Requires-Dist: pyspark (<4.0,>=2.2) ; extra == 'spark.sparkdataset'
Requires-Dist: hdfs (<3.0,>=2.5.8) ; extra == 'spark.sparkdataset'
Requires-Dist: s3fs (<0.5,>=0.3.0) ; extra == 'spark.sparkdataset'
Provides-Extra: spark.sparkhivedataset
Requires-Dist: pyspark (<4.0,>=2.2) ; extra == 'spark.sparkhivedataset'
Requires-Dist: hdfs (<3.0,>=2.5.8) ; extra == 'spark.sparkhivedataset'
Requires-Dist: s3fs (<0.5,>=0.3.0) ; extra == 'spark.sparkhivedataset'
Provides-Extra: spark.sparkjdbcdataset
Requires-Dist: pyspark (<4.0,>=2.2) ; extra == 'spark.sparkjdbcdataset'
Requires-Dist: hdfs (<3.0,>=2.5.8) ; extra == 'spark.sparkjdbcdataset'
Requires-Dist: s3fs (<0.5,>=0.3.0) ; extra == 'spark.sparkjdbcdataset'
Provides-Extra: svmlight
Requires-Dist: scikit-learn (~=1.0.2) ; extra == 'svmlight'
Requires-Dist: scipy (~=1.7.3) ; extra == 'svmlight'
Provides-Extra: svmlight.svmlightdataset
Requires-Dist: scikit-learn (~=1.0.2) ; extra == 'svmlight.svmlightdataset'
Requires-Dist: scipy (~=1.7.3) ; extra == 'svmlight.svmlightdataset'
Provides-Extra: tensorflow
Provides-Extra: tensorflow.tensorflowmodeldataset
Requires-Dist: tensorflow (~=2.0) ; (platform_system != "Darwin" or platform_machine != "arm64") and extra == 'tensorflow.tensorflowmodeldataset'
Requires-Dist: tensorflow-macos (~=2.0) ; (platform_system == "Darwin" and platform_machine == "arm64") and extra == 'tensorflow.tensorflowmodeldataset'
Requires-Dist: tensorflow (~=2.0) ; (platform_system != "Darwin" or platform_machine != "arm64") and extra == 'tensorflow'
Requires-Dist: tensorflow-macos (~=2.0) ; (platform_system == "Darwin" and platform_machine == "arm64") and extra == 'tensorflow'
Provides-Extra: video
Requires-Dist: opencv-python (~=4.5.5.64) ; extra == 'video'
Provides-Extra: video.videodataset
Requires-Dist: opencv-python (~=4.5.5.64) ; extra == 'video.videodataset'
Provides-Extra: yaml
Requires-Dist: PyYAML (<7.0,>=4.2) ; extra == 'yaml'
Requires-Dist: pandas (<3.0,>=1.3) ; extra == 'yaml'
Provides-Extra: yaml.yamldataset
Requires-Dist: pandas (<3.0,>=1.3) ; extra == 'yaml.yamldataset'
Requires-Dist: PyYAML (<7.0,>=4.2) ; extra == 'yaml.yamldataset'

# Kedro-Datasets

[![License](https://img.shields.io/badge/license-Apache%202.0-blue.svg)](https://opensource.org/licenses/Apache-2.0)
[![Python Version](https://img.shields.io/badge/python-3.7%20%7C%203.8%20%7C%203.9%20%7C%203.10-blue.svg)](https://pypi.org/project/kedro-datasets/)
[![PyPI Version](https://badge.fury.io/py/kedro-datasets.svg)](https://pypi.org/project/kedro-datasets/)
[![Code Style: Black](https://img.shields.io/badge/code%20style-black-black.svg)](https://github.com/ambv/black)

A Kedro plugin that powers Kedro's DataCatalog.

## Installation

`kedro-datasets` is a Python plugin. To install it:

```bash
pip install kedro-datasets
```
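
The package metadata also declares optional extras, both per group (for example `pandas`) and per dataset (for example `pandas.exceldataset`), so a command such as `pip install "kedro-datasets[pandas.exceldataset]"` pulls in only the libraries that connector needs. As a rough illustration of how extras map to requirements, the sketch below parses a few `Requires-Dist` lines copied from this package's metadata with Python's standard `email` parser. The helper name `requirements_for_extra` is hypothetical, and the marker matching is deliberately simplistic (plain substring matching, not a full environment-marker evaluation).

```python
from email import message_from_string
from typing import List

# A short excerpt of this package's metadata; the Requires-Dist lines are
# copied verbatim, everything else is trimmed for brevity.
SAMPLE_METADATA = """\
Metadata-Version: 2.1
Name: kedro-datasets
Requires-Dist: kedro (>=0.16)
Requires-Dist: pandas (<3.0,>=1.3) ; extra == 'pandas.csvdataset'
Requires-Dist: pandas (<3.0,>=1.3) ; extra == 'pandas.exceldataset'
Requires-Dist: openpyxl (<4.0,>=3.0.6) ; extra == 'pandas.exceldataset'
"""


def requirements_for_extra(metadata_text: str, extra: str) -> List[str]:
    """Return the requirements whose marker names the given extra.

    Hypothetical helper for illustration: it matches the marker text
    literally instead of evaluating it the way an installer would.
    """
    headers = message_from_string(metadata_text)
    selected = []
    for line in headers.get_all("Requires-Dist", []):
        # Split "name (spec) ; marker" into the requirement and its marker.
        requirement, _, marker = line.partition(";")
        if f"extra == '{extra}'" in marker:
            selected.append(requirement.strip())
    return selected


print(requirements_for_extra(SAMPLE_METADATA, "pandas.exceldataset"))
# ['pandas (<3.0,>=1.3)', 'openpyxl (<4.0,>=3.0.6)']
```

Requirements without an `extra` marker, such as `kedro (>=0.16)`, are installed unconditionally and are therefore not returned by this helper.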

# Datasets

Welcome to `kedro_datasets`, the home of Kedro's data connectors. Here you will find `AbstractDataSet` implementations created by QuantumBlack and external contributors.

## What `AbstractDataSet` implementations are supported?

We support a range of data connectors, including CSV, Excel, Parquet, Feather, HDF5, JSON, Pickle, SQL tables, SQL queries, Spark DataFrames, and more. Images are supported as well.

These data connectors are built on the APIs of `pandas`, `spark`, `networkx`, `matplotlib`, `yaml`, and more.

[The Data Catalog](https://kedro.readthedocs.io/en/stable/data/data_catalog.html) allows you to work with a range of file formats on local file systems, network file systems, cloud object stores, and Hadoop.

Here is a full list of [supported data connectors and APIs](https://kedro.readthedocs.io/en/stable/kedro.datasets.html).

## How can I create my own `AbstractDataSet` implementation?

Take a look at our [instructions on how to create your own `AbstractDataSet` implementation](https://kedro.readthedocs.io/en/stable/extend_kedro/custom_datasets.html).
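
As a minimal sketch of what such an implementation involves: a custom dataset subclasses `AbstractDataSet` and implements `_load`, `_save`, and `_describe`. The `PlainTextDataSet` class below is hypothetical and not part of `kedro_datasets`. So that the snippet runs even without Kedro installed, it falls back to a simplified stand-in base class; that stand-in is an assumption for illustration only and does not reproduce Kedro's actual implementation.

```python
from pathlib import Path
from typing import Any, Dict

try:
    # The real base class, in Kedro versions that export this spelling.
    from kedro.io import AbstractDataSet
except ImportError:
    # Simplified stand-in (illustration only, NOT Kedro's implementation):
    # the public load/save methods delegate to the hooks a subclass defines.
    class AbstractDataSet:
        def load(self) -> Any:
            return self._load()

        def save(self, data: Any) -> None:
            self._save(data)


class PlainTextDataSet(AbstractDataSet):
    """Hypothetical dataset that reads and writes a UTF-8 text file."""

    def __init__(self, filepath: str):
        self._filepath = Path(filepath)

    def _load(self) -> str:
        # Called by load(); returns the file contents as a string.
        return self._filepath.read_text(encoding="utf-8")

    def _save(self, data: str) -> None:
        # Called by save(); overwrites the file with the given text.
        self._filepath.write_text(data, encoding="utf-8")

    def _describe(self) -> Dict[str, Any]:
        # Describes the dataset's configuration.
        return {"filepath": str(self._filepath)}
```

Calling `PlainTextDataSet("notes.txt").save("hello")` followed by `.load()` should round-trip the text. The linked instructions cover what a production dataset additionally needs, such as versioning and remote file systems.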
