Metadata-Version: 2.1
Name: kedro
Version: 0.18.2
Summary: Kedro helps you build production-ready data and analytics pipelines
Home-page: https://github.com/kedro-org/kedro
Author: Kedro
License: Apache Software License (Apache 2.0)
Keywords: pipelines,machine learning,data pipelines,data science,data engineering
Classifier: Development Status :: 4 - Beta
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Requires-Python: >=3.7, <3.11
Description-Content-Type: text/markdown
License-File: LICENSE.md
Requires-Dist: anyconfig (~=0.10.0)
Requires-Dist: attrs (~=21.3)
Requires-Dist: cachetools (~=4.1)
Requires-Dist: click (<9.0)
Requires-Dist: cookiecutter (<3.0,>=2.1.1)
Requires-Dist: dynaconf (<4.0,>=3.1.2)
Requires-Dist: fsspec (<=2022.1,>=2021.4)
Requires-Dist: gitpython (~=3.0)
Requires-Dist: importlib-metadata (>=3.6)
Requires-Dist: jmespath (<1.0,>=0.9.5)
Requires-Dist: pip-tools (~=6.5)
Requires-Dist: pluggy (~=1.0.0)
Requires-Dist: python-json-logger (~=2.0)
Requires-Dist: PyYAML (<7.0,>=4.2)
Requires-Dist: rich (~=12.0)
Requires-Dist: rope (~=0.21.0)
Requires-Dist: setuptools (>=38.0)
Requires-Dist: toml (~=0.10)
Requires-Dist: toposort (~=1.5)
Provides-Extra: all
Requires-Dist: Pillow (~=9.0) ; extra == 'all'
Requires-Dist: PyYAML (<7.0,>=4.2) ; extra == 'all'
Requires-Dist: SQLAlchemy (~=1.2) ; extra == 'all'
Requires-Dist: biopython (~=1.73) ; extra == 'all'
Requires-Dist: dask[complete] (~=2021.10) ; extra == 'all'
Requires-Dist: delta-spark (~=1.0) ; extra == 'all'
Requires-Dist: docutils (==0.16) ; extra == 'all'
Requires-Dist: geopandas (<1.0,>=0.6.0) ; extra == 'all'
Requires-Dist: hdfs (<3.0,>=2.5.8) ; extra == 'all'
Requires-Dist: holoviews (~=1.13.0) ; extra == 'all'
Requires-Dist: ipykernel (<7.0,>=5.3) ; extra == 'all'
Requires-Dist: lxml (~=4.6) ; extra == 'all'
Requires-Dist: matplotlib (<4.0,>=3.0.3) ; extra == 'all'
Requires-Dist: myst-parser (~=0.17.2) ; extra == 'all'
Requires-Dist: nbsphinx (==0.8.1) ; extra == 'all'
Requires-Dist: nbstripout (~=0.4) ; extra == 'all'
Requires-Dist: networkx (~=2.4) ; extra == 'all'
Requires-Dist: openpyxl (<4.0,>=3.0.6) ; extra == 'all'
Requires-Dist: pandas-gbq (<1.0,>=0.12.0) ; extra == 'all'
Requires-Dist: pandas (~=1.3) ; extra == 'all'
Requires-Dist: plotly (<6.0,>=4.8.0) ; extra == 'all'
Requires-Dist: pyarrow (<7.0,>=1.0) ; extra == 'all'
Requires-Dist: pyproj (~=3.0) ; extra == 'all'
Requires-Dist: pyspark (<4.0,>=2.2) ; extra == 'all'
Requires-Dist: redis (~=4.1) ; extra == 'all'
Requires-Dist: requests (~=2.20) ; extra == 'all'
Requires-Dist: s3fs (<0.5,>=0.3.0) ; extra == 'all'
Requires-Dist: sphinx-autodoc-typehints (==1.11.1) ; extra == 'all'
Requires-Dist: sphinx-copybutton (==0.3.1) ; extra == 'all'
Requires-Dist: sphinx-rtd-theme (==0.4.1) ; extra == 'all'
Requires-Dist: sphinxcontrib-mermaid (~=0.7.1) ; extra == 'all'
Requires-Dist: sphinx (~=3.4.3) ; extra == 'all'
Requires-Dist: tensorflow (~=2.0) ; extra == 'all'
Requires-Dist: tables (~=3.6) ; (platform_system != "Windows") and extra == 'all'
Requires-Dist: tables (~=3.6.0) ; (platform_system == "Windows") and extra == 'all'
Provides-Extra: api
Requires-Dist: requests (~=2.20) ; extra == 'api'
Provides-Extra: api.apidataset
Requires-Dist: requests (~=2.20) ; extra == 'api.apidataset'
Provides-Extra: biosequence
Requires-Dist: biopython (~=1.73) ; extra == 'biosequence'
Provides-Extra: biosequence.biosequencedataset
Requires-Dist: biopython (~=1.73) ; extra == 'biosequence.biosequencedataset'
Provides-Extra: dask
Requires-Dist: dask[complete] (~=2021.10) ; extra == 'dask'
Provides-Extra: dask.parquetdataset
Requires-Dist: dask[complete] (~=2021.10) ; extra == 'dask.parquetdataset'
Provides-Extra: docs
Requires-Dist: docutils (==0.16) ; extra == 'docs'
Requires-Dist: sphinx (~=3.4.3) ; extra == 'docs'
Requires-Dist: sphinx-rtd-theme (==0.4.1) ; extra == 'docs'
Requires-Dist: nbsphinx (==0.8.1) ; extra == 'docs'
Requires-Dist: nbstripout (~=0.4) ; extra == 'docs'
Requires-Dist: sphinx-autodoc-typehints (==1.11.1) ; extra == 'docs'
Requires-Dist: sphinx-copybutton (==0.3.1) ; extra == 'docs'
Requires-Dist: ipykernel (<7.0,>=5.3) ; extra == 'docs'
Requires-Dist: sphinxcontrib-mermaid (~=0.7.1) ; extra == 'docs'
Requires-Dist: myst-parser (~=0.17.2) ; extra == 'docs'
Provides-Extra: geopandas
Requires-Dist: geopandas (<1.0,>=0.6.0) ; extra == 'geopandas'
Requires-Dist: pyproj (~=3.0) ; extra == 'geopandas'
Provides-Extra: geopandas.geojsondataset
Requires-Dist: geopandas (<1.0,>=0.6.0) ; extra == 'geopandas.geojsondataset'
Requires-Dist: pyproj (~=3.0) ; extra == 'geopandas.geojsondataset'
Provides-Extra: holoviews
Requires-Dist: holoviews (~=1.13.0) ; extra == 'holoviews'
Provides-Extra: holoviews.holoviewswriter
Requires-Dist: holoviews (~=1.13.0) ; extra == 'holoviews.holoviewswriter'
Provides-Extra: matplotlib
Requires-Dist: matplotlib (<4.0,>=3.0.3) ; extra == 'matplotlib'
Provides-Extra: matplotlib.matplotlibwriter
Requires-Dist: matplotlib (<4.0,>=3.0.3) ; extra == 'matplotlib.matplotlibwriter'
Provides-Extra: networkx
Requires-Dist: networkx (~=2.4) ; extra == 'networkx'
Provides-Extra: networkx.networkxdataset
Requires-Dist: networkx (~=2.4) ; extra == 'networkx.networkxdataset'
Provides-Extra: pandas
Requires-Dist: SQLAlchemy (~=1.2) ; extra == 'pandas'
Requires-Dist: lxml (~=4.6) ; extra == 'pandas'
Requires-Dist: openpyxl (<4.0,>=3.0.6) ; extra == 'pandas'
Requires-Dist: pandas-gbq (<1.0,>=0.12.0) ; extra == 'pandas'
Requires-Dist: pandas (~=1.3) ; extra == 'pandas'
Requires-Dist: pyarrow (<7.0,>=1.0) ; extra == 'pandas'
Provides-Extra: pandas.csvdataset
Requires-Dist: pandas (~=1.3) ; extra == 'pandas.csvdataset'
Provides-Extra: pandas.exceldataset
Requires-Dist: pandas (~=1.3) ; extra == 'pandas.exceldataset'
Requires-Dist: openpyxl (<4.0,>=3.0.6) ; extra == 'pandas.exceldataset'
Provides-Extra: pandas.featherdataset
Requires-Dist: pandas (~=1.3) ; extra == 'pandas.featherdataset'
Provides-Extra: pandas.gbqquerydataset
Requires-Dist: pandas (~=1.3) ; extra == 'pandas.gbqquerydataset'
Requires-Dist: pandas-gbq (<1.0,>=0.12.0) ; extra == 'pandas.gbqquerydataset'
Provides-Extra: pandas.gbqtabledataset
Requires-Dist: pandas (~=1.3) ; extra == 'pandas.gbqtabledataset'
Requires-Dist: pandas-gbq (<1.0,>=0.12.0) ; extra == 'pandas.gbqtabledataset'
Provides-Extra: pandas.genericdataset
Requires-Dist: pandas (~=1.3) ; extra == 'pandas.genericdataset'
Provides-Extra: pandas.hdfdataset
Requires-Dist: pandas (~=1.3) ; extra == 'pandas.hdfdataset'
Requires-Dist: tables (~=3.6) ; (platform_system != "Windows") and extra == 'pandas.hdfdataset'
Requires-Dist: tables (~=3.6.0) ; (platform_system == "Windows") and extra == 'pandas.hdfdataset'
Provides-Extra: pandas.jsondataset
Requires-Dist: pandas (~=1.3) ; extra == 'pandas.jsondataset'
Provides-Extra: pandas.parquetdataset
Requires-Dist: pandas (~=1.3) ; extra == 'pandas.parquetdataset'
Requires-Dist: pyarrow (<7.0,>=1.0) ; extra == 'pandas.parquetdataset'
Provides-Extra: pandas.sqlquerydataset
Requires-Dist: pandas (~=1.3) ; extra == 'pandas.sqlquerydataset'
Requires-Dist: SQLAlchemy (~=1.2) ; extra == 'pandas.sqlquerydataset'
Provides-Extra: pandas.sqltabledataset
Requires-Dist: pandas (~=1.3) ; extra == 'pandas.sqltabledataset'
Requires-Dist: SQLAlchemy (~=1.2) ; extra == 'pandas.sqltabledataset'
Provides-Extra: pandas.xmldataset
Requires-Dist: pandas (~=1.3) ; extra == 'pandas.xmldataset'
Requires-Dist: lxml (~=4.6) ; extra == 'pandas.xmldataset'
Requires-Dist: tables (~=3.6) ; (platform_system != "Windows") and extra == 'pandas'
Requires-Dist: tables (~=3.6.0) ; (platform_system == "Windows") and extra == 'pandas'
Provides-Extra: pillow
Requires-Dist: Pillow (~=9.0) ; extra == 'pillow'
Provides-Extra: pillow.imagedataset
Requires-Dist: Pillow (~=9.0) ; extra == 'pillow.imagedataset'
Provides-Extra: plotly
Requires-Dist: pandas (~=1.3) ; extra == 'plotly'
Requires-Dist: plotly (<6.0,>=4.8.0) ; extra == 'plotly'
Provides-Extra: plotly.jsondataset
Requires-Dist: plotly (<6.0,>=4.8.0) ; extra == 'plotly.jsondataset'
Provides-Extra: plotly.plotlydataset
Requires-Dist: pandas (~=1.3) ; extra == 'plotly.plotlydataset'
Requires-Dist: plotly (<6.0,>=4.8.0) ; extra == 'plotly.plotlydataset'
Provides-Extra: redis
Requires-Dist: redis (~=4.1) ; extra == 'redis'
Provides-Extra: spark
Requires-Dist: delta-spark (~=1.0) ; extra == 'spark'
Requires-Dist: hdfs (<3.0,>=2.5.8) ; extra == 'spark'
Requires-Dist: pyspark (<4.0,>=2.2) ; extra == 'spark'
Requires-Dist: s3fs (<0.5,>=0.3.0) ; extra == 'spark'
Provides-Extra: spark.deltatabledataset
Requires-Dist: pyspark (<4.0,>=2.2) ; extra == 'spark.deltatabledataset'
Requires-Dist: hdfs (<3.0,>=2.5.8) ; extra == 'spark.deltatabledataset'
Requires-Dist: s3fs (<0.5,>=0.3.0) ; extra == 'spark.deltatabledataset'
Requires-Dist: delta-spark (~=1.0) ; extra == 'spark.deltatabledataset'
Provides-Extra: spark.sparkdataset
Requires-Dist: pyspark (<4.0,>=2.2) ; extra == 'spark.sparkdataset'
Requires-Dist: hdfs (<3.0,>=2.5.8) ; extra == 'spark.sparkdataset'
Requires-Dist: s3fs (<0.5,>=0.3.0) ; extra == 'spark.sparkdataset'
Provides-Extra: spark.sparkhivedataset
Requires-Dist: pyspark (<4.0,>=2.2) ; extra == 'spark.sparkhivedataset'
Requires-Dist: hdfs (<3.0,>=2.5.8) ; extra == 'spark.sparkhivedataset'
Requires-Dist: s3fs (<0.5,>=0.3.0) ; extra == 'spark.sparkhivedataset'
Provides-Extra: spark.sparkjdbcdataset
Requires-Dist: pyspark (<4.0,>=2.2) ; extra == 'spark.sparkjdbcdataset'
Requires-Dist: hdfs (<3.0,>=2.5.8) ; extra == 'spark.sparkjdbcdataset'
Requires-Dist: s3fs (<0.5,>=0.3.0) ; extra == 'spark.sparkjdbcdataset'
Provides-Extra: tensorflow
Requires-Dist: tensorflow (~=2.0) ; extra == 'tensorflow'
Provides-Extra: tensorflow.tensorflowmodeldataset
Requires-Dist: tensorflow (~=2.0) ; extra == 'tensorflow.tensorflowmodeldataset'
Provides-Extra: yaml
Requires-Dist: PyYAML (<7.0,>=4.2) ; extra == 'yaml'
Requires-Dist: pandas (~=1.3) ; extra == 'yaml'
Provides-Extra: yaml.yamldataset
Requires-Dist: pandas (~=1.3) ; extra == 'yaml.yamldataset'
Requires-Dist: PyYAML (<7.0,>=4.2) ; extra == 'yaml.yamldataset'

![Kedro Logo Banner](https://raw.githubusercontent.com/kedro-org/kedro/develop/static/img/kedro_banner.png)
[![Python version](https://img.shields.io/badge/python-3.7%20%7C%203.8%20%7C%203.9%20%7C%203.10-blue.svg)](https://pypi.org/project/kedro/)
[![PyPI version](https://badge.fury.io/py/kedro.svg)](https://pypi.org/project/kedro/)
[![Conda version](https://img.shields.io/conda/vn/conda-forge/kedro.svg)](https://anaconda.org/conda-forge/kedro)
[![License](https://img.shields.io/badge/license-Apache%202.0-blue.svg)](https://github.com/kedro-org/kedro/blob/main/LICENSE.md)
[![Discord Server](https://img.shields.io/discord/778216384475693066.svg?color=7289da&label=Kedro%20Discord&logo=discord&style=flat-square)](https://discord.gg/akJDeVaxnB)
![CircleCI - Main Branch](https://img.shields.io/circleci/build/github/kedro-org/kedro/main?label=main)
![Develop Branch Build](https://img.shields.io/circleci/build/github/kedro-org/kedro/develop?label=develop)
[![Documentation](https://readthedocs.org/projects/kedro/badge/?version=stable)](https://kedro.readthedocs.io/)


## What is Kedro?

Kedro is an open-source Python framework for creating reproducible, maintainable and modular data science code. It borrows concepts from software engineering and applies them to machine-learning code; applied concepts include modularity, separation of concerns and versioning. Kedro is hosted by the [LF AI & Data Foundation](https://lfaidata.foundation/).

## How do I install Kedro?

To install Kedro from the Python Package Index (PyPI) simply run:

```
pip install kedro
```

It is also possible to install Kedro using `conda`:

```
conda install -c conda-forge kedro
```

Our [Get Started guide](https://kedro.readthedocs.io/en/stable/get_started/prerequisites.html) contains full installation instructions, and includes how to set up Python virtual environments.


## What are the main features of Kedro?

![Kedro-Viz Pipeline Visualisation](https://github.com/kedro-org/kedro-viz/blob/main/.github/img/banner.png)
*A pipeline visualisation generated using [Kedro-Viz](https://github.com/kedro-org/kedro-viz)*


| Feature | What is this? |
|----------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Project Template | A standard, modifiable and easy-to-use project template based on [Cookiecutter Data Science](https://github.com/drivendata/cookiecutter-data-science/). |
| Data Catalog | A series of lightweight data connectors used to save and load data across many different file formats and file systems, including local and network file systems, cloud object stores, and HDFS. The Data Catalog also includes data and model versioning for file-based systems. |
| Pipeline Abstraction | Automatic resolution of dependencies between pure Python functions and data pipeline visualisation using [Kedro-Viz](https://github.com/kedro-org/kedro-viz). |
| Coding Standards | Test-driven development using [`pytest`](https://github.com/pytest-dev/pytest), produce well-documented code using [Sphinx](http://www.sphinx-doc.org/en/master/), create linted code with support for [`flake8`](https://github.com/PyCQA/flake8), [`isort`](https://github.com/PyCQA/isort) and [`black`](https://github.com/psf/black) and make use of the standard Python logging library. |
| Flexible Deployment | Deployment strategies that include single or distributed-machine deployment as well as additional support for deploying on Argo, Prefect, Kubeflow, AWS Batch and Databricks. |


## How do I use Kedro?

The [Kedro documentation](https://kedro.readthedocs.io/en/stable/) includes three examples to help get you started:
- A typical "Hello World" example, for an [entry-level description of the main Kedro concepts](https://kedro.readthedocs.io/en/stable/get_started/hello_kedro.html)
- An [introduction to the project template](https://kedro.readthedocs.io/en/stable/get_started/example_project.html) using the Iris dataset
- A more detailed [spaceflights tutorial](https://kedro.readthedocs.io/en/stable/tutorial/tutorial_template.html) to give you hands-on experience


## Why does Kedro exist?

Kedro is built upon our collective best-practice (and mistakes) trying to deliver real-world ML applications that have vast amounts of raw unvetted data. We developed Kedro to achieve the following:
 - To address the main shortcomings of Jupyter notebooks, one-off scripts, and glue-code because there is a focus on
  creating **maintainable data science code**
 - To enhance **team collaboration** when different team members have varied exposure to software engineering concepts
 - To increase efficiency, because applied concepts like modularity and separation of concerns inspire the creation of
  **reusable analytics code**


## The humans behind Kedro

Kedro is maintained by [a product team](https://kedro.readthedocs.io/en/stable/faq/faq.html) and a number of [contributors from across the world](https://github.com/kedro-org/kedro/releases).


## Can I contribute?

Yes! Want to help build Kedro? Check out our [guide to contributing to Kedro](https://github.com/kedro-org/kedro/blob/main/CONTRIBUTING.md).


## Where can I learn more?

There is a growing community around Kedro. Have a look at the [Kedro FAQs](https://kedro.readthedocs.io/en/stable/faq/faq.html#how-can-i-find-out-more-about-kedro) to find projects using Kedro and links to articles, podcasts and talks.


## Who likes Kedro?

There are Kedro users across the world, who work at start-ups, major enterprises and academic institutions like [Absa](https://www.absa.co.za/),
[Acensi](https://acensi.eu/page/home),
[Advanced Programming Solutions SL](https://www.linkedin.com/feed/update/urn:li:activity:6863494681372721152/),
[AI Singapore](https://makerspace.aisingapore.org/2020/08/leveraging-kedro-in-100e/),
[Augment Partners](https://www.linkedin.com/posts/augment-partners_kedro-cheat-sheet-by-augment-activity-6858927624631283712-Ivqk),
[AXA UK](https://www.axa.co.uk/),
[Belfius](https://www.linkedin.com/posts/vangansen_mlops-machinelearning-kedro-activity-6772379995953238016-JUmo),
[Beamery](https://medium.com/hacking-talent/production-code-for-data-science-and-our-experience-with-kedro-60bb69934d1f),
[Caterpillar](https://www.caterpillar.com/),
[CRIM](https://www.crim.ca/en/),
[Dendra Systems](https://www.dendra.io/),
[Element AI](https://www.elementai.com/),
[GetInData](https://getindata.com/blog/running-machine-learning-pipelines-kedro-kubeflow-airflow),
[GMO](https://recruit.gmo.jp/engineer/jisedai/engineer/jisedai/engineer/jisedai/engineer/jisedai/engineer/jisedai/blog/kedro_and_mlflow_tracking/),
[Indicium](https://medium.com/indiciumtech/how-to-build-models-as-products-using-mlops-part-2-machine-learning-pipelines-with-kedro-10337c48de92),
[Imperial College London](https://github.com/dssg/barefoot-winnie-public),
[ING](https://www.ing.com),
[Jungle Scout](https://junglescouteng.medium.com/jungle-scout-case-study-kedro-airflow-and-mlflow-use-on-production-code-150d7231d42e),
[Helvetas](https://www.linkedin.com/posts/lionel-trebuchon_mlflow-kedro-ml-ugcPost-6747074322164154368-umKw),
[Leapfrog](https://www.lftechnology.com/blog/ai-pipeline-kedro/),
[McKinsey & Company](https://www.mckinsey.com/alumni/news-and-insights/global-news/firm-news/kedro-from-proprietary-to-open-source),
[Mercado Libre Argentina](https://www.mercadolibre.com.ar),
[Modec](https://www.modec.com/),
[Mosaic Data Science](https://www.youtube.com/watch?v=fCWGevB366g),
[NaranjaX](https://www.youtube.com/watch?v=_0kMmRfltEQ),
[NASA](https://github.com/nasa/ML-airport-taxi-out),
[Open Data Science LatAm](https://www.odesla.org/),
[Prediqt](https://prediqt.co/),
[QuantumBlack](https://medium.com/quantumblack/introducing-kedro-the-open-source-library-for-production-ready-machine-learning-code-d1c6d26ce2cf),
[Retrieva](https://tech.retrieva.jp/entry/2020/07/28/181414),
[Roche](https://www.roche.com/),
[Sber](https://www.linkedin.com/posts/seleznev-artem_welcome-to-kedros-documentation-kedro-activity-6767523561109385216-woTt),
[Société Générale](https://www.societegenerale.com/en),
[Telkomsel](https://medium.com/life-at-telkomsel/how-we-build-a-production-grade-data-pipeline-7004e56c8c98),
[Universidad Rey Juan Carlos](https://github.com/vchaparro/MasterThesis-wind-power-forecasting/blob/master/thesis.pdf),
[UrbanLogiq](https://urbanlogiq.com/),
[Wildlife Studios](https://wildlifestudios.com),
[WovenLight](https://www.wovenlight.com/) and
[XP](https://youtu.be/wgnGOVNkXqU?t=2210).

Kedro has also won [Best Technical Tool or Framework for AI](https://awards.ai/the-awards/previous-awards/the-4th-ai-award-winners/) in the 2019 Awards AI competition and a merit award for the 2020 [UK Technical Communication Awards](https://uktcawards.com/announcing-the-award-winners-for-2020/). It is listed on the 2020 [ThoughtWorks Technology Radar](https://www.thoughtworks.com/radar/languages-and-frameworks/kedro) and the 2020 [Data & AI Landscape](https://mattturck.com/data2020/).

## How can I cite Kedro?

If you're an academic, Kedro can also help you, for example, as a tool to solve the problem of reproducible research. Use the "Cite this repository" button on [our repository](https://github.com/kedro-org/kedro) to generate a citation from the [CITATION.cff file](https://docs.github.com/en/repositories/managing-your-repositorys-settings-and-features/customizing-your-repository/about-citation-files).
