Metadata-Version: 2.4
Name: mpljourney
Version: 0.1.0
Summary: A collection of cool datasets for Python.
Author-email: Joseph Barbier <joseph.barbierdarnal@mail.com>
License-Expression: MIT
Project-URL: Homepage, https://github.com/JosephBARBIERDARNAL/mpljourney
Project-URL: Issues, https://github.com/JosephBARBIERDARNAL/mpljourney/issues
Project-URL: Documentation, https://github.com/JosephBARBIERDARNAL/mpljourney
Project-URL: Repository, https://github.com/JosephBARBIERDARNAL/mpljourney
Keywords: datasets,data,exploration,matplotlib,journey,visualization,analysis
Classifier: Programming Language :: Python :: 3
Classifier: Operating System :: OS Independent
Classifier: Development Status :: 3 - Alpha
Requires-Python: >=3.9
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: narwhals>=1.0.0
Provides-Extra: geo
Requires-Dist: geopandas>=1.0.1; extra == "geo"
Dynamic: license-file



<!-- Automatically generated, uses README.qmd to modify README.md -->

# `mpljourney`

A collection of cool datasets for Python.

These datasets are primarily used for
[matplotlib-journey.com](https://www.matplotlib-journey.com/), an online
course to master data visualization with Python, but anyone can use
them.

> Note that `mpljourney` does not bundle the datasets. It fetches them
> from a [separate
> repo](https://github.com/JosephBARBIERDARNAL/data-matplotlib-journey).

<br>

To load one of the available datasets:

``` python
from mpljourney import load_dataset

df = load_dataset("accident-london")
```
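Since datasets are fetched from a separate repo, loading the same one
several times in a session may repeat the download. Whether
`mpljourney` caches anything itself is not stated here, so the following
is only a minimal sketch of in-memory caching. `make_cached_loader` is a
hypothetical helper, not part of `mpljourney`:

``` python
from functools import lru_cache


def make_cached_loader(loader):
    """Wrap a loader so that identical calls reuse the first result.

    Hypothetical helper: `loader` is any callable that takes a dataset
    name plus optional keyword arguments (all of them hashable).
    """

    @lru_cache(maxsize=None)
    def cached_load(name, **options):
        # Only runs on a cache miss; later identical calls return
        # the object stored by the first call.
        return loader(name, **options)

    return cached_load


# Usage (needs mpljourney installed and network access):
# from mpljourney import load_dataset
# load_cached = make_cached_loader(load_dataset)
# df = load_cached("accident-london")  # fetched
# df = load_cached("accident-london")  # reused from memory
```

The cache returns the same object on every hit, so copy the dataframe
before modifying it in place.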

By default, the dataset is loaded as a `pandas` dataframe. It can also
be loaded as any of "polars", "cudf", "pyarrow", or "modin", provided
the associated library is installed on your machine:

``` python
from mpljourney import load_dataset

df = load_dataset("accident-london", output_format="polars")
```
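Because each non-default format needs its library installed, it can help
to check what is available before calling `load_dataset`. The sketch
below does that with the standard library. `installed_formats` and
`load_preferred` are hypothetical helpers, and the preference order is
arbitrary:

``` python
from importlib.util import find_spec

# Non-default formats named above; the order is an arbitrary preference.
OPTIONAL_FORMATS = ("polars", "pyarrow", "modin", "cudf")


def installed_formats(candidates=OPTIONAL_FORMATS):
    """Return the candidate formats whose library can be imported."""
    return [name for name in candidates if find_spec(name) is not None]


def load_preferred(name, candidates=OPTIONAL_FORMATS):
    """Load `name` in the first installed format, else in the default one.

    Hypothetical helper: needs mpljourney installed and network access.
    """
    from mpljourney import load_dataset  # imported lazily on purpose

    found = installed_formats(candidates)
    if found:
        return load_dataset(name, output_format=found[0])
    return load_dataset(name)  # falls back to the default format
```

For example, `load_preferred("accident-london")` would return a `polars`
dataframe when `polars` is installed, and the default format otherwise.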

<br>

Install with:

``` shell
pip install mpljourney
```

<br><br>

## All datasets

## accident-london

![](img/accident-london.png)

## CO2

![](img/CO2.png)

## earthquakes

![](img/earthquakes.png)

## economic

![](img/economic.png)

## footprint

![](img/footprint.png)

## game-sales

![](img/game-sales.png)

## london

'london' is a geo dataset. The `geometry` column is hidden here to make
the table snippet readable.

![](img/london.png)

## mariokart

![](img/mariokart.png)

## natural-disasters

![](img/natural-disasters.png)

## netflix

![](img/netflix.png)

## newyork-airbnb

![](img/newyork-airbnb.png)

## newyork

'newyork' is a geo dataset. The `geometry` column is hidden here to make
the table snippet readable.

![](img/newyork.png)

## storms

![](img/storms.png)

## ufo

![](img/ufo.png)

## us-counties

'us-counties' is a geo dataset. The `geometry` column is hidden here to
make the table snippet readable.

![](img/us-counties.png)

## walks

![](img/walks.png)

## wine

![](img/wine.png)

## world

'world' is a geo dataset. The `geometry` column is hidden here to make
the table snippet readable.

![](img/world.png)
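
Four of the datasets above are flagged as geo datasets, and the package
metadata lists `geopandas` under an optional `geo` extra. Assuming the
geo datasets need that extra (this is not stated explicitly here), a
small check before loading could look like the sketch below.
`is_geo_dataset` and `missing_geo_hint` are hypothetical helpers:

``` python
from importlib.util import find_spec

# The four datasets flagged as geo datasets in the list above.
GEO_DATASETS = frozenset({"london", "newyork", "us-counties", "world"})


def is_geo_dataset(name):
    """Return True if `name` is one of the geo datasets listed above."""
    return name in GEO_DATASETS


def missing_geo_hint(name, geopandas_installed=None):
    """Return an install hint when a geo dataset lacks geopandas, else None.

    Assumption: geo datasets require the optional `geo` extra.
    """
    if geopandas_installed is None:
        geopandas_installed = find_spec("geopandas") is not None
    if is_geo_dataset(name) and not geopandas_installed:
        return 'pip install "mpljourney[geo]"'
    return None
```

Calling `missing_geo_hint("world")` before `load_dataset("world")` gives
either `None` or the command to install the extra.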
