Metadata-Version: 2.1
Name: gisco-geodata
Version: 0.1.2.dev0
Summary: GISCO Geodata is a Python package that provides access to the European Commission's GISCO geospatial data.
Author-email: Alex-Andrei Cuvuliuc <cuvuliucalexandrei@gmail.com>
License: MIT
Project-URL: Homepage, https://github.com/alecsandrei/gisco-geodata
Keywords: gisco-geodata
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: MIT License
Classifier: Natural Language :: English
Classifier: Development Status :: 4 - Beta
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Requires-Python: >=3.9
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: httpx
Requires-Dist: async-cache
Provides-Extra: all
Requires-Dist: gisco-geodata[dev,geo]; extra == "all"
Provides-Extra: dev
Requires-Dist: pytest; extra == "dev"
Requires-Dist: matplotlib; extra == "dev"
Requires-Dist: zest.releaser[recommended]; extra == "dev"
Requires-Dist: eurostat; extra == "dev"
Requires-Dist: geopandas; extra == "dev"
Requires-Dist: ruff; extra == "dev"
Requires-Dist: mypy; extra == "dev"
Provides-Extra: geo
Requires-Dist: geopandas; extra == "geo"

Can be used to download geodata from the [GISCO API](https://gisco-services.ec.europa.eu/distribution/v2/).

[![DOI](https://zenodo.org/badge/789341717.svg)](https://zenodo.org/doi/10.5281/zenodo.11477852)

# Installation

## Unreleased version

```sh
git clone https://github.com/alecsandrei/gisco-geodata.git
cd gisco-geodata
python3 -m pip install .
```

## [PyPI](https://pypi.org/project/gisco-geodata/)

```sh
pip install gisco-geodata
```

# Requirements
- httpx
- async-cache
- Python >= 3.9
- Optional: GeoPandas

# Examples

Also check the [examples](https://github.com/alecsandrei/gisco-geodata/tree/main/examples) folder.

```python
import os
from pathlib import Path

from gisco_geodata import (
    NUTS,
    Countries,
    set_httpx_args,
    set_semaphore_value
)


if __name__ == '__main__':
    out_dir = Path(
        os.path.normpath(os.path.expanduser("~/Desktop"))
    )  # Desktop path

    set_httpx_args(verify=False)  # prevents SSLError in my case
    set_semaphore_value(20) # the maximum number of asynchronous API calls

    nuts = NUTS()

    nuts.download(
        file_format='shp',
        year='2021',
        spatial_type='BN',
        scale='60M',
        projection='4326',
        nuts_level='LEVL_3',
        out_dir=out_dir,
    )

    # Equivalent to the above
    datasets = nuts.get_datasets()
    datasets[-1].download(
        file_format='shp',
        spatial_type='BN',
        scale='60M',
        projection='4326',
        nuts_level='LEVL_3',
        out_dir=out_dir,
    )

    # Retrieve Country information as Polygons
    countries = Countries()

    # If you have geopandas installed, this will be a GDF.
    gdf = countries.get(
        countries=['RO', 'IT'],
        spatial_type='RG',
    )
    if not isinstance(gdf, list):
        print(gdf.head(5))
    else:
        print(gdf)

```

You can also use it with the [eurostat](https://pypi.org/project/eurostat/) python package.

```python
"""To use this script you will need to also install the 'eurostat' and the 'mapclassify' packages."""

"""
To use this script you will need to also install the following packages
    - eurostat
    - geopandas
    - mapclassify
"""

import geopandas as gpd
import matplotlib.pyplot as plt
from eurostat import get_data_df, get_toc_df, set_requests_args

from gisco_geodata import NUTS, set_httpx_args


def get_plot():
    set_httpx_args(verify=False)
    set_requests_args(verify=False)

    # Get the geometries from the gisco service.
    nuts = NUTS()
    level_2 = nuts.get(spatial_type='RG', nuts_level='LEVL_2')
    assert isinstance(level_2, gpd.GeoDataFrame)

    # Get the dataset information.
    eurostat_database = get_toc_df()
    title, code = eurostat_database.loc[
        eurostat_database['title']
        == 'Unemployment by sex, age, educational attainment level and NUTS 2 region (1 000)',
        ['title', 'code'],
    ].iloc[0]
    dataset = get_data_df(code)
    assert dataset is not None

    # Preprocess the dataset.
    dataset = dataset.loc[
        (dataset['isced11'] == 'TOTAL') & (dataset['sex'] == 'T')
    ]  # total unemployment rate
    # Join with the geometries.
    dataset = level_2.merge(
        dataset, left_on='NUTS_ID', right_on=r'geo\TIME_PERIOD'
    )
    assert isinstance(dataset, gpd.GeoDataFrame)

    # Plot.
    dataset.plot(
        column='2023', scheme='NaturalBreaks', legend=True, edgecolor='black'
    )
    plt.title(title, fontdict={'size': 15, 'wrap': True})
    plt.xlim(-25, 47)
    plt.ylim(30, 75)
    plt.tight_layout()
    plt.show()


if __name__ == '__main__':
    get_plot()


```

# Disclaimer

This plugin, Eurostat Downloader, is an independent project created by Cuvuliuc Alex-Andrei. It is not an official product of Eurostat, and Cuvuliuc Alex-Andrei is not affiliated, associated, authorized, endorsed by, or in any way officially connected with Eurostat or any of its subsidiaries or its affiliates.

# Copyright notice

Before using this package, please read the information [provided by Eurostat](https://ec.europa.eu/eurostat/web/gisco/geodata).
