Metadata-Version: 2.1
Name: dataset-format-benchmark
Version: 0.0.10
Summary: Image dataset format benchmark
Home-page: https://github.com/kamikaze/dataset-format-benchmark
Author: Oleg Korsak
Author-email: kamikaze.is.waiting.you@gmail.com
License: gpl-3
Project-URL: Documentation, https://github.com/kamikaze/dataset-format-benchmark/wiki
Platform: any
Classifier: Development Status :: 4 - Beta
Classifier: Programming Language :: Python
Requires-Python: >=3.11
Description-Content-Type: text/x-rst; charset=UTF-8
License-File: LICENSE
License-File: AUTHORS.rst
Requires-Dist: Cython (==0.29.33)
Requires-Dist: h5py (==3.8.0)
Requires-Dist: imageio[freeimage,opencv,pillow,pyav] (==2.25.1)
Requires-Dist: kaggle (==1.5.12)
Requires-Dist: matplotlib (==3.7.0)
Requires-Dist: numpy (==1.24.2)
Requires-Dist: Pillow (==9.4.0)
Requires-Dist: pkgconfig (==1.5.5)
Requires-Dist: pytorch-lightning (==1.9.2)
Requires-Dist: rawpy (==0.18.0)
Requires-Dist: scikit-learn (==1.2.1)
Requires-Dist: scipy (==1.10.0)
Requires-Dist: seaborn (==0.12.2)
Requires-Dist: torch
Requires-Dist: torchvision
Requires-Dist: tqdm (==4.64.1)
Requires-Dist: zarr (==2.14.1)
Provides-Extra: cuda
Requires-Dist: cupy-cuda12x (==11.5.0) ; extra == 'cuda'
Provides-Extra: testing
Requires-Dist: pytest ; extra == 'testing'
Requires-Dist: pytest-cov ; extra == 'testing'

dataset-format-benchmark
========================

This package benchmarks different image formats for storing datasets used in machine learning tasks.

Installation
------------

Make sure the required system dependencies are installed:

.. code:: bash

   sudo apt install pkg-config libhdf5-dev

Then create a virtual environment and install the Python dependencies:

.. code:: bash

   python3.11 -m venv venv --upgrade-deps
   source venv/bin/activate
   python -m pip install -U -r requirements_dev.txt

   # To run on an NVIDIA GPU:
   python -m pip install -U torch torchvision

   # To run on CPU only:
   python -m pip install -U torch torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/cpu

   # h5py needs Cython but fails to install it on its own, so install it explicitly
   python -m pip install -U Cython

   # Install the package in development mode
   python setup.py develop
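
Once the steps above finish, it can help to confirm that the main dependencies can be located in the active virtual environment before starting a long benchmark run. The following is a minimal sketch and not part of this package: ``missing_modules`` and ``REQUIRED_MODULES`` are hypothetical names, and the module list is only an illustrative selection of the pinned dependencies.

.. code:: python

   from importlib.util import find_spec

   # Import names for a few of the pinned dependencies (Pillow is imported
   # as PIL, scikit-learn as sklearn). Illustrative selection, not exhaustive.
   REQUIRED_MODULES = [
       "h5py", "zarr", "numpy", "PIL", "imageio",
       "rawpy", "sklearn", "torch", "torchvision",
   ]


   def missing_modules(names):
       """Return the top-level module names that cannot be located."""
       return [name for name in names if find_spec(name) is None]


   if __name__ == "__main__":
       missing = missing_modules(REQUIRED_MODULES)
       if missing:
           print("Missing modules:", ", ".join(missing))
       else:
           print("All checked modules were found.")

Note that ``find_spec`` only locates a module and does not import it, so a build that is present but broken would still pass this check.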

Running the dataset format benchmark
------------------------------------

.. code:: bash

   python -m dataset_format_benchmark --data-root /path/to/datasets/
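
The same command can also be launched from a Python script, for example to verify that the data directory exists first. This is a rough sketch under that assumption: ``build_command`` and ``run_benchmark`` are hypothetical helpers, not functions provided by the package, and only the ``--data-root`` option shown above is used.

.. code:: python

   import subprocess
   import sys
   from pathlib import Path


   def build_command(data_root):
       """Build the benchmark command line for the given dataset directory."""
       return [
           sys.executable, "-m", "dataset_format_benchmark",
           "--data-root", str(data_root),
       ]


   def run_benchmark(data_root):
       """Run the benchmark as a subprocess.

       Raises NotADirectoryError if the data root is missing and
       CalledProcessError if the benchmark exits with a non-zero status.
       """
       if not Path(data_root).is_dir():
           raise NotADirectoryError(f"Data root does not exist: {data_root}")
       subprocess.run(build_command(data_root), check=True)

Using ``sys.executable`` keeps the subprocess in the same virtual environment as the calling script.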
