Metadata-Version: 2.4
Name: meddatasets
Version: 0.1.0
Summary: A curated collection of medical and healthcare datasets for data analysis, clinical research, epidemiology, and education. Includes cancer data, chronic disease diagnostics, hospital management records, public health statistics, and more, sourced from Kaggle.
Home-page: https://github.com/lightbluetitan/meddatasets-py
Author: Renzo Caceres Rossi
Author-email: Renzo Caceres Rossi <arenzocaceresrossi@gmail.com>
Maintainer-email: Renzo Caceres Rossi <arenzocaceresrossi@gmail.com>
License: MIT License
        
        Copyright (c) 2026 Renzo Caceres Rossi
        
        Permission is hereby granted, free of charge, to any person obtaining a copy
        of this software and associated documentation files (the "Software"), to deal
        in the Software without restriction, including without limitation the rights
        to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
        copies of the Software, and to permit persons to whom the Software is
        furnished to do so, subject to the following conditions:
        
        The above copyright notice and this permission notice shall be included in all
        copies or substantial portions of the Software.
        
        THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
        IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
        FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
        AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
        LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
        OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
        SOFTWARE.
        
Keywords: datasets,medicine,health,public health,cancer,clinical data,chronic diseases,diabetes,health statistics,epidemiology,data science,research,data analysis,hospital management,machine learning,kaggle
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Education
Classifier: Intended Audience :: Science/Research
Classifier: Topic :: Scientific/Engineering :: Information Analysis
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Classifier: Topic :: Scientific/Engineering :: Medical Science Apps.
Classifier: Topic :: Scientific/Engineering :: Bio-Informatics
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Programming Language :: Python :: 3 :: Only
Classifier: Operating System :: OS Independent
Classifier: Natural Language :: English
Requires-Python: >=3.8
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: pandas>=1.5
Dynamic: author
Dynamic: home-page
Dynamic: license-file
Dynamic: requires-python

# meddatasets

[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
[![Python 3.8+](https://img.shields.io/badge/python-3.8+-blue.svg)](https://www.python.org/downloads/)

The `meddatasets` package provides a curated collection of medical and healthcare
datasets for data analysis, clinical research, epidemiology, and education in Python.
It includes cancer data, chronic disease diagnostics, hospital management records,
public health statistics, and more, all sourced from Kaggle.

## Installation
You can install the `meddatasets` package from PyPI:
```bash
pip install meddatasets
```

## Usage
```python
import meddatasets as md

# List all available datasets
datasets = md.list_datasets()
print(datasets)

# Load a specific dataset by name
df = md.load_dataset('breast_cancer')
print(df.head())

# Describe a dataset
description = md.describe('smoking_cancer_risk')
print(description)
```
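
A mistyped dataset name is an easy mistake to make, so it can help to check the name against the available list before loading. The sketch below is one possible way to do that and is not part of `meddatasets`: `load_checked` is a hypothetical helper, and it assumes that `list_datasets()` returns an iterable of name strings and that `load_dataset()` takes one such name. The demo uses a stand-in loader so it runs without the package installed.

```python
from difflib import get_close_matches


def load_checked(name, available, loader):
    """Call loader(name) only if name is in available; otherwise suggest close matches.

    Hypothetical helper, not provided by meddatasets.
    """
    names = list(available)
    if name not in names:
        hints = get_close_matches(name, names, n=3)
        suffix = f" Did you mean: {', '.join(hints)}?" if hints else ""
        raise ValueError(f"Unknown dataset {name!r}.{suffix}")
    return loader(name)


# Intended use with the package (assumption: list_datasets() yields name strings):
#   import meddatasets as md
#   df = load_checked("breast_cancer", md.list_datasets(), md.load_dataset)

# Stand-in demo with a fake loader, so the sketch runs on its own
demo_names = ["breast_cancer", "smoking_cancer_risk", "covid_worldwide"]
print(load_checked("covid_worldwide", demo_names, lambda n: f"loaded {n}"))
```

Passing the loader as an argument keeps the helper independent of the package, which also makes it easy to exercise with a fake loader.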

## 📊 Some Available Datasets

| Dataset | Description |
|---------|-------------|
| `breast_cancer` | Breast cancer data derived from the Breast Cancer Wisconsin (Diagnostic) dataset. |
| `smoking_cancer_risk` | Smoking and cancer risk analysis data. |
| `covid_worldwide` | COVID-19 cases and deaths worldwide. |
| `water_pollution_disease` | Water pollution and its impact on public health. |

> Run `meddatasets.list_datasets()` or `md.list_datasets()` (using `md` as alias) to see the full list of available datasets.

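When the full list is long, filtering the names by keyword can narrow it down. The following is a minimal sketch rather than a feature of the package: `find_datasets` is a hypothetical helper, and it assumes the names are plain strings like those in the table above.

```python
def find_datasets(names, keyword):
    """Return the dataset names containing keyword (case-insensitive), sorted.

    Hypothetical helper, not provided by meddatasets.
    """
    needle = keyword.lower()
    return sorted(name for name in names if needle in name.lower())


# With the package installed, pass md.list_datasets() instead of this sample list
# (assumption: it returns an iterable of name strings).
sample_names = [
    "breast_cancer",
    "smoking_cancer_risk",
    "covid_worldwide",
    "water_pollution_disease",
]
print(find_datasets(sample_names, "cancer"))
# → ['breast_cancer', 'smoking_cancer_risk']
```
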
## Disclaimer

`meddatasets` is intended for **educational and research purposes only**.
The datasets provided should not be used for clinical diagnosis or medical
decision-making.

## License

The `meddatasets` library is released under the **MIT License**, allowing free use for both commercial and non-commercial purposes.
See the [LICENSE](LICENSE) file for details.
