Metadata-Version: 2.1
Name: pycellmech
Version: 2.1.7
Summary: A shape-based feature extractor for use in biological and medical studies.
Home-page: https://github.com/icm-dac/pycellmech
Author: Janan Arslan
Author-email: janan.arslan@icm-institute.org
License: UNKNOWN
Platform: UNKNOWN
Classifier: Programming Language :: Python :: 3
Classifier: License :: Other/Proprietary License
Classifier: Operating System :: OS Independent
Requires-Python: >=3.6
Description-Content-Type: text/markdown
Requires-Dist: matplotlib
Requires-Dist: numpy
Requires-Dist: opencv-python
Requires-Dist: pandas
Requires-Dist: scikit-learn

<div align="left">
  <img width="15%" src="/figures/pyrcellmech_logo_v2.png" alt="pycellmech Logo">
</div>

# PyCellMech

## Introduction

PyCellMech is designed to understand how shapes impact our understanding of medical conditions and/or their progression.

This package currently begins by introducing some preliminary shape-based analyses, with focus on geometric, polygonal and one-dimensional shape features.

## License

This project is licensed under the Creative Commons Attribution-NonCommercial 4.0 International Public License. See the [LICENSE](./LICENSE) file for details.


## Requirements

Packages used in PyCellMech include:

```
    matplotlib
    numpy
    math
    opencv-python
    pandas
    scikit-learn

```

## Usage

PyCellMech takes binarized masks as its input. There are no restrictions in terms of 
regions of interest (ROI), as the method will iterate through each ROI
within each binarized image. Whether you have a single image or multiple images to process,
binarized masks should be contained within a folder. 

Once you have your input images ready, download the code, head to the ``main.py`` 
file, and change the following settings (located on lines 176-178)::

```
    folder_path = '/path/to/binarized/images'
    csv_file_path = '/path/to/save/shape_features.csv'
    output_folder = '/path/to/save/feature/maps'
```

``csv_file_path`` will reflect the location you would like to save the CSV file, and ``output_folder`` is where the feature visualization map for every image processed will be saved. 

Feature visualization maps select the largest ROI and superimpose the extracted features on the ROI. Thus, this package provides quantitative and qualitative assessments. 

A handful of samples from the Kvasir dataset have been included in this repository for testing. 


## Shape Features

The current iteration of PyCellMech involves three shape-based feature classes: 

    - One-dimensional function shape features
    - Geometric Shape Features
    - Polygonal Approximation Shape Features

These feature classes will continue to be built upon in future iterations. A summary of 
these feature descriptions are found below. 

### One-dimensional function shape features

*Complex Coordinate*

A complex coordinate function derives a complex number from a point on an object's contour.

*Centroid Distance Function (CDF)*

CDF the distance from the centroid of an object to points on its boundary, providing a way to describe the shape's geometry relative to its center.

*Area Function (AF)*

AF quantifies the total surface area enclosed by the boundaries of the shape.

*Triange Area Representation (TAR)*

TAR calculates the area of triangles formed within or around the shape.

*Chord Length Function (CLF)*

CLF measures the distances between pairs of points on the shape's boundary.

### Geometric Shape Features

*Axis Minimum Inertia (AMI)*

AMI identifies the axis along which the shape exhibits the least resistance to rotational motion.

*Average Bending Energy (ABE)*

ABE quantifies the mean amount of effort required to deform the shape.

*Eccentricity*

Eccentricity describes the degree to which a shape deviates from being circular.

*Minimum Bounding Rectangle (MBR)*

MBR represents the smallest rectangle that entirely encloses the shape, representing its rectilinear approximation. 

*Circularity Ratio (CR)*

CR compares the area of the shape to the area of a circle with the same perimeter.

*Ellipse Variance and Moment Invariants (EM and EV)*

EV measures the deviation of a shape from an elliptical form, and EM quantifies shape characteristics that remain constant under transformations (e.g., rotation, scaling, and translation).

*Solidity*

Solidity calculates the ratio of the shape's area to the area of its convex hull.

### Polygonal Approximation Shape Features

*Distance Threshold Method (DTM)*

DTM  involves setting a specific distance limit to differentiate between relevant and irrelevant points.

*Polygon Evolution by Vertex Deletion (PEVD)*

PEVD is a process where vertices are incrementally removed from a polygonal shape to simplify its structure while trying to preserve its overall form and characteristics.

*Splitting Method (SM)*

SM involves dividing a shape into smaller, manageable segments or components.

*Minimum Perimeter Polygon (MPP)*

MPP is the polygon with the smallest possible perimeter that can enclose a given shape.

*K-Means Method*

KMeans clusters points on the shape into a specified number of groups based on their proximity.


## References

The PyCellMech package was inspired by the below works. The package will be continually updated based
on the latest research developments. 

```
P.-Y. Yin. Pattern Recognition Techniques, Technology and Applications. InTech, November
2008. ISBN 9789537619244. doi: 10.5772/90.
```

```
J. Chaki and N. Dey. A Beginners Guide to Image Shape Feature Extraction Techniques.
Taylor Francis Group, 2019. ISBN 9781000034301. doi: 10.5772/90.
```

Kvasir Dataset 
```
Konstantin Pogorelov, Kristin Ranheim Randel, Carsten Griwodz, Sigrun Losada Eskeland,
Thomas de Lange, Dag Johansen, Concetto Spampinato, Duc-Tien Dang-Nguyen, Math-
ias Lux, Peter Thelin Schmidt, Michael Riegler, and Pal Halvorsen. Kvasir: A multi-class
image dataset for computer aided gastrointestinal disease detection. In Proceedings of the
8th ACM on Multimedia Systems Conference, MMSys’17, pages 164–169, New York, NY,
USA, 2017. ACM. ISBN 978-1-4503-5002-0. doi: 10.1145/3083187.3083212.
```

## Citation
```
@misc{
arslan2024pycellmech,
title={PyCellMech: A shape-based feature extraction pipeline for elucidating biological differences between disease states},
author={Janan Arslan and Henri Chhoa and Ines Khemir and Romain Valabregue and Stephen Whitmarsh},
year={2024},
url={https://openreview.net/forum?id=VzvRnPMDhX}
}
```


