Metadata-Version: 2.1
Name: stereoAlign
Version: 0.0.3
Summary: A toolkit package of data integration
Author: zhangchao
Author-email: zhangchao5@genomics.cn
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Requires-Python: >=3.8
Description-Content-Type: text/markdown
Requires-Dist: matplotlib >=2.2
Requires-Dist: tensorflow
Requires-Dist: scanpy
Requires-Dist: louvain
Requires-Dist: python-igraph
Requires-Dist: h5py
Requires-Dist: pandas
Requires-Dist: numpy
Requires-Dist: scikit-learn
Requires-Dist: harmony-pytorch
Requires-Dist: scanorama
Requires-Dist: scgen
Requires-Dist: scvi-tools
Requires-Dist: mnnpy
Requires-Dist: bbknn
Requires-Dist: desc

[![python >3.8.8](https://img.shields.io/badge/python-3.8.8-brightgreen)](https://www.python.org/)
[![Downloads](https://static.pepy.tech/badge/stereoalign)](https://pepy.tech/project/stereoalign)
[![Downloads](https://static.pepy.tech/badge/stereoalign/month)](https://pepy.tech/project/stereoalign)
[![Downloads](https://static.pepy.tech/badge/stereoalign/week)](https://pepy.tech/project/stereoalign)
[![Documentation Status](https://readthedocs.org/projects/stereoalign-tutorial/badge/?version=latest)](https://stereoalign-tutorial.readthedocs.io/en/latest/?badge=latest)
# A toolkit for data integration
***        
Behold, a magnificent toolkit emerges, designed to seamlessly integrate the vast realm of data in the realm of single-cell genomics.   Within its hallowed digital confines, this toolkit bestows upon researchers the power to harmoniously merge and analyze the intricate tapestry of cellular information. Crafted with meticulous care, this toolkit embodies the epitome of elegance and efficiency.   It serves as a conduit, enabling the synthesis of diverse datasets from single-cell genomics experiments, unlocking new realms of knowledge and understanding. The power of data integration within this toolkit extends beyond mere aggregation.   It empowers researchers to unravel the intricate web of cellular interactions, uncovering hidden patterns, identifying novel cell types, and discerning the complex dynamics that govern cellular behavior. Through its refined algorithms and advanced statistical techniques, this toolkit illuminates the path towards deeper insights and discoveries.   It refines and enhances the quality of data, mitigating confounding factors and removing noise, thus unveiling the true essence of the cellular landscape. As the sun sets on the horizon of single-cell genomics, this toolkit emerges as a guiding beacon, illuminating the path towards a more comprehensive understanding of cellular complexity.   Embrace its power and unlock the secrets hidden within the realm of single-cell genomics.          
            
# Dependence        
[![torch-1.10.0](https://img.shields.io/badge/torch-1.10.0-red)](https://pytorch.org/get-started/previous-versions/)
[![pandas-1.2.4](https://img.shields.io/badge/pandas-1.2.4-lightgrey)](https://github.com/pandas-dev/pandas)
[![scikit-learn-0.24](https://img.shields.io/badge/scikit-0.24.x-brightgreen)](https://github.com/scikit-learn/scikit-learn/tree/0.24.X)
[![scipy-0.12.x](https://img.shields.io/badge/scipy-0.12.x-yellow)](https://github.com/scipy/scipy/tree/maintenance/0.12.x)
[![scanpy-1.9.1](https://img.shields.io/badge/scanpy-1.9.1-informational)](https://pypi.org/project/scanpy/)           
                
# Install         
The `stereoAlign` python package is available on [Pypi](https://pypi.org/project/stereoAlign/) and can be installed through      
```python
pip install stereoAlign
```            
Import `stereoAlign` in python  
```python
import stereoAlign
```
We created the python package called `stereoAlign` that uses `scanpy` to streamline the integration of single-cell datasets and evaluate the results. The package contains several modules for preprocessing an `anndata` object, running integration methods and evaluating the resulting using a number of metrics. Functions for the data integration methods are in `stereoAlign.alignment` or for short `stereoAlign.alg` and metrics are under `stereoAlign.metrics`.
       
# Tutorials        
Quick Start [https://stereoalign-tutorial.readthedocs.io/en/latest/index.html#](https://stereoalign-tutorial.readthedocs.io/en/latest/index.html#)        
        
# Integration Tools        
This repository contains the code for the `stereoAlign` package for data integration tools.           
This toolkit that is compared include:          

| **Integration Method** |                                 **Invoke**                                  |      **Output**      |                                     **Recommendation index**                                      |                                                                                    **Describe**                                                                                     |  Language  |
|:----------------------:|:---------------------------------------------------------------------------:|:--------------------:|:-------------------------------------------------------------------------------------------------:|:-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------:|:----------:|
|       `Harmony`        |           <font size=2>`stereoAlign.alg.harmony_alignment`</font>           |      Embedding       |              <font color=YellowGreen size=5 face=Rockwell Extra Bold>__***__</font>               |                                   <font size=2>Leverage iterative clustering with maximum diversity for batch correction and integration.</font>                                    | Python / R |
|      `Scanorama`       |          <font size=2>`stereoAlign.alg.scanorama_alignment`</font>          |      Embedding       |              <font color=YellowGreen size=5 face=Rockwell Extra Bold>__****__</font>              |                            <font size=2>Leverage computer vision algorithms for panorama stitching matches for batch correction and integration.</font>                             |   Python   |
|        `scGEN`         |            <font size=2>`stereoAlign.alg.scgen_alignment`</font>            |       Features       |              <font color=YellowGreen size=5 face=Rockwell Extra Bold>__****__</font>              |           <font size=2>Use variational autoencoders to reduce the dimension of gene expression matrix, and apply extra label to batch correction and integrations.</font>           |   Python   |
|         `scvi`         |            <font size=2>`stereoAlign.alg.scvi_alignment`</font>             |      Embedding       |              <font color=YellowGreen size=5 face=Rockwell Extra Bold>__****__</font>              |                                                  <font size=2>Use variational autoencoders to batch correct and integrate.</font>                                                   |   Python   |
|         `MNN`          |             <font size=2>`stereoAlign.alg.mnn_alignment`</font>             |       Features       |               <font color=YellowGreen size=5 face=Rockwell Extra Bold>__**__</font>               |                                             <font size=2>Leverage mutual nearest neighbor for batch correction and integration.</font>                                              |   Python   |                                                                                                                                                                                                                  
|        `BBKNN`         |            <font size=2>`stereoAlign.alg.bbknn_alignment`</font>            |        Graph         |              <font color=YellowGreen size=5 face=Rockwell Extra Bold>__***__</font>               |     <font size=2>BBKNN taking each cell and identifying a (smaller) k nearest neighbours in each batch separately, and merged into a final neighbour list for the cell.</font>      |   Python   |                                                                                                                                                                                                                  
|        `Combat`        |           <font size=2>`stereoAlign.alg.combat_alignment`</font>            |       Features       |               <font color=YellowGreen size=5 face=Rockwell Extra Bold>__*__</font>                |                                             <font size=2>Leverage empirical Bayesian model for batch correction and integration.</font>                                             |   Python   |                                                                                                                                                                                                                 
|         `DESC`         |            <font size=2>`stereoAlign.alg.desc_alignment`</font>             |      Embedding       |               <font color=YellowGreen size=5 face=Rockwell Extra Bold>__**__</font>               | <font size=2>DESC eventually eliminates batch effects by recurrent self-learning, as long as technological changes between batches are fewer than real biological variances.</font> |   Python   |                                                                                                                                                                                                                  
|       `PRECAST`        |                            <font size=2></font>                             |      Embedding       |              <font color=YellowGreen size=5 face=Rockwell Extra Bold>__****__</font>              |                            <font size=2>PRECAST is a probabilistic model-based approach that integrates SRT datasets from multiple tissue slides.</font>                            |     R      |                                                                                                                                                                                                                  
|      `spatiAlign`      |         <font size=2>`stereoAlign.alg.spatialign_alignment`</font>          | Embedding / Features |             <font color=YellowGreen size=5 face=Rockwell Extra Bold>__*****__</font>              |                       <font size=2>spatiAlign is an unsupervised across domain adapatation methods for SRT datasets batch correction and integration.</font>                        |   Python   |
|        `SCALEX`        |           <font size=2>`stereoAlign.alg.scalex_alignment`</font>            |      Embedding       |               <font color=YellowGreen size=5 face=Rockwell Extra Bold>__**__</font>               |                                    <font size=2>SCALEX leverage domain specific batch normalization for batch correction and integration.</font>                                    |   Python   |


        
# Metric        
|  **Metric Method**  |                  **Invoke**                   |
|:-------------------:|:---------------------------------------------:|
|         ARI         |         `stereoAlign.metrics.cal_ari`         |
| Graph connectivity  | `stereoAlign.metrics.cal_graph_connectivity`  |
|        KBET         |        `stereoAlign.metrics.cal_kbet`         |
|        LISI         |        `stereoAlign.metrics.cal_lisi`         |
|     Silhouette      |     `stereoAlign.metrics.cal_silhouette`      |
        

 
# Disclaimer        
***This is not an official product.***    
            
            
