Metadata-Version: 2.1
Name: pypairs
Version: 2.0.2
Summary: A Python-reimplementation of the Pairs algorithm described by A. Scialdone et al. (2015)
Home-page: https://github.com/rfechtner/pypairs
Author: Ron Fechtner
Author-email: ronfechtner@gmail.com
License: https://github.com/rfechtner/pypairs/blob/master/LICENSE
Project-URL: Original Paper, http://dx.doi.org/10.1016/j.ymeth.2015.06.021
Project-URL: GitHub URL, https://github.com/rfechtner/pypairs
Description: # PyPairs - A python scRNA-Seq classifier
        
        This is a python-reimplementation of the _Pairs_ algorithm as described by A. Scialdone et. al. (2015).
        Original Paper available under: https://doi.org/10.1016/j.ymeth.2015.06.021
        
        The algorithm aims to classify single cells based on their transcriptomic signal. Initially created to predict cell
        cycle phase from scRNA-Seq data, this algorithm can be used for various applications.
        
        It is a supervised maschine learning algorithm and as such it consits of two components: 
        training (sandbag) and prediction (cyclone)
        
        ## sandbag
        
        
        This function implements the training step of the pair-based prediction method. Pairs of genes _(A, B)_ are identified 
        from a training data set, with known category for each sample. In each pair, the fraction of cells in category 1
        with expression of A > B (based on expression values in the dataset) and the fraction with B > A in each other category 
        exceeds a set threshold fraction. These pairs are defined as the marker pairs for category 1. This is repeated for each 
        category to obtain a separate set of marker pairs.
        
        ## cyclone
        
        This function implements the classification step. To illustrate, consider classification of cell cycles into G1 phase (a 
        category of cell cycle). Pairs of marker genes are identified with sandbag, where the expression of the first gene in 
        the training data is greater than the second in G1 phase but less than the second in all other phases. 
        For each cell, cyclone calculates the proportion of all marker pairs where the expression of the first gene is greater 
        than the second in the new data. A high proportion suggests that the cell is likely to belong in G1 phase, as the 
        expression ranking in the new data is consistent with that in the training data. Proportions are not directly comparable
        between phases due to the use of different sets of gene pairs for each phase. Instead, proportions are converted into 
        scores (see below) that account for the size and precision of the proportion estimate. The same process is repeated for 
        all phases, using the corresponding set of marker pairs in pairs. 
        
        ## example
        
        .. tbd ..
Keywords: scRNA classifier machine-learning marker pairs
Platform: UNKNOWN
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Science/Research
Classifier: Topic :: Scientific/Engineering :: Bio-Informatics
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3 :: Only
Provides-Extra: plot
