Metadata-Version: 2.1
Name: riboflow
Version: 1.0.0
Summary: Classifying Putative Riboswitch Sequences
Home-page: https://github.com/KeshavAdityaRP/riboflow
Author: Keshav Aditya R.P
Author-email: keshavaditya26896@gmail.com
License: UNKNOWN
Description: # RIBOFLOW - classifying riboswitches with >99% accuracy
        
        [riboflow](https://test.pypi.org/project/riboflow/) is a python package for classifying putative riboswitch sequences into one of 24 classes with > 99% accuracy. It is based on a [tensorflow](https://www.tensorflow.org) deep learning model. ``riboflow`` has been tested using ``Python 3.5.2``. 
        
        Installation
        ------------
        
        The easiest way to install the package is via ``pip``
        
            $ pip install riboflow
            
        Dependencies:
            
            numpy==1.14.5
            tensorflow==1.8.0   
            keras==2.2.0 
            
        A trained ``Bi-directional Recurent Neural Network (RNN) Model`` is integrated into the ``riboflow`` package (and installed automatically with the ``pip``). Note that the source code to generate the ``Bi-directional Recurent Neural Network Model`` is available. The git repository [Riboswitch Classification](https://github.com/RiboswitchClassifier/RiboswitchClassification) could be forked to generate a new model.
        
        Problem Statement
        -------------------
        Riboswitches are metabolite-sensing mRNAs, for e.g, amino acid or metal ion sensors, that switch conformation upon binding the cognate ligand, thereby exerting control on translation. It would be of interest to classfify the ligand-specificity of riboswitches given their sequence. 
        
        **The prediction problem**:
        
        Given the riboswitch sequence, predict the riboswitch class (as given by the ligand-specificity of the riboswitch).
        
        **Machine learning formulation**:
         - Input: Riboswitch sequence
         - Source dataset: Rfam database (rfam.org)
         - Output: Riboswitch class 
         - Best-performing Classifier: Bi-directional RNN (>99% accuracy)
         - Features used in the best-performing classifier: the full riboswitch sequence
        
        Usage
        -------------------
        
        Once `riboflow` is installed, please follow the steps to predict the class of a new riboswitch sequence:
        
        **1. Import the package**:
        
          - Inside the python shell or in the python file::
        
                > import riboflow
        
        **2. Construct a list of riboswitch sequences**:
        
                > # A sequence is a string in alphabet 'ATGC'
                > sequences = [
                    "TTTTTTTTGCAGGGGTGGCTTTAGGGCCTGAGAAGATACCCATTGAACCTGACCTGGCTAAAACCAGGGTAGGGAATTGCAGAAATGTCCTCATT",
                    "CTCTTATCCAGAGCGGTAGAGGGACTGGCCCTTTGAAGCCCAGCAACCTACACTTTTTGTTGTAAGGTGCTAACCTGAGCAGGAGAAATCCTGACCGATGAGAG",
                    "CCACGATAAAGGTAAACCCTGAGTGATCAGGGGGCGCAAAGTGTAGGATCTCAGCTCAAGTCATCTCCAGATAAGAAATATCAGAAAGATAGCCTTACTGCCGAA"
                  ]
        
        **3a. Predict the class for each riboswitch sequence**:
        
                > # Predict the most probable riboswitch class of each sequence
                > riboflow.predict(sequences, "predict_class")
                
        **3b. Predict a vector of class probabilities for each riboswitch sequence**:
        
                > # Predict probabilty of each riboswitch class associated with each sequence 
                > riboflow.predict(sequences, "predict_prob")
        
        Riboswitches Accounted For 
        ------------
        
            1.  'RF00050 - Flavin Mononucleotide Riboswitch',
            2.  'RF00059 - Thiamine pyrophosphate riboswitch',
            3.  'RF00162 - SAM - 1 Riboswitch',
            4.  'RF00167 - Purine Riboswitch',
            5.  'RF00168 - Lysine Riboswitch',
            6.  'RF00174 - Cobalamin riboswitch',
            7.  'RF00234 - Glucosamine-6-phosphate riboswitch',
            8.  'RF00380 - Ykok riboswitch(Magnesium sensing riboswitch)',
            9.  'RF00504 - Glycine riboswitch',
            10. 'RF00521 - SAM - 2 Riboswitch',
            11. 'RF00522 - pre-queosine riboswitch1',
            12. 'RF00634 - SAM - 4 Riboswitch',
            13. 'RF01051 - Cyclic di-GMP-I riboswitches',
            14. 'RF01054 - pre-queosine riboswitch2',
            15. 'RF01055 - Molybdenum Co-factor riboswitch',
            16. 'RF01057 - SAH Riboswitch',
            17. 'RF01725 - SAM -1 -4 Variant riboswitch',
            18. 'RF01726 - SAM - 2 Long loop riboswitch',
            19. 'RF01727 - SAM-SAH Riboswitch',
            20. 'RF01734 - Fluoride Riboswitch',
            21. 'RF01739 - Glutamine riboswitch',
            22. 'RF01763 - Guanidine - 3 Riboswitch',
            23. 'RF01767 - SAM - 3 Riboswitch',
            24. 'RF02683 - NiCo riboswitch(sense Nickel or Cobalt)'
            
        Additional information
        -----
        For more information, please refer to our manuscript below. 
        
        *Premkumar KAR, Bharanikumar R, Palaniappan A.* (2019) Classifying riboswitches with >99% accuracy. **Microorganisms** (to be submitted)
        
        Please cite us if you use our services.
        
        Package Structure
        -----
        
            .
            ├── build                       # Buildout project configuration
            ├── dist                        # Consists of  .whl and .tar package files
            ├── riboflow                    # Package Directory
            │   ├── __init__.py             # main file
            │   ├── rnn_24_model.h5         # Bi-directional Recurent Neural Network Model
            ├── riboflow.egg-info           # Egg information of the project
            ├── LICENSE                     # License
            ├── MANIFEST.in                 # To include the Bi-directional Recurent Neural Network Model within the package
            ├── README.md                   # Package description
            └── setup.py                    # Package metadata
        
        References and acknowledgements for pypi package development
        ----------
        
          * http://fouryears.eu/2014/03/19/structure-of-a-python-project/
          * http://www.jeffknupp.com/blog/2013/08/16/open-sourcing-a-python-project-the-right-way/
          * [Bharat Goel](https://github.com/BharatGoel36) provided help in packaging the application. 
          
        Authors
        ----------
        
          * [Keshav Aditya R.P](https://keshavadityarp.github.io)
            - [Github](https://github.com/KeshavAdityaRP)
            - [LinkedIn](https://www.linkedin.com/in/keshavadityarp/)
          * Ramit Bharanikumar
            - [Github](https://github.com/ramit29)
            - [LinkedIn](https://www.linkedin.com/in/ramit-bharanikumar-12a014114/)
          * Ashok Palaniappan
            - [Senior Assistant Professor](http://www.sastra.edu/staffprofiles/schools/scbt.php?staff_id=C2164)
            - [Github](https://github.com/apalania)
            - [LinkedIn](https://www.linkedin.com/in/ashokpalaniappan/)
        
        
        Copyright & License
        -------------------
        
        Copyright (c) 2019, the Authors. MIT License.
        
        
Platform: UNKNOWN
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Description-Content-Type: text/markdown
