Metadata-Version: 2.4
Name: sigmap
Version: 1.0.0
Summary: SigmaP: Python package for predicting sigma70 promoter in Escherichia coli K-12
Project-URL: Homepage, https://github.com/Goosang-Yu/sigmap
Project-URL: Repository, https://github.com/Goosang-Yu/sigmap
Project-URL: Source, https://github.com/Goosang-Yu/sigmap
Project-URL: Tracker, https://github.com/Goosang-Yu/sigmap/issues
Author-email: Goosang Yu <gsyu93@gmail.com>
License-File: LICENSE
Keywords: analysis,bacteria,bioinformatics,genetics,machine-learning,promoter,python
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Science/Research
Classifier: Operating System :: MacOS :: MacOS X
Classifier: Operating System :: Microsoft :: Windows
Classifier: Operating System :: POSIX
Classifier: Operating System :: Unix
Classifier: Programming Language :: Python
Classifier: Programming Language :: Python :: 3 :: Only
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Topic :: Software Development
Classifier: Topic :: Software Development :: Libraries
Classifier: Topic :: Software Development :: Libraries :: Application Frameworks
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Requires-Python: >=3.7
Requires-Dist: pandas
Requires-Dist: scikit-learn
Description-Content-Type: text/markdown

# SigmaP
Python package for sigma70 promoter prediction. This package uses Sigma70Pred [(Patiyal et al. 2022)](https://www.frontiersin.org/journals/microbiology/articles/10.3389/fmicb.2022.1042127/full).

### Installation
Install the package with pip:
```bash
pip install sigmap
```

### How to use
First, prepare a FASTA file containing the DNA sequences to analyze. The minimum length for prediction is 81 nt. Next, create a `SigmaFactor` object and call its `.predict` method on the file to calculate the probability scores. The results are returned as a `pd.DataFrame`.
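```python
from sigmap import SigmaFactor

# Create the predictor object
sigma = SigmaFactor()

# Run the prediction model on a FASTA file; the result is a pd.DataFrame
df_out = sigma.predict('tutorial/example_seq.fa')
```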

| ID      | Sequence                                          | Probability Score | Prediction   |
| ------- | ------------------------------------------------- | ----------------- | ------------ |
| \>Seq_1 | TAGCACGACGATAATATAAACGCAGCAAAAAAAAAAAAAAAAAAAA... | 0.145             | Non-Promoter |
| \>Seq_2 | AGCTTGCGTCAATGGGCAAGGTGGGCTTGCATTTGCTTAATAGAAA... | 0.478             | Promoter     |
| \>Seq_3 | TCGTTTTATTTCTTTTTTCTCCATTGAACTTTCAGTTTCTTTTCTA... | 0.692             | Promoter     |
| \>Seq_4 | CGCAGCGGGTTTACCCTCTGACCGTTTCTGTTACGAAGGCTTTTTA... | 0.216             | Non-Promoter |
| \>Seq_5 | TGCTGCTTGGTCTGTGGGTTGCCGCACAGGTTGCCGGTTCCACCAA... | 0.162             | Non-Promoter |
| \>Seq_6 | GAATCCAACTAATGTTGTAAACTGGCAAGGTAATGTCATTAGTCAT... | 0.418             | Promoter     |
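
Because the minimum length for prediction is 81 nt, it may be useful to check the input sequences before calling `.predict`. The sketch below is not part of sigmap. `read_fasta` and `find_short_records` are hypothetical helpers written only for this illustration, and they use nothing beyond the Python standard library. How sigmap itself handles sequences shorter than 81 nt is not covered here.

```python
MIN_LENGTH = 81  # minimum sequence length for prediction, as stated above


def read_fasta(lines):
    """Parse FASTA-formatted lines into a dict of {record ID: sequence}.

    Hypothetical helper for illustration; not part of sigmap.
    """
    records = {}
    current_id = None
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # Header line: start a new record, dropping the leading '>'
            current_id = line[1:].strip()
            records[current_id] = []
        elif current_id is not None:
            # Sequence line: a record may span several lines
            records[current_id].append(line)
    return {seq_id: "".join(parts) for seq_id, parts in records.items()}


def find_short_records(records, min_length=MIN_LENGTH):
    """Return the IDs of records shorter than min_length."""
    return [seq_id for seq_id, seq in records.items() if len(seq) < min_length]


# Made-up example input (not real promoter data)
example = [
    ">Seq_ok",
    "A" * 50,
    "C" * 31,      # continues on a second line: 81 nt in total
    ">Seq_short",
    "ACGT" * 20,   # 80 nt, one below the minimum
]

records = read_fasta(example)
short_ids = find_short_records(records)
print(short_ids)  # prints ['Seq_short']
```

To check a real file, pass an open file handle instead of a list, for example `with open('tutorial/example_seq.fa') as handle: records = read_fasta(handle)`.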

Contact: Goosang Yu (gsyu93@gmail.com)