Metadata-Version: 2.1
Name: blastbesties
Version: 1.1.2
Summary: Rapid discovery of reciprocal best blast pairs from BLAST output files.
Author-email: Adam Taranto <adam.p.taranto@gmail.com>
License: MIT
Project-URL: homepage, https://github.com/adamtaranto/blast-besties
Project-URL: documentation, https://github.com/adamtaranto/blast-besties
Project-URL: repository, https://github.com/adamtaranto/blast-besties
Classifier: Programming Language :: Python :: 3
Classifier: Intended Audience :: Science/Research
Classifier: Topic :: Scientific/Engineering :: Bio-Informatics
Classifier: License :: OSI Approved :: MIT License
Requires-Python: >=3.7
Description-Content-Type: text/markdown
License-File: LICENSE

<a href="https://opensource.org/licenses/MIT">
<img src="https://img.shields.io/badge/License-MIT-yellow.svg" align="left" height="20"/>
</a> 
<br clear="left"/>


# Blast-Besties

Rapid discovery of reciprocal best blast pairs from BLAST output files.  


## Table of contents
- [Blast-Besties](#blast-besties)
  - [Table of contents](#table-of-contents)
  - [Install Blast-Besties](#install-blast-besties)
  - [Example usage](#example-usage)
    - [Input requirements](#input-requirements)
    - [Standard options](#standard-options)
  - [License](#license)

## Install Blast-Besties

There are 3 options available for installing Blast-Besties locally:

1) Clone from this repository and install as a local Python package.

```bash
git clone https://github.com/Adamtaranto/blast-besties.git && cd blast-besties && pip install -e .
```

1) Pip install directly from this git repository.

```bash
pip install git+https://github.com/Adamtaranto/blast-besties.git
```

3) Install from PyPi.

```bash
pip install blastbesties
```

## Example usage

Run BLASTp for each protein set AvsB and BvsA.
Require vaild alignments to cover 90% of the query sequence and with an e-value of < 0.001.

```bash
blastp -qcov_hsp_perc 90 -query A_prot.fa -subject B_prot.fa -out AvB.tab -evalue 0.001 -outfmt 6 -use_sw_tback
blastp -qcov_hsp_perc 90 -query B_prot.fa -subject A_prot.fa -out BvA.tab -evalue 0.001 -outfmt 6 -use_sw_tback
```

Report reciprocal best match pairs where each hit meets criteria e-value 
<= 0.001, hit-length >= 40, bitscore >= 100.  

```bash
blastbesties -e 0.001 -l 40 -s 100 -a AvB.tab -b BvA.tab -o pairs.tab
```

### Input requirements

  - Takes two BLAST output files as input. SetA vs SetB, and SetB vs SetA.
  - BLAST output format 6 (tabular)
  - Hits must be sorted by query name then descending match quality - this is default BLAST behaviour.

### Standard options

```
Usage: blastbesties [-h] [-v] -a BLASTAVB -b BLASTBVA [-l MINLEN] [-e EVAL]
                     [-s BITSCORE] [-o OUTFILE] [-d OUTDIR]

Options:

  # Info
  -h, --help        Show this help message and exit.
  -v, --version     Show program's version number and exit.
  
  # Input 
  -a, --blastAvB    Blast tab result file for fastaA query against fastaB
                    subject.
  -b, --blastBvA    Blast tab result file for fastaB query against fastaA
                    subject.
  
  # Settings
  -l, --minLen      Minimum length of hit to consider valid.
  -e, --eVal        Maximum e-value to consider valid pair.
  -s, --bitScore    Minimum bitscore to consider valid pair.

  # Output
  -o, --outFile     Write reciprocal blast pairs to this file.
  -d, --outDir      Directory for new sequence files to be written to.
```

## License

Software provided under MIT license.

