Metadata-Version: 2.1
Name: mccoy
Version: 0.1.1
Summary: 
Author: Wytamma Wirth
Author-email: wytamma.wirth@me.com
Requires-Python: >=3.7,<4.0
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Requires-Dist: snakemake (>=7.5.0,<8.0.0)
Requires-Dist: tqdm (>=4.64.0,<5.0.0)
Requires-Dist: typer (>=0.4.1,<0.5.0)
Description-Content-Type: text/markdown

# Current workflow

[![pre-commit.ci status](https://results.pre-commit.ci/badge/github/smutch/mccoy/main.svg)](https://results.pre-commit.ci/latest/github/smutch/mccoy/main)
[![docs](https://github.com/smutch/mccoy/actions/workflows/docs.yaml/badge.svg?event=push)](https://github.com/smutch/mccoy/actions/workflows/docs.yaml)


This will be updated as pieces are developed and modified.

```mermaid
%%{init: { 'theme':'neutral' } }%%
flowchart TB
    sources[(fasta files)]
    sources --> combine --> MSA
    click combine href "https://github.com/smutch/mccoy/blob/main/mccoy/workflow/rules/combine.smk"

    MSA[multiple sequence alignment<br/>-- MAFFT] --> QC
    click MSA href "https://github.com/smutch/mccoy/blob/main/mccoy/workflow/rules/align.smk"

    subgraph QC["Quality control"]
        tree[L_max tree<br/>-- iqtree2] --> RTR[root-tip regression]
        click tree href "https://github.com/smutch/mccoy/blob/main/mccoy/workflow/rules/tree.smk"
        otherQC[other checks]
    end

    QC --> XML[Beast XML generation<br/>-- Wytamma's scripts + templates + FEAST] --> OnlineBEAST[run, pause & update BEAST analysis<br/>-- Online BEAST] .-> Beastiary[monitor running BEAST jobs<br/>-- Beastiary]
    click XML href "https://github.com/smutch/mccoy/blob/main/mccoy/workflow/rules/beast.smk"
    click OnlineBEAST href "https://github.com/Wytamma/online-beast"
    click Beastiary href "https://github.com/Wytamma/beastiary"

    classDef complete fill:#48b884;
    class gisaid,GISAIDR,sources,combine,MSA,tree,RTR,XML complete;

    classDef inProg fill:#cc8400;
    class otherQC,OnlineBEAST inProg;
```

# Instructions

Ensure you have [mamba](https://github.com/conda-forge/miniforge) (conda will work too, but mamba is strongly preferred), and [poetry](https://python-poetry.org) installed.

## Step 1 - install the workflow

```bash
poetry install
```

To start using McCoy, you can either spawn a new shell with the McCoy Poetry environment enabled:

```bash
poetry shell
```

**or** you can replace every instance of `mccoy` in the commands below with `poetry run mccoy`.

The workflow is being developed such that all required software will be automatically installed for each step of the pipeline in self-contained conda environments. These environments will be cached and reused whenever possible (all handled internally by snakemake), but if you want to remove them then they can be found in `.snakemake`.

## Step 2 - Create a McCoy project

First begin by creating a new McCoy project (called `test` in this example):

```bash
mccoy create test --reference resources/reference.fasta --template resources/templates/CoV_CE_fixed_clock_template.xml
```

The `reference` and `template` options are required. At the moment we are distributing these reference and template files, however, once we reach v1.0, these will be removed and the useer will have to ensure they have appropriate reference and template files available.

The config for this project can be altered by editing the newly created file `test/config.yaml`.

## Step 3 - Run the project!

To run the newly created project:

```bash
mccoy run test --data resources/omicron_test-original.fasta
```

Again, the `data` option here is required. This command will create a new directory in `test/runs` with the workflow results and output.

## Step 4 - Add new data

Subsequent calls to `mccoy run` will result in a whole new run of the pipeline from start-to-finsh unless the `--inherit` or `--inherit-last` flags are used. See `mccoy run --help` for more information. Inheriting from a previous run will use the data and MCMC state as a starting point for the new data.

```bash
mccoy run test --data resources/omicron_test-extra.fasta --inherit-last
```

As well as directly altering a project's `config.yaml`, config variables can be overridden on the command line. e.g.:
```bash
mccoy run --data resources/omicron_test-original.fasta --config align='{mafft: ["--6merpair", "--addfragments"]}'
```

Any options passed to `mccoy run` that are not listed in `mccoy run --help` will be directly forwarded on to Snakemake. See `mccoy run --help-snakemake` for a list of all available options.

