Metadata-Version: 1.1
Name: targqc
Version: 1.1.0
Summary: Genome capture target coverage evaluation tool
Home-page: https://github.com/vladsaveliev/TargQC
Author: Vlad Saveliev and Alla Mikheenko
Author-email: vladislav.sav@gmail.com
License: GPLv3
Download-URL: https://github.com/vladsaveliev/TargQC/releases
Description: # Alignment target coverage analysis tool 
        
        ## Input
        - BAM file(s) (or FastQ files).
        - BED file (optional).
        
        ## Output
        - `summary.html` – sample-level coverage statistics and plots.
        - `summary.tsv` – sample-level coverage, parsable version
        - `regions.txt` – region-level coverage statistics.
        
        ## Installation
        ```
        git clone --recursive https://github.com/vladsaveliev/TargQC
        cd TargQC
        python setup.py install
        ```
        
        If you are not an administator in your system, create a virtual environment first:
        ```
        virtualenv venv
        source venv/bin/activate
        ```
        
        ## Usage
        ```
        targqc *.bam --bed target.bed -g hg19 -o targqc_results
        ```
        The results will be written to `targqc_results` folder.
        
        The BED file may be omitted. In this case statistics reported will be based of off the whole genome.
        
        ## FastQ and downsampled coverage
        Instead of the BAM files, input FastQ are also allowed. The reads will be aligned by BWA to the reference genome specified by `--bwa-prefix`. Option `--downsample-to N` (default value `5e5`) specifies the number ofread pairs will be randomly selected from each input set. This feature allows to quickly estimate approximate coverage quality before full alignment. To turn downsampling off and align all reads, set `--downsample-to off`.
        ```
        targqc *.fastq --bed target.bed -g hg19 -o targqc_results --bwa-prefix /path/to/ref.bwa
        ```
        
        ## Parallel running
        ### Threads
        Run using 3 threads:
        ```
        targqc *.bam --bed target.bed -g hg19 -o targqc_results -t 3
        ```
        ### Cluster
        Run using 3 jobs, using SGE scheduler, and queue "queue":
        ```
        targqc *.bam --bed target.bed -g hg19 -o targqc_results -t 3 -s sge -q queue -r pename=smp
        ```
        If the number of samples is higher than the requested number of jobs, the processes within job will be additionally parallelized using threads, so the full number of occupied cores will equal the number of requested threads (-t)
        
        Other supported schedulers: Platform LSF ("lsf"), Sun Grid Engine ("sge"), Torque ("torque"), SLURM ("slurm") (see details at https://github.com/roryk/ipython-cluster-helper)
        
        ## For HPC users
        ```
        module load python
        export PATH=/group/ngs/src/TargQC/venv/bin:$PATH
        ```
Keywords: bioinformatics
Platform: UNKNOWN
Classifier: Environment :: Console
Classifier: Environment :: Web Environment
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: GNU General Public License v3 (GPLv3)
Classifier: Natural Language :: English
Classifier: Operating System :: MacOS :: MacOS X
Classifier: Operating System :: POSIX
Classifier: Operating System :: Unix
Classifier: Programming Language :: Python
Classifier: Programming Language :: JavaScript
Classifier: Topic :: Scientific/Engineering
Classifier: Topic :: Scientific/Engineering :: Bio-Informatics
