Metadata-Version: 2.1
Name: cptac
Version: 0.5.2
Summary: Python packaging for CPTAC data
Home-page: http://github.com/PayneLab/cptac
Author: Dr. Samuel Payne
Author-email: sam_payne@byu.edu
License: Apache 2.0
Project-URL: Documentation, https://github.com/PayneLab/cptac/tree/master/docs
Keywords: bioinformatics cancer proteomics genomics open science open data
Platform: UNKNOWN
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Science/Research
Classifier: Topic :: Scientific/Engineering :: Bio-Informatics
Classifier: License :: OSI Approved :: Apache Software License
Requires-Python: >=3.6.*
Description-Content-Type: text/markdown
Requires-Dist: numpy (>=1.16.3)
Requires-Dist: pandas (==0.24.2)
Requires-Dist: requests (>=2.21.0)
Requires-Dist: beautifulsoup4 (>=4.7.1)
Requires-Dist: scipy (>=1.2.1)
Requires-Dist: urllib3 (>=1.24.2)

# cptac
This project is intended to facilitate accessing and interacting with cancer data from the National Cancer Institute CPTAC consortium, which characterizes and studies the proteogenomic landscape of tumors. Currently, the datasets available are: endometrial cancer, ovarian cancer and colon cancer. These cancer studies are downloadable via our Python package as native dataframe objects and can therefore be integrated very quickly and easily with other Python-based data analysis tools. Follow our walkthrough tutorials for a demonstration of ways to use our system.

Setup instructions can be found in <code>doc/setup.md</code>

## Tutorials
Tutorials for this package describe how to use the package functions for research with the provided data. All the tutorials are written in Python using the interactive Jupyter notebooks. If you are unfamiliar with Jupyter, follow the instructions given at jupyter.org/install. You will then be able to run our tutorials as interactive, exploratory data analyses.
<ul>
  <li>Tutorial 1: CPTAC data introduction</li>
  <li>Tutorial 2: Using pandas to work with cptac dataframes</li>
  <li>Tutorial 3: Joining dataframes with cptac</li>
  <li>Use Case 1: Comparing transcriptomics and proteomics for a single gene</li>
  <li>Use Case 2: Looking for correlation between clinical attributes</li>
  <li>Use Case 3: Associating clinical variables with acetylation</li>
</ul>

## Requirements
This package is intended to run on Python 3.6 with pandas 0.23.4. In the tutorials, we use seaborn 0.9.0 for data visualization. 

## License
This package contains LICENSE.md document which describes the license for use. Please note the difference between the license as it applies to code versus data.


