Metadata-Version: 2.2
Name: oscb
Version: 0.1.2
Summary: OSCB aims to provide automated end-to-end single-cell analyses ML pipelines to simplify and standardize the process of single-cell data formatting, quality control, loading, model development, and model evaluation. 
Home-page: https://github.com/cirisjl/Machine-learning-development-environment-for-single-cell-sequencing-data-analyses
Author: Lei Jiang
Author-email: leijiang@missouri.edu
License: MIT
Keywords: single-cell,benchmarks
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Requires-Python: >=3.6, <=3.12
Description-Content-Type: text/markdown
Requires-Dist: leidenalg>=0.8.10
Requires-Dist: matplotlib>=3.5.1
Requires-Dist: networkx>=2.6.3
Requires-Dist: numpy>=1.26.4
Requires-Dist: pandas>=1.3.5
Requires-Dist: python_igraph>=0.9.9
Requires-Dist: python_louvain>=0.16
Requires-Dist: scanpy
Requires-Dist: muon
Requires-Dist: mudata
Requires-Dist: tqdm
Requires-Dist: requests
Requires-Dist: scib
Requires-Dist: zss
Requires-Dist: grakel
Requires-Dist: scikit_learn>=1.0.2
Requires-Dist: scipy>=1.7.3
Requires-Dist: umap_learn>=0.5.2
Dynamic: author
Dynamic: author-email
Dynamic: classifier
Dynamic: description
Dynamic: description-content-type
Dynamic: home-page
Dynamic: keywords
Dynamic: license
Dynamic: requires-dist
Dynamic: requires-python
Dynamic: summary

# Overview

--------------------------------------------------------------------------------


Machine learning (ML) is transforming single-cell sequencing data analysis; however, the barriers of technology complexity and biology knowledge remain challenging for the involvement of the ML community in single-cell data analysis. We present an ML development environment for single-cell sequencing data analyses with a diverse set of AI-Ready benchmark datasets. A cloud-based platform is built to dynamically scale workflows for collecting, processing, and managing various single-cell sequencing data to make them ML-ready. In addition, benchmarks for each problem formulation and a code-level and web-interface IDE for single-cell analysis method development are provided. 


![Workflow](https://oscb.missouri.edu/assets/30e6000a-5e6f-440f-bec5-5c7ceb256c55)

OSCB aims to provide automated end-to-end single-cell analyses ML pipelines to simplify and standardize the process of single-cell data formatting, quality control, loading, model development, and model evaluation. 


**Workflows** are developed for collecting, processing, and managing diverse single-cell sequencing data to make them ML-ready and build benchmarks.

**IDE** is provided for supporting partial method development.

**Assessment utilities** are provided for evaluating results and report generation.

This **end-to-end pipeline** transforms the traditional “static” machine Learning into **continuous learning** on extensive new data.


By **in-depth fusing models with data**, this platform could ultimately help many single-cell sequencing researchers substantially.

 
![Tools](https://oscb.missouri.edu/assets/c18ffb2a-814f-452c-921b-e399b99c41b4)


OSCB is an on-going effort, and we are planning to increase our coverage in the future.
