Metadata-Version: 2.4
Name: GEOdb
Version: 0.2.0.1
Summary: An asynchronous Python library for the Gene Expression Omnibus (GEO) database
Author-email: KumaTea <KumaTea@outlook.com>
Maintainer-email: KumaTea <KumaTea@outlook.com>
License-Expression: GPL-3.0-or-later
Project-URL: Homepage, https://github.com/KumaTea
Project-URL: Repository, https://github.com/KumaTea/GEOdb
Keywords: bioinformatics,gene-expression-omnibus
Classifier: Development Status :: 3 - Alpha
Classifier: Intended Audience :: Developers
Classifier: Programming Language :: Python
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Programming Language :: Python :: 3.13
Classifier: Programming Language :: Python :: Implementation
Classifier: Programming Language :: Python :: Implementation :: CPython
Classifier: Programming Language :: Python :: Implementation :: PyPy
Classifier: Topic :: Scientific/Engineering
Classifier: Topic :: Scientific/Engineering :: Bio-Informatics
Classifier: Topic :: Software Development :: Libraries
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Requires-Python: >=3.8
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: aiohttp>=3.10.11
Requires-Dist: beautifulsoup4>=4.14.3
Requires-Dist: tqdm>=4.67.1
Dynamic: license-file

# GEOdb
An asynchronous Python library for the Gene Expression Omnibus (GEO) database

## Usage

### parser

`from GEOdb.parser.series import parse_file, parse_item`

Either parse a file:

`parse_file('/path/to/GEO/downloaded.txt')`

Or parse a text:

```python
t = """1. Transcriptional profiling of human KIR+ CD8 T cells
(Submitter supplied) This SuperSeries is composed of the SubSeries listed below.
Organism:	Homo sapiens
Type:		Expression profiling by high throughput sequencing; Other
Platforms: GPL20301 GPL18573 4548 Samples
FTP download: GEO (TXT) ftp://ftp.ncbi.nlm.nih.gov/geo/series/GSE193nnn/GSE193442/
Series		Accession: GSE193442	ID: 200193442"""

parse_item(t)
```

```
GEOSeriesInfo(title='Transcriptional profiling of human KIR+ CD8 T cells',
              link='https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE193442',
              url='https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE193442',
              summary='(Submitter supplied) This SuperSeries is composed of '
                      'the SubSeries listed below.',
              organism='Homo sapiens',
              type='Expression profiling by high throughput sequencing; Other',
              platform='GPL20301',
              samples=4548,
              id='GSE193442',
              accession='GSE193442',
              series_id=200193442,
              ftp='ftp://ftp.ncbi.nlm.nih.gov/geo/series/GSE193nnn/GSE193442/',
              sra=None)
```
