Metadata-Version: 2.1
Name: json-streams
Version: 0.13.0
Summary: Stream JSON and JSON-Lines lazily.
Project-URL: Homepage, https://github.com/spraakbanken/json-streams-py
Project-URL: Repository, https://github.com/spraakbanken/json-streams-py
Project-URL: Bug Tracker, https://github.com/spraakbanken/json-streams-py/issues
Author-email: Kristoffer Andersson <kristoffer.andersson@gu.se>, Språkbanken at Göteborgs universitet <sb-info@svenska.gu.se>
License-Expression: MIT
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: POSIX
Classifier: Operating System :: Unix
Classifier: Programming Language :: Python :: Implementation :: CPython
Classifier: Topic :: Utilities
Requires-Python: >=3.9
Requires-Dist: ijson>=3.2.3
Provides-Extra: ci
Requires-Dist: ruff>=0.1.1; extra == 'ci'
Provides-Extra: orjson
Requires-Dist: orjson>=3.9.9; extra == 'orjson'
Description-Content-Type: text/markdown

# json-streams

[![codecov](https://codecov.io/gh/spraakbanken/json-streams-py/branch/main/graph/badge.svg)](https://codecov.io/gh/spraakbanken/json-streams-py/)
[![CI](https://github.com/spraakbanken/json-streams-py/workflows/CI/badge.svg)](https://github.com/spraakbanken/json-streams-py/actions)
[![PyPI status](https://badge.fury.io/py/json-streams.svg)](https://pypi.org/project/json-streams/)
[![image](https://img.shields.io/pypi/pyversions/json-streams.svg)](https://pypi.python.org/pypi/json-streams)

Read and write JSON lazy, especially json-arrays.

Handles both the JSON format:

```json
[
  {
    "a": 1
  },
  {
    "a": 2
  }
]
```

As well as JSON LINES format:

```json
{"a":1}
{"a": 2}
```

Also supports streaming from gzipped files.

Uses `orjson` if present, otherwise standard `json`.

## Usage

### Installation

```bash
# Using standard json
pip install json-streams

# Using orjson
pip install json-streams[orjson]

```

### Note

This library prefers files opened in binary mode.
Therefore does all `dumps`-methods return `bytes`.

All `loads` methods handles `str`, `bytes` and `bytesarray` arguments.

### Examples

Allows you to use `json.load` and `json.dump` with
both json and json-lines files as well as dumping generators.

```python
import json_streams

# This command tries to guess format and opens the file
data = json_streams.load_from_file("data.json") # or data.jsonl

# Write to file, again guessing format
json_streams.dump_to_file(data, "data.jsonl")
```

```python
from json_streams import json_iter, jsonl_iter

# Open and read the file without guessing
data = json_iter.load_from_file("data.json")

# Process file

# Write to file without guessing
jsonl_iter.dump_to_file(data, "data.jsonl")
```

```python
import json_streams
def process(data):
    for entry in data:
        # process
        yield entry

def read_process_and_write(filename_in, filename_out):

    json_streams.dump_to_file(
        process(
            json_streams.load_from_file(filename_in)
        ),
        filename_out
    )
```

You can also use json_streams as a sink, that you can send data to.

```python
import json_streams

with open("out.json", "bw") as fp:
  # guessing format
  with json_streams.sink(fp) as sink:
    for data in data_source():
      sink.send(data)
```

## Release Notes

This projects [keeps a CHANGELOG](./CHANGELOG.md).

## Development

After cloning the repo, just run

```
$ make dev
$ make test
```

to setup a virtual environment,
install dev dependencies
and run the unit tests.

_Note:_ If you run the command in a activated virtual environment,
that environment is used instead.

## Deployment

Push a tag in the format `v\d+.\d+.\d+`to `main`-branch, to build & publish package to PyPi.
