Metadata-Version: 2.1
Name: json-arrays
Version: 0.14.0
Summary: Stream JSON and JSON-Lines lazily.
Project-URL: Homepage, https://github.com/spraakbanken/json-arrays-py
Project-URL: Repository, https://github.com/spraakbanken/json-arrays-py
Project-URL: Bug Tracker, https://github.com/spraakbanken/json-arrays-py/issues
Author-email: Kristoffer Andersson <kristoffer.andersson@gu.se>, Språkbanken at Göteborgs universitet <sb-info@svenska.gu.se>
License: MIT
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: POSIX
Classifier: Operating System :: Unix
Classifier: Programming Language :: Python :: Implementation :: CPython
Classifier: Topic :: Utilities
Requires-Python: >=3.9
Requires-Dist: ijson>=3.2.3
Provides-Extra: orjson
Requires-Dist: orjson>=3.9.13; extra == 'orjson'
Description-Content-Type: text/markdown

# json-arrays

[![codecov](https://codecov.io/gh/spraakbanken/json-arrays-py/branch/main/graph/badge.svg)](https://codecov.io/gh/spraakbanken/json-arrays-py/)
[![CI](https://github.com/spraakbanken/json-arrays-py/workflows/CI/badge.svg)](https://github.com/spraakbanken/json-arrays-py/actions)
[![PyPI status](https://badge.fury.io/py/json-arrays.svg)](https://pypi.org/project/json-arrays/)

Read and write JSON lazy, especially json-arrays.

Handles both the JSON format:

```json
[
  {
    "a": 1
  },
  {
    "a": 2
  }
]
```

As well as JSON LINES format:

```json
{"a":1}
{"a": 2}
```

Also supports streaming from gzipped files.

Uses `orjson` if present, otherwise standard `json`.

## Usage

### Installation

```bash
# Using standard json
pip install json-arrays

# Using orjson
pip install json-arrays[orjson]

```

### Note

This library prefers files opened in binary mode.
Therefore does all `dumps`-methods return `bytes`.

All `loads` methods handles `str`, `bytes` and `bytesarray` arguments.

### Examples

Allows you to use `json.load` and `json.dump` with
both json and json-lines files as well as dumping generators.

```python
import json_arrays

# This command tries to guess format and opens the file
data = json_arrays.load_from_file("data.json") # or data.jsonl

# Write to file, again guessing format
json_arrays.dump_to_file(data, "data.jsonl")
```

```python
from json_arrays import json_iter, jsonl_iter

# Open and read the file without guessing
data = json_iter.load_from_file("data.json")

# Process file

# Write to file without guessing
jsonl_iter.dump_to_file(data, "data.jsonl")
```

```python
import json_arrays
def process(data):
    for entry in data:
        # process
        yield entry

def read_process_and_write(filename_in, filename_out):

    json_arrays.dump_to_file(
        process(
            json_arrays.load_from_file(filename_in)
        ),
        filename_out
    )
```

You can also use json_arrays as a sink, that you can send data to.

```python
import json_arrays

with open("out.json", "bw") as fp:
  # guessing format
  with json_arrays.sink(fp) as sink:
    for data in data_source():
      sink.send(data)
```

## Release Notes

This projects [keeps a CHANGELOG](./CHANGELOG.md).

## Development

This project uses [rye](https://rye-up.com).
After cloning the repo, just run

```bash
make dev
make test
```

to setup a virtual environment,
install dev dependencies
and run the unit tests.

_Note:_ If you run the command in a activated virtual environment,
that environment is used instead.

## Deployment

Push a tag in the format `v\d+.\d+.\d+`to `main`-branch, to build & publish package to PyPi.
