Metadata-Version: 2.1
Name: tableschema
Version: 1.7.2
Summary: A utility library for working with Table Schema in Python
Home-page: https://github.com/frictionlessdata/tableschema-py
Author: Open Knowledge Foundation
Author-email: info@okfn.org
License: MIT
Description: # tableschema-py
        
        [![Travis](https://travis-ci.org/frictionlessdata/tableschema-py.svg?branch=master)](https://travis-ci.org/frictionlessdata/tableschema-py)
        [![Coveralls](http://img.shields.io/coveralls/frictionlessdata/tableschema-py.svg?branch=master)](https://coveralls.io/r/frictionlessdata/tableschema-py?branch=master)
        [![PyPi](https://img.shields.io/pypi/v/tableschema.svg)](https://pypi.python.org/pypi/tableschema)
        [![SemVer](https://img.shields.io/badge/versions-SemVer-brightgreen.svg)](http://semver.org/)
        [![Gitter](https://img.shields.io/gitter/room/frictionlessdata/chat.svg)](https://gitter.im/frictionlessdata/chat)
        
        A library for working with [Table Schema](http://specs.frictionlessdata.io/table-schema/) in Python.
        
        ## Features
        
        - `Table` to work with data tables described by Table Schema
        - `Schema` representing Table Schema
        - `Field` representing Table Schema field
        - `validate` to validate Table Schema
        - `infer` to infer Table Schema from data
        - built-in command-line interface to validate and infer schemas
        - storage/plugins system to connect tables to different storage backends like SQL Database
        
        ## Contents
        
        <!--TOC-->
        
          - [Gettings Started](#gettings-started)
            - [Installation](#installation)
            - [Examples](#examples)
          - [Documentation](#documentation)
            - [Table](#table)
            - [Schema](#schema)
            - [Field](#field)
            - [validate](#validate)
            - [infer](#infer)
            - [Exceptions](#exceptions)
            - [Storage](#storage)
            - [Plugins](#plugins)
            - [CLI](#cli)
          - [Contributing](#contributing)
          - [Changelog](#changelog)
        
        <!--TOC-->
        
        ## Gettings Started
        
        ### Installation
        
        The package use semantic versioning. It means that major versions  could include breaking changes. It's highly recommended to specify `tableschema` version range in your `setup/requirements` file e.g. `tableschema>=1.0,<2.0`.
        
        ```bash
        $ pip install tableschema
        ```
        
        ### Examples
        
        Code examples in this readme requires Python 3.4+ interpreter. You could see even more example in [examples](https://github.com/frictionlessdata/tableschema-py/tree/master/examples) directory.
        
        ```python
        from tableschema import Table
        
        # Create table
        table = Table('path.csv', schema='schema.json')
        
        # Print schema descriptor
        print(table.schema.descriptor)
        
        # Print cast rows in a dict form
        for keyed_row in table.iter(keyed=True):
            print(keyed_row)
        ```
        
        ## Documentation
        
        ### Table
        
        A table is a core concept in a tabular data world. It represents data with metadata (Table Schema). Let's see how we can use it in practice.
        
        Consider we have some local csv file. It could be inline data or from a remote link - all supported by the `Table` class (except local files for in-brower usage of course). But say it's `data.csv` for now:
        
        ```csv
        city,location
        london,"51.50,-0.11"
        paris,"48.85,2.30"
        rome,N/A
        ```
        
        Let's create and read a table instance. We use the static `Table.load` method and the `table.read` method with the `keyed` option to get an array of keyed rows:
        
        ```python
        table = Table('data.csv')
        table.headers # ['city', 'location']
        table.read(keyed=True)
        # [
        #   {city: 'london', location: '51.50,-0.11'},
        #   {city: 'paris', location: '48.85,2.30'},
        #   {city: 'rome', location: 'N/A'},
        # ]
        ```
        
        As we can see, our locations are just strings. But they should be geopoints. Also, Rome's location is not available, but it's just a string `N/A` instead of `None`. First we have to infer Table Schema:
        
        ```python
        table.infer()
        table.schema.descriptor
        # { fields:
        #   [ { name: 'city', type: 'string', format: 'default' },
        #     { name: 'location', type: 'geopoint', format: 'default' } ],
        #  missingValues: [ '' ] }
        table.read(keyed=True)
        # Fails with a data validation error
        ```
        
        Let's fix the "not available" location. There is a `missingValues` property in Table Schema specification. As a first try we set `missingValues` to `N/A` in `table.schema.descriptor`. The schema descriptor can be changed in-place, but all changes should also be committed using `table.schema.commit()`:
        
        ```python
        table.schema.descriptor['missingValues'] = 'N/A'
        table.schema.commit()
        table.schema.valid # false
        table.schema.errors
        # [<ValidationError: "'N/A' is not of type 'array'">]
        ```
        
        As a good citizens we've decided to check our schema descriptor's validity. And it's not valid! We should use an array for the `missingValues` property. Also, don't forget to include "empty string" as a valid missing value:
        
        ```python
        table.schema.descriptor['missingValues'] = ['', 'N/A']
        table.schema.commit()
        table.schema.valid # true
        ```
        
        All good. It looks like we're ready to read our data again:
        
        ```python
        table.read(keyed=True)
        # [
        #   {city: 'london', location: [51.50,-0.11]},
        #   {city: 'paris', location: [48.85,2.30]},
        #   {city: 'rome', location: null},
        # ]
        ```
        
        Now we see that:
        - locations are arrays with numeric latitude and longitude
        - Rome's location is a native Python `None`
        
        And because there are no errors after reading, we can be sure that our data is valid against our schema. Let's save it:
        
        ```python
        table.schema.save('schema.json')
        table.save('data.csv')
        ```
        
        Our `data.csv` looks the same because it has been stringified back to `csv` format. But now we have `schema.json`:
        
        ```json
        {
            "fields": [
                {
                    "name": "city",
                    "type": "string",
                    "format": "default"
                },
                {
                    "name": "location",
                    "type": "geopoint",
                    "format": "default"
                }
            ],
            "missingValues": [
                "",
                "N/A"
            ]
        }
        
        ```
        
        If we decide to improve it even more we could update the schema file and then open it again. But now providing a schema path:
        
        ```python
        table = Table('data.csv', schema='schema.json')
        # Continue the work
        ```
        
        This is a basic introduction to the `Table` class. To learn more let's take a look at the `Table` class API reference.
        
        #### `Table(source, schema=None, strict=False, post_cast=[], storage=None, **options)`
        
        Constructor to instantiate `Table` class. If `references` argument is provided, foreign keys will be checked on any reading operation.
        
        - `source (str/list[])` - data source (one of):
          - local file (path)
          - remote file (url)
          - array of arrays representing the rows
        - `schema (any)` - data schema in all forms supported by `Schema` class
        - `strict (bool)` - strictness option to pass to `Schema` constructor
        - `post_cast (function[])` - list of post cast processors
        - `storage (None/str)` - storage name like `sql` or `bigquery`
        - `options (dict)` - `tabulator` or storage options
        - `(exceptions.TableSchemaException)` - raises any error that occurs in table creation process
        - `(Table)` - returns data table class instance
        
        #### `table.headers`
        
        - `(str[])` - returns data source headers
        
        #### `table.schema`
        
        - `(Schema)` - returns schema class instance
        
        #### `table.iter(keyed=Fase, extended=False, cast=True, relations=False)`
        
        Iterates through the table data and emits rows cast based on table schema. Data casting can be disabled.
        
        - `keyed (bool)` - iterate keyed rows
        - `extended (bool)` - iterate extended rows
        - `cast (bool)` - disable data casting if false
        - `relations (dict)` - dictionary of foreign key references in a form of `{resource1: [{field1: value1, field2: value2}, ...], ...}`. If provided, foreign key fields will checked and resolved to their references
        - `(exceptions.TableSchemaException)` - raises any error that occurs during this process
        - `(any[]/any{})` - yields rows:
          - `[value1, value2]` - base
          - `{header1: value1, header2: value2}` - keyed
          - `[rowNumber, [header1, header2], [value1, value2]]` - extended
        
        #### `table.read(keyed=False, extended=False, cast=True, relations=False, limit=None)`
        
        Read the whole table and returns as array of rows. Count of rows could be limited.
        
        - `keyed (bool)` - flag to emit keyed rows
        - `extended (bool)` - flag to emit extended rows
        - `cast (bool)` - flag to disable data casting if false
        - `relations (dict)` - dict of foreign key references in a form of `{resource1: [{field1: value1, field2: value2}, ...], ...}`. If provided foreign key fields will checked and resolved to its references
        - `limit (int)` - integer limit of rows to return
        - `(exceptions.TableSchemaException)` - raises any error that occurs during this process
        - `(list[])` - returns array of rows (see `table.iter`)
        
        #### `table.infer(limit=100, confidence=0.75)`
        
        Infer a schema for the table. It will infer and set Table Schema to `table.schema` based on table data.
        
        - `limit (int)` - limit rows sample size
        - `confidence (float)` - how many casting errors are allowed (as a ratio, between 0 and 1)
        - `(dict)` - returns Table Schema descriptor
        
        #### `table.save(target, storage=None, **options)`
        
        > To save schema use `table.schema.save()`
        
        Save data source to file locally in CSV format with `,` (comma) delimiter
        
        - `target (str)` - saving target (e.g. file path)
        - `storage (None/str)` - storage name like `sql` or `bigquery`
        - `options (dict)` - `tabulator` or storage options
        - `(exceptions.TableSchemaException)` - raises an error if there is saving problem
        - `(True/Storage)` - returns true or storage instance
        
        ### Schema
        
        A model of a schema with helpful methods for working with the schema and supported data. Schema instances can be initialized with a schema source as a url to a JSON file or a JSON object. The schema is initially validated (see [validate](#validate) below). By default validation errors will be stored in `schema.errors` but in a strict mode it will be instantly raised.
        
        Let's create a blank schema. It's not valid because `descriptor.fields` property is required by the [Table Schema](http://specs.frictionlessdata.io/table-schema/) specification:
        
        ```python
        schema = Schema()
        schema.valid # false
        schema.errors
        # [<ValidationError: "'fields' is a required property">]
        ```
        
        To avoid creating a schema descriptor by hand we will use a `schema.infer` method to infer the descriptor from given data:
        
        ```python
        schema.infer([
          ['id', 'age', 'name'],
          ['1','39','Paul'],
          ['2','23','Jimmy'],
          ['3','36','Jane'],
          ['4','28','Judy'],
        ])
        schema.valid # true
        schema.descriptor
        #{ fields:
        #   [ { name: 'id', type: 'integer', format: 'default' },
        #     { name: 'age', type: 'integer', format: 'default' },
        #     { name: 'name', type: 'string', format: 'default' } ],
        #  missingValues: [ '' ] }
        ```
        
        Now we have an inferred schema and it's valid. We can cast data rows against our schema. We provide a string input which will be cast correspondingly:
        
        ```python
        schema.cast_row(['5', '66', 'Sam'])
        # [ 5, 66, 'Sam' ]
        ```
        
        But if we try provide some missing value to the `age` field, the cast will fail because the only valid "missing" value is an empty string. Let's update our schema:
        
        ```python
        schema.cast_row(['6', 'N/A', 'Walt'])
        # Cast error
        schema.descriptor['missingValues'] = ['', 'N/A']
        schema.commit()
        schema.cast_row(['6', 'N/A', 'Walt'])
        # [ 6, None, 'Walt' ]
        ```
        
        We can save the schema to a local file, and resume work on it at any time by loading it from that file:
        
        ```python
        schema.save('schema.json')
        schema = Schema('schema.json')
        ```
        
        This was a basic introduction to the `Schema` class. To learn more, let's take a look at the `Schema` API reference.
        
        #### `Schema(descriptor, strict=False)`
        
        Constructor to instantiate `Schema` class.
        
        - `descriptor (str/dict)` - schema descriptor:
          -  local path
          -  remote url
          -  dictionary
        - `strict (bool)` - flag to specify validation behaviour:
          - if false, errors will not be raised but instead collected in `schema.errors`
          - if true, validation errors are raised immediately
        - `(exceptions.TableSchemaException)` - raise any error that occurs during the process
        - `(Schema)` - returns schema class instance
        
        #### `schema.valid`
        
        - `(bool)` - returns validation status. Always true in strict mode.
        
        #### `schema.errors`
        
        - `(Exception[])` - returns validation errors. Always empty in strict mode.
        
        #### `schema.descriptor`
        
        - `(dict)` - returns schema descriptor
        
        #### `schema.primary_key`
        
        - `(str[])` - returns schema primary key
        
        #### `schema.foreign_keys`
        
        - `(dict[])` - returns schema foreign keys
        
        #### `schema.fields`
        
        - `(Field[])` - returns an array of `Field` instances
        
        #### `schema.field_names`
        
        - `(str[])` - returns an array of field names.
        
        #### `schema.get_field(name)`
        
        Get schema field by name.
        
        Note: use `update_field` if you want to modify the field descriptor
        
        - `name (str)` - schema field name
        - `(Field/None)` - returns `Field` instance or `None` if not found
        
        #### `schema.add_field(descriptor)`
        
        Add new field to schema. The schema descriptor will be validated with newly added field descriptor.
        
        - `descriptor (dict)` - field descriptor
        - `(exceptions.TableSchemaException)` - raises any error that occurs during the process
        - `(Field/None)` - returns added `Field` instance or `None` if not added
        
        #### `schema.update_field(name, update)`
        
        Update existing descriptor field by name
        
        - `name (str)` - schema field name
        - `update (dict)` - update to apply to field's descriptor
        - `(bool)` - returns true on success and false if no field is found to be modified
        
        cf [`schema.commit()`](#schemacommitstrictnone) example
        
        #### `schema.remove_field(name)`
        
        Remove field resource by name. The schema descriptor will be validated after field descriptor removal.
        
        - `name (str)` - schema field name
        - `(exceptions.TableSchemaException)` - raises any error that occurs during the process
        - `(Field/None)` - returns removed `Field` instances or `None` if not found
        
        #### `schema.cast_row(row)`
        
        Cast row based on field types and formats.
        
        - `row (any[])` - data row as an array of values
        - `(any[])` - returns cast data row
        
        #### `schema.infer(rows, headers=1, confidence=0.75, guesser_cls=None, resolver_cls=None)`
        
        Infer and set `schema.descriptor` based on data sample.
        
        - `rows (list[])` - array of arrays representing rows.
        - `headers (int/str[])` - data sample headers (one of):
          - row number containing headers (`rows` should contain headers rows)
          - array of headers (`rows` should NOT contain headers rows)
        - `confidence (float)` - how many casting errors are allowed (as a ratio, between 0 and 1)
        - `guesser_cls` & `resolver_cls` - you can implement inferring strategies by providing type-guessing and type-resolving classes [experimental]
        - `{dict}` - returns Table Schema descriptor
        
        #### `schema.commit(strict=None)`
        
        Update schema instance if there are in-place changes in the descriptor.
        
        - `strict (bool)` - alter `strict` mode for further work
        - `(exceptions.TableSchemaException)` - raises any error that occurs during the process
        - `(bool)` - returns true on success and false if not modified
        
        ```python
        from tableschema import Schema
        descriptor = {'fields': [{'name': 'my_field', 'title': 'My Field', 'type': 'string'}]}
        schema = Schema(descriptor)
        print(schema.get_field('my_field').descriptor['type']) # string
        
        # Update descriptor by field position
        schema.descriptor['fields'][0]['type'] = 'number'
        # Update descriptor by field name
        schema.update_field('my_field', {'title': 'My Pretty Field'}) # True
        
        # Change are not committed
        print(schema.get_field('my_field').descriptor['type']) # string
        print(schema.get_field('my_field').descriptor['title']) # My Field
        
        
        # Commit change
        schema.commit()
        print(schema.get_field('my_field').descriptor['type']) # number
        print(schema.get_field('my_field').descriptor['title']) # My Pretty Field
        
        ```
        
        #### `schema.save(target)`
        
        Save schema descriptor to target destination.
        
        - `target (str)` - path where to save a descriptor
        - `(exceptions.TableSchemaException)` - raises any error that occurs during the process
        - `(bool)` - returns true on success
        
        ### Field
        
        ```python
        from tableschema import Field
        
        # Init field
        field = Field({'name': 'name', 'type': 'number'})
        
        # Cast a value
        field.cast_value('12345') # -> 12345
        ```
        
        Data values can be cast to native Python objects with a Field instance. Type instances can be initialized with [field descriptors](https://specs.frictionlessdata.io/table-schema/). This allows formats and constraints to be defined.
        
        Casting a value will check the value is of the expected type, is in the correct format, and complies with any constraints imposed by a schema. E.g. a date value (in ISO 8601 format) can be cast with a DateType instance. Values that can't be cast will raise an `InvalidCastError` exception.
        
        Casting a value that doesn't meet the constraints will raise a `ConstraintError` exception.
        
        Here is an API reference for the `Field` class:
        
        #### `new Field(descriptor, missingValues=[''])`
        
        Constructor to instantiate `Field` class.
        
        - `descriptor (dict)` - schema field descriptor
        - `missingValues (str[])` - an array with string representing missing values
        - `(exceptions.TableSchemaException)` - raises any error that occurs during the process
        - `(Field)` - returns field class instance
        
        #### `field.schema`
        
        - `(Schema)` - returns a schema instance if the field belongs to some schema
        
        #### `field.name`
        
        - `(str)` - returns field name
        
        #### `field.type`
        
        - `(str)` - returns field type
        
        #### `field.format`
        
        - `(str)` - returns field format
        
        #### `field.required`
        
        - `(bool)` - returns true if field is required
        
        #### `field.constraints`
        
        - `(dict)` - returns an object with field constraints
        
        #### `field.descriptor`
        
        - `(dict)` - returns field descriptor
        
        #### `field.castValue(value, constraints=true)`
        
        Cast given value according to the field type and format.
        
        - `value (any)` - value to cast against field
        - `constraints (boll/str[])` - gets constraints configuration
          - it could be set to true to disable constraint checks
          - it could be an Array of constraints to check e.g. ['minimum', 'maximum']
        - `(exceptions.TableSchemaException)` - raises any error that occurs during the process
        - `(any)` - returns cast value
        
        #### `field.testValue(value, constraints=true)`
        
        Test if value is compliant to the field.
        
        - `value (any)` - value to cast against field
        - `constraints (bool/str[])` - constraints configuration
        - `(bool)` - returns if value is compliant to the field
        
        ### validate
        
        Given a schema as JSON file, url to JSON file, or a Python dict, `validate` returns true for a valid Table Schema, or raises an exception, `exceptions.ValidationError`. It validates only **schema**, not data against schema!
        
        ```python
        from tableschema import validate, exceptions
        
        try:
            valid = validate(descriptor)
        except exceptions.ValidationError as exception:
           for error in exception.errors:
               # handle individual error
        ```
        
        #### `validate(descriptor)`
        
        Validate a Table Schema descriptor.
        
        - `descriptor (str/dict)` - schema descriptor (one of):
          - local path
          - remote url
          - object
        - (exceptions.ValidationError) - raises on invalid
        - `(bool)` - returns true on valid
        
        ### infer
        
        Given headers and data, `infer` will return a Table Schema as a Python dict based on the data values. Given the data file, `data_to_infer.csv`:
        
        ```
        id,age,name
        1,39,Paul
        2,23,Jimmy
        3,36,Jane
        4,28,Judy
        ```
        
        Let's call `infer` for this file:
        
        ```python
        from tableschema import infer
        
        descriptor = infer('data_to_infer.csv')
        #{'fields': [
        #    {
        #        'format': 'default',
        #        'name': 'id',
        #        'type': 'integer'
        #    },
        #    {
        #        'format': 'default',
        #        'name': 'age',
        #        'type': 'integer'
        #    },
        #    {
        #        'format': 'default',
        #        'name': 'name',
        #        'type': 'string'
        #    }]
        #}
        ```
        
        The number of rows used by `infer` can be limited with the `limit` argument.
        
        #### `infer(source, headers=1, limit=100, confidence=0.75, **options)`
        
        Infer source schema.
        
        - `source (any)` - source as path, url or inline data
        - `headers (int/str[])` - headers rows number or headers list
        - `confidence (float)` - how many casting errors are allowed (as a ratio, between 0 and 1)
        - `(exceptions.TableSchemaException)` - raises any error that occurs during the process
        - `(dict)` - returns schema descriptor
        
        ### Exceptions
        
        #### `exceptions.TableSchemaException`
        
        Base class for all library exceptions. If there are multiple errors, they can be read from the exception object:
        
        ```python
        
        try:
            # lib action
        except exceptions.TableSchemaException as exception:
            if exception.multiple:
                for error in exception.errors:
                    # handle error
        ```
        
        #### `exceptions.LoadError`
        
        All loading errors.
        
        #### `exceptions.ValidationError`
        
        All validation errors.
        
        #### `exceptions.CastError`
        
        All value cast errors.
        
        #### `exceptions.RelationError`
        
        All integrity errors.
        
        #### `exceptions.StorageError`
        
        All storage errors.
        
        ### Storage
        
        The library includes interface declaration to implement tabular `Storage`. This interface allow to use different data storage systems like SQL with `tableschema.Table` class (load/save) as well as on the data package level:
        
        ![Storage](https://raw.githubusercontent.com/frictionlessdata/tableschema-py/master/data/storage.png)
        
        For instantiation of concrete storage instances, `tableschema.Storage` provides a unified factory method `connect` (which uses the plugin system under the hood):
        
        ```python
        # pip install tableschema_sql
        from tableschema import Storage
        
        storage = Storage.connect('sql', **options)
        storage.create('bucket', descriptor)
        storage.write('bucket', rows)
        storage.read('bucket')
        ```
        
        #### `Storage.connect(name, **options)`
        
        Create tabular `storage` based on storage name.
        
        - `name (str)` - storage name like `sql`
        - `options (dict)` - concrete storage options
        - `(exceptions.StorageError)` - raises on any error
        - `(Storage)` - returns `Storage` instance
        
        ---
        
        
        An implementor should follow `tableschema.Storage` interface to write his own storage backend. Concrete storage backends could include additional functionality specific to conrete storage system. See `plugins` below to know how to integrate custom storage plugin into your workflow.
        
        #### `<<Interface>>Storage(**options)`
        
        Create tabular `storage`. Implementations should fully implement this interface to be compatible with the `Storage` API.
        
        - `options (dict)` - concrete storage options
        - `(exceptions.StorageError)` - raises on any error
        - `(Storage)` - returns `Storage` instance
        
        #### `storage.buckets`
        
        Return list of storage bucket names. A `bucket` is a special term which has almost the same meaning as `table`. You should consider `bucket` as a `table` stored in the `storage`.
        
        - `(exceptions.StorageError)` - raises on any error
        - `str[]` - return list of bucket names
        
        #### `create(bucket, descriptor, force=False)`
        
        Create one/multiple buckets.
        
        - `bucket (str/list)` - bucket name or list of bucket names
        - `descriptor (dict/dict[])` - schema descriptor or list of descriptors
        - `force (bool)` - whether to delete and re-create already existing buckets
        - `(exceptions.StorageError)` - raises on any error
        
        #### `delete(bucket=None, ignore=False)`
        
        Delete one/multiple/all buckets.
        
        - `bucket (str/list/None)` - bucket name or list of bucket names to delete. If `None`, all buckets will be deleted
        - `descriptor (dict/dict[])` - schema descriptor or list of descriptors
        - `ignore (bool)` - don't raise an error on non-existent bucket deletion from storage
        - `(exceptions.StorageError)` - raises on any error
        
        #### `describe(bucket, descriptor=None)`
        
        Get/set bucket's Table Schema descriptor.
        
        - `bucket (str)` - bucket name
        - `descriptor (dict/None)` - schema descriptor to set
        - `(exceptions.StorageError)` - raises on any error
        - `(dict)` - returns Table Schema descriptor
        
        #### `iter(bucket)`
        
        This method should return an iterator of typed values based on the schema of this bucket.
        
        - `bucket (str)` - bucket name
        - `(exceptions.StorageError)` - raises on any error
        - `(list[])` - yields data rows
        
        #### `read(bucket)`
        
        This method should read typed values based on the schema of this bucket.
        
        - `bucket (str)` - bucket name
        - `(exceptions.StorageError)` - raises on any error
        - `(list[])` - returns data rows
        
        #### `write(bucket, rows)`
        
        This method writes data rows into `storage`. It should store values of unsupported types as strings internally (like csv does).
        
        - `bucket (str)` - bucket name
        - `rows (list[])` - data rows to write
        - `(exceptions.StorageError)` - raises on any error
        
        ### Plugins
        
        Table Schema has a plugin system.  Any package with the name like `tableschema_<name>` can be imported as:
        
        ```python
        from tableschema.plugins import <name>
        ```
        
        If a plugin is not installed, an `ImportError` will be raised with a message describing how to install the plugin.
        
        #### Official plugins
        
        - [BigQuery Storage](https://github.com/frictionlessdata/tableschema-bigquery-py)
        - [Elasticsearch Storage](https://github.com/frictionlessdata/tableschema-elasticsearch-py)
        - [Pandas Storage](https://github.com/frictionlessdata/tableschema-pandas-py)
        - [SQL Storage](https://github.com/frictionlessdata/tableschema-sql-py)
        - [SPSS Storage](https://github.com/frictionlessdata/tableschema-spss-py)
        
        ### CLI
        
        > It's a provisional API excluded from SemVer. If you use it as a part of another program please pin `tableschema`  to a concrete version in your requirements file.
        
        Table Schema features a CLI called `tableschema`. This CLI exposes the `infer` and `validate` functions for command line use.
        
        Example of `validate` usage:
        
        ```
        $ tableschema validate path/to-schema.json
        ```
        
        Example of `infer` usage:
        
        ```
        $ tableschema infer path/to/data.csv
        ```
        
        The response is a schema as JSON. The optional argument `--encoding` allows a character encoding to be specified for the data file. The default is utf-8.
        
        ## Contributing
        
        The project follows the [Open Knowledge International coding standards](https://github.com/okfn/coding-standards).
        
        The recommended way to get started is to create and activate a project virtual environment.
        To install package and development dependencies into your active environment:
        
        ```
        $ make install
        ```
        
        To run tests with linting and coverage:
        
        ```bash
        $ make test
        ```
        
        For linting, `pylama` (configured in `pylama.ini`) is used. At this stage it's already
        installed into your environment and could be used separately with more fine-grained control
        as described in documentation - https://pylama.readthedocs.io/en/latest/.
        
        For example to sort results by error type:
        
        ```bash
        $ pylama --sort <path>
        ```
        
        For testing, `tox` (configured in `tox.ini`) is used.
        It's already installed into your environment and could be used separately with more fine-grained control as described in documentation - https://testrun.org/tox/latest/.
        
        For example to check subset of tests against Python 2 environment with increased verbosity.
        All positional arguments and options after `--` will be passed to `py.test`:
        
        ```bash
        tox -e py27 -- -v tests/<path>
        ```
        
        Under the hood `tox` uses `pytest` (configured in `pytest.ini`), `coverage`
        and `mock` packages. These packages are available only in tox envionments.
        
        ## Changelog
        
        Here described only breaking and the most important changes. The full changelog and documentation for all released versions can be found in the nicely formatted [commit history](https://github.com/frictionlessdata/tableschema-py/commits/master).
        
        #### v1.7
        
        - Added `field.schema` property
        
        #### v1.6
        
        - In `strict` mode raise an exception if there are problems in field construction
        
        #### v1.5
        
        - Allow providing custom guesser and resolver to schema infer
        
        #### v1.4
        
        - Added `schema.update_field` method
        
        #### v1.3
        
        - Support datetime with no time for date casting
        
        #### v1.2
        
        - Support floats like 1.0 for integer casting
        
        #### v1.1
        
        - Added the `confidence` parameter to `infer`
        
        #### v1.0
        
        - The library has been rebased on the Frictionless Data specs v1 - https://frictionlessdata.io/specs/table-schema/
Keywords: frictionless data,open data,json schema,table schema,data package,tabular data package
Platform: UNKNOWN
Classifier: Development Status :: 4 - Beta
Classifier: Environment :: Web Environment
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 2
Classifier: Programming Language :: Python :: 2.7
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.4
Classifier: Programming Language :: Python :: 3.5
Classifier: Programming Language :: Python :: 3.6
Classifier: Topic :: Internet :: WWW/HTTP :: Dynamic Content
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Description-Content-Type: text/markdown
Provides-Extra: develop
