Metadata-Version: 2.0
Name: pydrill
Version: 0.0.2
Summary: Python Driver for Apache Drill.
Home-page: https://github.com/PythonicNinja/pydrill
Author: Wojciech Nowak
Author-email: mail@pythonic.ninja
License: MIT
Keywords: pydrill
Platform: UNKNOWN
Classifier: Development Status :: 2 - Pre-Alpha
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: MIT License
Classifier: Natural Language :: English
Classifier: Programming Language :: Python :: 2
Classifier: Programming Language :: Python :: 2.6
Classifier: Programming Language :: Python :: 2.7
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.3
Classifier: Programming Language :: Python :: 3.4
Classifier: Programming Language :: Python :: 3.5
Requires-Dist: requests

===============================
pydrill
===============================

.. image:: https://img.shields.io/travis/PythonicNinja/pydrill.svg
        :target: https://travis-ci.org/PythonicNinja/pydrill

.. image:: https://readthedocs.org/projects/pydrill/badge/?version=latest
        :target: https://readthedocs.org/projects/pydrill/?badge=latest
        :alt: Documentation Status

.. image:: https://coveralls.io/repos/PythonicNinja/pydrill/badge.svg?branch=master&service=github
  :target: https://coveralls.io/github/PythonicNinja/pydrill?branch=master


Python Driver for `Apache Drill <https://drill.apache.org/>`_.

*Schema-free SQL Query Engine for Hadoop, NoSQL and Cloud Storage*

* Free software: MIT license
* Documentation: https://pydrill.readthedocs.org.

Features
--------

* Python 2/3 compatibility,
* Mapping Results to internal python types,
* Compatibility with Pandas data frame,

Installation
------------
::

    pip install git+git://github.com/PythonicNinja/pydrill.git

Sample usage
------------
::

    from pydrill.client import PyDrill

    drill = PyDrill(host='localhost', port=8047)

    if not drill.is_active():
        raise ImproperlyConfigured('Please run Drill first')

    yelp_reviews = drill.query('''
      SELECT * FROM
      `dfs.root`.`./Users/macbookair/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_review.json`
      LIMIT 5
    ''')

    for result in yelp_reviews:
        print("%s: %s" %(result['type'], result['date']))


    # pandas dataframe

    df = yelp_reviews.to_dataframe()
    print(df[df['stars'] > 3])



=======
History
=======

0.0.2 (2016-04-24)
------------------
* First release on PyPI.
* Implementation of metrics/storage/options/stats
* Builds are tested by docker container with Apache Drill running
* support for pandas with ResultQuery.to_dataframe

0.0.1 (2015-12-28)
------------------
* Project start


