Metadata-Version: 2.1
Name: koalas
Version: 0.2.0
Summary: Pandas DataFrame API on Apache Spark
Home-page: https://github.com/databricks/koalas
Maintainer: Databricks
Maintainer-email: koalas@databricks.com
License: http://www.apache.org/licenses/LICENSE-2.0
Project-URL: Bug Tracker, https://github.com/databricks/koalas/issues
Project-URL: Source Code, https://github.com/databricks/koalas
Platform: UNKNOWN
Requires-Python: >=2.7,!=3.0.*,!=3.1.*,!=3.2.*,!=3.3.*,!=3.4.*
Requires-Dist: pandas (>=0.23)
Requires-Dist: decorator
Requires-Dist: pyarrow (<0.11,>=0.10)
Provides-Extra: spark
Requires-Dist: pyspark (>=2.4.0) ; extra == 'spark'


Koalas makes data scientists more productive when interacting with big data,
by augmenting Apache Spark's Python DataFrame API to be compatible with
Pandas'.

Pandas is the de facto standard (single-node) dataframe implementation in
Python, while Spark is the de facto standard for big data processing.
With this package, data scientists can:

- Be immediately productive with Spark, with no learning curve, if one
  is already familiar with Pandas.
- Have a single codebase that works both with Pandas (tests, smaller datasets)
  and with Spark (distributed datasets).


