Metadata-Version: 2.1
Name: df-io
Version: 0.0.1
Summary: Helpers for doing IO with Pandas DataFrames
Home-page: https://github.com/Mikata-Project/df_io
Author: NAGY, Attila
Author-email: nagy.attila@gmail.com
License: UNKNOWN
Description: # df_io
        Python helpers for doing IO with Pandas DataFrames
        
        # Available methods
        ## write_df
        
        This method supports:
        * streaming writes
        * chunked writes
        * gzip compression
        * passing parameters to Pandas' writers
        * writing to AWS S3 and local files
        
        ### Examples
        
        Write a Pandas DataFrame (df) to an S3 path in CSV format:
        
        ```python
        import df_io
        
        df_io.write_df(df, 's3://bucket/dir/mydata.csv')
        ```
        
        The same with gzip compression:
        
        ```python
        df_io.write_df(df, 's3://bucket/dir/mydata.csv.gz')
        ```
        
        Using JSON lines (the default):
        
        ```python
        df_io.write_df(df, 's3://bucket/dir/mydata.json.gz', fmt='json')
        ```
        
        Passing writer parameters:
        
        ```python
        df_io.write_df(df, 's3://bucket/dir/mydata.json.gz', fmt='json', writer_options={'lines': False})
        ```
        
        Chunked write (splitting the df into equally sized parts and creating/writing outputs for them):
        
        ```python
        df_io.write_df(df, 's3://bucket/dir/mydata.json.gz', fmt='json', chunksize=10000)
        ```
        
Platform: UNKNOWN
Classifier: Programming Language :: Python
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Description-Content-Type: text/markdown
