Metadata-Version: 2.1
Name: remedian
Version: 0.1.2
Summary: Remedian: robust averaging of large data sets
Home-page: http://github.com/sappelhoff/remedian
Author: Stefan Appelhoff
Author-email: stefan.appelhoff@mailbox.org
License: MIT
Project-URL: Bug Reports, https://github.com/sappelhoff/remedian/issues
Project-URL: Source, https://github.com/sappelhoff/remedian
Description: [![Build Status](https://travis-ci.org/sappelhoff/remedian.svg?branch=master)](https://travis-ci.org/sappelhoff/remedian)
        [![codecov](https://codecov.io/gh/sappelhoff/remedian/branch/master/graph/badge.svg)](https://codecov.io/gh/sappelhoff/remedian)
        [![Documentation Status](https://readthedocs.org/projects/remedian/badge/?version=latest)](http://remedian.readthedocs.io/en/latest/?badge=latest)
        [![PyPI version](https://badge.fury.io/py/remedian.svg)](https://badge.fury.io/py/remedian)
        
        # remedian
        The  Remedian:  A  Robust  Averaging  Method  for  Large  Data  Sets - Python
        implementation
        
        This algorithm is used to approximate the median of several data chunks if
        these data chunks cannot (or should not) be loaded into memory at once.
        
        Given a data chunk of size `obs_size`, and `t` data chunks overall, the
        Remedian class sets up a number `k_arrs` of arrays of length `n_obs`.
        
        The median of the `t` data chunks of size `obs_size` is then approximated
        as follows: One data chunk after another is fed into the `n_obs` positions
        of the first array. When the first array is full, its median is calculated
        and stored in the first position of the second array. After this, the first
        array is re-used to fill the second position of the second array, etc.
        When the second array is full, the median of its values is stored in the
        first position of the third array, and so on.
        
        The final "Remedian" is the median of the last array, after all `t` data
        chunks have been fed into the object.
        
        # Installation
        
        `pip install remedian`
        
        The dependencies should be installed automatically by pip.
        
        # Installation of most recent version
        
        1. activate your python environment
        2. `git clone https://www.github.com/sappelhoff/remedian`
        3. `cd remedian`
        5. `pip install -e .`
        6. then you should be able to `from remedian.remedian import Remedian`
        
        # Usage
        
        See the [example in the docs](https://remedian.readthedocs.io/en/latest/examples.html).
        
        # References
        
        > P.J. Rousseeuw, G.W. Bassett Jr., "The remedian: A robust averaging method
          for large data sets", Journal of the American Statistical Association, vol.
          85 (1990), pp. 97-104
        
        > M. Chao, G. Lin, "The asymptotic distributions of the remedians", Journal of
          Statistical Planning and Inference, vol. 37 (1993), pp. 1-11
        
        > Domenico Cantone, Micha Hofri, "Further analysis of the remedian algorithm",
          Theoretical Computer Science, vol. 495 (2013), pp. 1-16
        
Keywords: remedian median memory efficient big data
Platform: UNKNOWN
Classifier: Topic :: Scientific/Engineering :: Mathematics
Classifier: Programming Language :: Python
Classifier: License :: OSI Approved :: MIT License
Classifier: Intended Audience :: Science/Research
Requires-Python: >=2.7
Description-Content-Type: text/markdown
Provides-Extra: test
