Metadata-Version: 2.1
Name: raylab
Version: 0.6.5
Summary: Reinforcement learning algorithms in RLlib and PyTorch.
Home-page: https://github.com/angelolovatto/raylab
Author: Ângelo Gregório Lovatto
Author-email: angelolovatto@gmail.com
License: MIT license
Keywords: raylab
Platform: UNKNOWN
Classifier: Development Status :: 2 - Pre-Alpha
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: MIT License
Classifier: Natural Language :: English
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.7
Requires-Python: ~=3.7.0
Requires-Dist: Click (>=7.0)
Requires-Dist: bokeh
Requires-Dist: ray[dashboard,rllib] (>=0.8.5)
Requires-Dist: streamlit
Requires-Dist: torch

======
raylab
======

|PyPI| |Travis| |PyUp|

.. |PyPI| image:: https://img.shields.io/pypi/v/raylab?logo=PyPi&logoColor=white
        :alt: PyPI

.. |Travis| image:: https://img.shields.io/travis/com/angelolovatto/raylab?logo=travis-ci&logoColor=important
        :alt: Travis (.com)

.. |PyUp| image:: https://pyup.io/repos/github/angelolovatto/raylab/shield.svg
     :target: https://pyup.io/repos/github/angelolovatto/raylab/
     :alt: Updates


Reinforcement learning algorithms in `RLlib <https://github.com/ray-project/ray/tree/master/rllib>`_ and `PyTorch <https://pytorch.org>`_.


Introduction
------------

Raylab provides agents and environments to be used with a normal RLlib/Tune setup.

.. code-block:: python

             import ray
             from ray import tune
             import raylab

             def main():
                 raylab.register_all_agents()
                 raylab.register_all_environments()
                 ray.init()
                 tune.run(
                     "NAF",
                     local_dir=...,
                     stop={"timesteps_total": 100000},
                     config={
                         "env": "CartPoleSwingUp-v0",
                         "exploration_config": {
                             "type": tune.grid_search([
                                 "raylab.utils.exploration.GaussianNoise",
                                 "raylab.utils.exploration.ParameterNoise"
                             ])
                         }
                         ...
                     },
                 )

             if __name__ == "__main__":
                 main()


One can then visualize the results using `raylab dashboard`

.. image:: https://i.imgur.com/bVc6WC5.png
        :align: center


Installation
------------

.. code:: bash

          pip install raylab


Algorithms
----------

+--------------------------------------------------------+-------------------------+
| Paper                                                  | Agent Name              |
+--------------------------------------------------------+-------------------------+
| `Actor Critic using Kronecker-factored Trust Region`_  | ACKTR                   |
+--------------------------------------------------------+-------------------------+
| `Trust Region Policy Optimization`_                    | TRPO                    |
+--------------------------------------------------------+-------------------------+
| `Normalized Advantage Function`_                       | NAF                     |
+--------------------------------------------------------+-------------------------+
| `Stochastic Value Gradients`_                          | SVG(inf)/SVG(1)/SoftSVG |
+--------------------------------------------------------+-------------------------+
| `Soft Actor-Critic`_                                   | SoftAC                  |
+--------------------------------------------------------+-------------------------+
| `Streamlined Off-Policy`_ (DDPG)                       | SOP                     |
+--------------------------------------------------------+-------------------------+


.. _`Actor Critic using Kronecker-factored Trust Region`: https://arxiv.org/abs/1708.05144
.. _`Trust Region Policy Optimization`: http://proceedings.mlr.press/v37/schulman15.html
.. _`Normalized Advantage Function`: http://proceedings.mlr.press/v48/gu16.html
.. _`Stochastic Value Gradients`: http://papers.nips.cc/paper/5796-learning-continuous-control-policies-by-stochastic-value-gradients
.. _`Soft Actor-Critic`: http://proceedings.mlr.press/v80/haarnoja18b.html
.. _`Streamlined Off-Policy`: https://arxiv.org/abs/1910.02208


Command-line interface
----------------------

For a high-level description of the available utilities, run `raylab --help`

.. code:: bash

          Usage: raylab [OPTIONS] COMMAND [ARGS]...

          RayLab: Reinforcement learning algorithms in RLlib.

          Options:
            --help  Show this message and exit.

            Commands:
              dashboard    Launch the experiment dashboard to monitor training progress.
              experiment   Launch a Tune experiment from a config file.
              find-best    Find the best experiment checkpoint as measured by a metric.
              plot         Draw lineplots of the relevant variables and display them on...
              plot-export  Draw lineplots of the relevant variables and save them as...
              rollout      Simulate an agent from a given checkpoint in the desired...


Packages
--------

The project is structured as follows
::

    raylab
    ├── agents            # Trainer and Policy classes
    ├── cli               # Command line utilities
    ├── distributions     # Extendend and additional PyTorch distributions
    ├── envs              # Gym environments
    ├── logger            # Tune loggers
    ├── modules           # PyTorch neural network modules for algorithms
        ├── basic         # Building blocks for neural networks
        ├── flows         # Normalizing Flow modules
        ├── distributions # TorchScript compatible distribution modules
    ├── policy            # Extensions and customizations of RLlib's policy API
    ├── utils             # miscellaneous utilities



Credits
-------

This package was created with Cookiecutter_ and the `audreyr/cookiecutter-pypackage`_ project template.

.. _Cookiecutter: https://github.com/audreyr/cookiecutter
.. _`audreyr/cookiecutter-pypackage`: https://github.com/audreyr/cookiecutter-pypackage


=======
History
=======

0.6.5 (2020-05-21)
------------------

* First release on PyPI.


