Metadata-Version: 2.1
Name: law
Version: 0.1.9
Summary: Build large-scale task workflows using luigi, remote job submission, remote targets, and environment
Home-page: https://github.com/riga/law
Author: Marcel Rieger
Author-email: github.riga@icloud.com
License: BSD-3-Clause
Keywords: luigi workflow pipeline remote gfal submission cluster grid condor lsf glite arc sandboxing docker singularity
Platform: UNKNOWN
Classifier: Programming Language :: Python
Classifier: Programming Language :: Python :: 2
Classifier: Programming Language :: Python :: 2.7
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.6
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Development Status :: 4 - Beta
Classifier: Operating System :: OS Independent
Classifier: License :: OSI Approved :: BSD License
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Science/Research
Classifier: Intended Audience :: Information Technology
Classifier: Topic :: System :: Monitoring
Requires-Python: >=2.7, !=3.0.*, !=3.1.*, !=3.2.*, !=3.3.*, !=3.4.*, !=3.5.*, <4
Description-Content-Type: text/x-rst
Provides-Extra: docs
License-File: LICENSE

.. figure:: https://raw.githubusercontent.com/riga/law/master/logo.png
   :target: https://github.com/riga/law
   :align: center
   :alt: law logo

.. image:: https://github.com/riga/law/workflows/Lint%20and%20test/badge.svg
   :target: https://github.com/riga/law/actions?query=workflow%3A%22Lint+and+test%22
   :alt: Build status

.. image:: https://github.com/riga/law/workflows/Deploy%20images/badge.svg
   :target: https://github.com/riga/law/actions?query=workflow%3A%22Deploy+images%22
   :alt: Docker images

.. image:: https://readthedocs.org/projects/law/badge/?version=latest
   :target: http://law.readthedocs.io/en/latest
   :alt: Documentation status

.. image:: https://img.shields.io/pypi/v/law.svg?style=flat
   :target: https://pypi.python.org/pypi/law
   :alt: Package version

.. image:: https://img.shields.io/github/license/riga/law.svg
   :target: https://github.com/riga/law/blob/master/LICENSE
   :alt: License

.. image:: https://zenodo.org/badge/75482295.svg
   :target: https://zenodo.org/badge/latestdoi/75482295
   :alt: DOI


**Note**: This project is currently under development.
Version 1.0.0 will be the first, fully documented beta release, targetted for end of 2022.

Use law to build complex and large-scale task workflows.
It is build on top of `luigi <https://github.com/spotify/luigi>`__ and adds abstractions for **run locations**, **storage locations** and **software environments**.
Law strictly disentangles these building blocks and ensures they remain interchangeable and resource-opportunistic.

Key features:

- CLI with auto-completion and interactive status and dependency inspection.
- Remote targets with automatic retries and local caching
   - WebDAV, HTTP, Dropbox, SFTP, all WLCG protocols (srm, xrootd, dcap, gsiftp, webdav, ...)
- Automatic submission to batch systems from within tasks
   - HTCondor, LSF, gLite, ARC, Slurm
- Environment sandboxing, configurable on task level
   - Docker, Singularity, Sub-Shells, Python-venv


.. marker-after-header


Contents
========

- `First steps`_
   - `Installation and dependencies`_
   - `Usage at CERN`_
   - `Overcomplete example config`_
- `Projects using law`_
- `Examples`_
- `Further topics`_
   - `Auto completion on the command-line`_
   - `Tests`_
   - `Development`_


.. marker-before-body


First steps
===========

Installation and dependencies
-----------------------------

Install via `pip <https://pypi.python.org/pypi/law>`__:

.. code-block:: bash

   pip install law

This command also installs `luigi <https://pypi.python.org/pypi/luigi>`__ and `six <https://pypi.python.org/pypi/six>`__.

The (default) remote target implementation also requires `gfal2 <https://dmc-docs.web.cern.ch/dmc-docs/gfal2/gfal2.html>`__ and `gfal2-python <https://pypi.python.org/pypi/gfal2-python>`__ (optional, also via pip) to be installed.



Usage at CERN
-------------

See the `wiki <https://github.com/riga/law/wiki/Usage-at-CERN>`__.


Overcomplete example config
---------------------------

See `law.cfg.example <https://github.com/riga/law/tree/master/law.cfg.example>`__.


Projects using law
==================

- CMS Di-Higgs Inference Tools:
   - Basis for statistical analysis for all Di-Higgs searches in CMS, starting at datacard-level
   - `repo <https://gitlab.cern.ch/hh/tools/inference>`__, `docs <https://cms-hh.web.cern.ch/cms-hh/tools/inference/index.html>`__
- CMS B-Tag SF Measurement:
   - Automated workflow for deriving shape-calibrating b-tag scale factors, starting at MiniAOD-level
   - `repo <https://github.com/cms-btv-pog/jet-tagging-sf>`__
- CMS Tau POG ML Tools:
   - Preprocessing pipeline for ML trainings in the TAU group
   - `repo <https://github.com/cms-tau-pog/TauMLTools>`__
- CMS HLT Config Parser:
   - Collects information from various databases (HLT, bril, etc.) and shows menus, triggers paths, filter names for configurable MC datasets or data runs
   - `repo <https://github.com/riga/cms-hlt-parser>`__
- UHH-CMS Analysis Framework:
   - Python based, fully automated, columnar framework, including job submission, resolution of systematics and ML pipelines, starting at NanoAOD-level with an optimized multi-threaded column reader
   - `repo <https://github.com/uhh-cms/analysis_playground>`__, `docs <http://analysis_playground.readthedocs.io>`__, `task structure <https://github.com/uhh-cms/analysis_playground/issues/25>`__
- RWTH-CMS Analysis Framework:
   - Basis for multiple CMS analyses ranging from Di-Higgs, to single Higgs and b-tag SF measurements, starting at NanoAOD-level and based on coffea processors
   - `repo <https://git.rwth-aachen.de/3pia/cms_analyses/common/-/tree/master/>`__
- CIEMAT-CMS Analysis Framework:
   - Python and RDataFrame based framework starting from NanoAOD and targetting multiple CMS analyses
   - `repo <https://gitlab.cern.ch/cms-phys-ciemat/nanoaod_base_analysis/>`__
- CMS 3D Z+jet 13TeV analysis
   - Analysis workflow management from NTuple production to final plots and fits
   - `repo <https://gitlab.etp.kit.edu/cverstege/zjet-analysis>`__
- NP-correction derivation tool
   - MC generation with Herwig and analysis of generated events with Rivet
   - `repo <https://github.com/HerrHorizontal/herwig-run>`__
- CMS SUSY Searches at DESY
	- Analysis framework for CMS SUSY searches going from custom NanoAODs -> NTuple production -> DNN-based inference -> final plots and fits
	- `repo <https://github.com/frengelk/Susy1LeptonAnalysis>`__
- YOUR REPOSITORY HERE

If your project uses law but is not yet listed here, feel free to open a pull request or mention your project details in a new `issue <https://github.com/riga/law/issues/new?template=register_project.md&title=New%20project%20using%20law&assignee=riga>`__ and it will be added.


Examples
========

All examples can be run either in a Jupyter notebook or a dedicated docker container.
For the latter, do

.. code-block:: bash

   docker run -ti riga/law:example <example_name>

- `loremipsum <https://github.com/riga/law/tree/master/examples/loremipsum>`__: The *hello world* example of law.
- `workflows <https://github.com/riga/law/tree/master/examples/workflows>`__: Law workflows.
- `dropbox_targets <https://github.com/riga/law/tree/master/examples/dropbox_targets>`__: Working with targets that are stored on Dropbox.
- `wlcg_targets <https://github.com/riga/law/tree/master/examples/wlcg_targets>`__: Working with targets that are stored on WLCG storage elements (dCache, EOS, ...). TODO.
- `htcondor_at_vispa <https://github.com/riga/law/tree/master/examples/htcondor_at_vispa>`__: HTCondor workflows at the `VISPA service <https://vispa.physik.rwth-aachen.de>`__.
- `htcondor_at_cern <https://github.com/riga/law/tree/master/examples/htcondor_at_cern>`__: HTCondor workflows at the CERN batch infrastructure.
- `sequential_htcondor_at_cern <https://github.com/riga/law/tree/master/examples/sequential_htcondor_at_cern>`__: Continuation of the `htcondor_at_cern <https://github.com/riga/law/tree/master/examples/htcondor_at_cern>`__ example, showing sequential jobs that eagerly start once jobs running previous requirements succeeded.
- `htcondor_at_naf <https://github.com/riga/law/tree/master/examples/htcondor_at_naf>`__: HTCondor workflows at German `National Analysis Facility (NAF) <https://confluence.desy.de/display/IS/NAF+-+National+Analysis+Facility>`__.
- `slurm_at_maxwell <https://github.com/riga/law/tree/master/examples/slurm_at_maxwell>`__: Slurm workflows at the `Desy Maxwell cluster <https://confluence.desy.de/display/MXW/Maxwell+Cluster>`__.
- `grid_at_cern <https://github.com/riga/law_example_WLCG>`__: Workflows that run jobs and store data on the WLCG.
- `lsf_at_cern <https://github.com/riga/law/tree/master/examples/lsf_at_cern>`__: LSF workflows at the CERN batch infrastructure.
- `docker_sandboxes <https://github.com/riga/law/tree/master/examples/docker_sandboxes>`__: Environment sandboxing using Docker. TODO.
- `singularity_sandboxes <https://github.com/riga/law/tree/master/examples/singularity_sandboxes>`__: Environment sandboxing using Singularity. TODO.
- `subshell_sandboxes <https://github.com/riga/law/tree/master/examples/subshell_sandboxes>`__: Environment sandboxing using Subshells. TODO.
- `parallel_optimization <https://github.com/riga/law/tree/master/examples/parallel_optimization>`__: Parallel optimization using `scikit optimize <https://scikit-optimize.github.io>`__.
- `notifications <https://github.com/riga/law/tree/master/examples/notifications>`__: Demonstration of slack and telegram task status notifications..
- `CMS Single Top Analysis <https://github.com/riga/law_example_CMSSingleTopAnalysis>`__: Simple physics analysis using law.


Further topics
==============

Auto completion on the command-line
-----------------------------------

bash
^^^^

.. code-block:: shell

   source "$( law completion )"


zsh
^^^

zsh is able to load and evaluate bash completion scripts via ``bashcompinit``.
In order for ``bashcompinit`` to work, you should run ``compinstall`` to enable completion scripts:

.. code-block:: shell

   autoload -Uz compinstall && compinstall

After following the instructions, these lines should be present in your ~/.zshrc:

.. code-block:: shell

   # The following lines were added by compinstall
   zstyle :compinstall filename '~/.zshrc'

   autoload -Uz +X compinit && compinit
   autoload -Uz +X bashcompinit && bashcompinit
   # End of lines added by compinstall

If this is the case, just source the law completion script (which internally enables ``bashcompinit``) and you're good to go:

.. code-block:: shell

   source "$( law completion )"


Tests
-----

To run and test law, there are various docker `riga/law` images available on the `DockerHub <https://cloud.docker.com/u/riga/repository/docker/riga/law>`__, corresponding to different OS and Python versions.


|    OS    | Python |                  Tags                  |
| -------- | ------ | -------------------------------------- |
| Centos 8 |    3.9 | c8-py39, c8-py3, c8, py39, py3, latest |
| Centos 8 |    3.8 | c8-py38, py38                          |
| Centos 8 |    3.7 | c8-py37, py37                          |
| Centos 7 |   3.10 | c7-py310, py310                        |
| Centos 7 |    3.9 | c7-py39, c7-py3                        |
| Centos 7 |    3.8 | c7-py38                                |
| Centos 7 |    3.7 | c7-py37                                |
| Centos 7 |    3.6 | c7-py36, py36                          |

.. code-block:: bash

   docker run -ti riga/law:latest


Development
-----------

- Source hosted at `GitHub <https://github.com/riga/law>`__
- Report issues, questions, feature requests on `GitHub Issues <https://github.com/riga/law/issues>`__


.. marker-after-body


