Metadata-Version: 2.1
Name: wageubn
Version: 1.2.0
Summary: torch implement of WAGEUBN
Home-page: https://github.com/PannenetsF/WAGEUBN
Author: Pannenets F
Author-email: pannenets.f@foxmail.com
License: BSD-3-Clause License
Description: # wageubn
        wageubn's pytorch implementation.
        
        - [wageubn](#wageubn)
          - [Notice](#notice)
          - [wageubn's modules](#wageubns-modules)
            - [wageubn.function](#wageubnfunction)
            - [wageubn.config](#wageubnconfig)
          - [wageubn's problem](#wageubns-problem)
          - [Supplement](#supplement)
            - [Where is the hardware-precision error, update and gradient?](#where-is-the-hardware-precision-error-update-and-gradient)
            - [How to get the data width of a tensor?](#how-to-get-the-data-width-of-a-tensor)
        - [Contributing](#contributing)
        - [Acknowledgment](#acknowledgment)
        
        ## Notice 
        
        This repo is based on the same framework as [tqt](https://github.com/PannenetsF/TQT) and focuses on inferrence only. Even the quantized error and gradient could be got from `wageubn.function.errorquant` and `wageubn.function.gradquant`, we will not use them. If they are essential for your training, please fork this repo and wrap the `wageubn.function` modules with it.
        
        Now available at [https://pypi.org/project/wageubn/0.1.0/](https://pypi.org/project/wageubn/0.1.0/).
        
        Networks quantized via this package could be find at [https://github.com/PannenetsF/QuantizationPool](https://github.com/PannenetsF/QuantizationPool).
        
        ## wageubn's modules
        
        ### wageubn.function 
        
        `function` is a re-impletement of `torch.nn.modules`. Besides all the args used in the original function, a quantized function get 2 kind of optional arguments: `bit_width` and `retrain`. 
        
        `bit_width` has 2 type: weight/bias or activation. 
        
        If the `retrain` is `True`, the Module will be in Retrain Mode, with the `log2_t` trainable. Else, in Static Mode, the `log2_t` are determined by initialization and not trainable.
        
        ### wageubn.config
        
        Config the bitwidth via `wageubn.config.Config` and `wageubn.config.network_config`. `wageubn.config.Config` is a namedtuple and you can set bitwidth as its key.
        
        With more consideration of hardware implement, the input and output of any module should be FIXED(or quantized). So there is a `iostrict` attribute to do this.
        
        ## wageubn's problem 
        
        `iostrict` is one part as stated in [last section](#wageubnconfig). The paper uses `directquant` almost everywhere, which turns to a new problem: the given k is not the bitwidth of data. It's just the width of the decimal. As a result, I decide to add more attribute to control the module's real bitwidth `real_*` for each part.
        
        ## Supplement
        
        ### Where is the hardware-precision error, update and gradient?
        
        In the PyTorch framework, it's not easy to do the **real** quantization now. We cannot get gradient for int-like data type, for example. And there are more essential problems like overflow (no matter how careful you are to make the quantization via float) or truncation error, unless the network could rewrite all operations in CUDA or C++. But it requires much more effort than I can afford now. 
        
        ### How to get the data width of a tensor?
        
        In fact, wageubn does the simplest work: get the data, train and eval based on the given bit width. But how to get the bit width is not mentioned. So we can say, under a given bit width (like 8), we need manually find the fixed point's place for each operations or even each layer. Then I will turn to TQT to do more work.
        
        # Contributing 
        
        It will be great of you to make this project better! There is some ways to contribute!
        
        1. To start with, issues and feature request could let maintainers know what's wrong or anything essential to be added. 
        2. If you use the package in you work/repo, just cite the repo and add a dependency note! 
        3. You can add some function in `torch.nn` like `HardTanh` and feel free to open a pull request! The code style is simple as [here](style.md).
        
        # Acknowledgment 
        
        
        The original papar could be find at [Arxiv, Training high-performance and large-scale deep neural networks with full 8-bit integers](https://arxiv.org/abs/1909.02384).
Keywords: quantization NN torch
Platform: UNKNOWN
Classifier: Development Status :: 4 - Beta
Classifier: Environment :: GPU
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: BSD License
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python
Requires-Python: >=3.7.*
Description-Content-Type: text/markdown
