DIRESA architecture

Introduction

DIRESA is a non-linear dimension reduction technique which fulfills the following requirements:

  1. Decoded latent dataset is as close as possible to original dataset

  2. Distance (ordering) in real space is preserved in latent space.

  3. Latent components are independent

_images/diresa_requirements.jpg

We could add as a fourth requirement that the latent components are ordered by importance (explained variance). This is not part of the architecture, but ordering can be done afterwards by decoding the latent components one by one, and calculating the variances.

Architecture

DIRESA is a Siamese twin autoencoder. The original dataset is fed in one encoder branch, while the other is provided with a shuffled dataset. The two encoder branches share the weights and produce two different latent space representations. These are used for the distance loss term, with the goal that the distance between samples in the dataset be reflected (preserved or correlated) in the distance between latent representations of those samples. Different distance loss functions are implemented, these can be found in the loss module.

_images/diresa_architecture.jpg

The total loss is the sum of the reconstruction loss, the covariance loss (multiplied by a weight factor) and the distance loss (multiplied by a weight factor). To lower the hyperparameter tuning effor, an annealing method is foreseen for the covariance loss weight factor.