Metadata-Version: 2.4
Name: hdmrlib
Version: 0.1.1
Summary: High Dimensional Model Representation (HDMR) and Enhanced Multivariate Products Representation (EMPR) library
Author: HDMR-Lib Contributors
Author-email: Muhammed Enis Sen <m.enis.97@gmail.com>, Bugra Eyidogan <bugra.eyidogan@gmail.com>, Pinar Guler <yalcin.pinar2@gmail.com>
License-Expression: MIT
Project-URL: Homepage, https://github.com/hdmrlib/HDMR-Lib
Project-URL: Documentation, https://github.com/hdmrlib/HDMR-Lib#readme
Project-URL: Repository, https://github.com/hdmrlib/HDMR-Lib
Project-URL: Bug Tracker, https://github.com/hdmrlib/HDMR-Lib/issues
Keywords: hdmr,empr,tensor,decomposition,sensitivity-analysis
Classifier: Development Status :: 3 - Alpha
Classifier: Intended Audience :: Science/Research
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Requires-Python: >=3.8
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: numpy>=1.22
Provides-Extra: torch
Requires-Dist: torch>=2.2; extra == "torch"
Provides-Extra: tensorflow
Requires-Dist: tensorflow>=2.14; extra == "tensorflow"
Provides-Extra: all
Requires-Dist: torch>=2.2; extra == "all"
Requires-Dist: tensorflow>=2.14; extra == "all"
Provides-Extra: dev
Requires-Dist: pytest>=7.0; extra == "dev"
Requires-Dist: pytest-cov>=4.0; extra == "dev"
Requires-Dist: matplotlib>=3.5; extra == "dev"
Dynamic: license-file

# HDMR-Lib

**High Dimensional Model Representation and Enhanced Multivariate Products Representation**

A powerful Python library for tensor decomposition using HDMR and EMPR methods with multi-backend support (NumPy, PyTorch, TensorFlow).

---

## 🎯 **What is HDMR-Lib?**

HDMR-Lib implements two state-of-the-art tensor decomposition methods:

- **HDMR (High Dimensional Model Representation)**: Decomposes multivariate functions into hierarchical components with weighted support vectors
- **EMPR (Enhanced Multivariate Products Representation)**: An optimized variant using unweighted support vectors for better computational efficiency

Both methods represent complex multivariate functions as sums of lower-dimensional components, making them ideal for:
- **Sensitivity Analysis** - Identify which variables matter most
- **Uncertainty Quantification** - Understand how input uncertainties propagate  
- **Function Approximation** - Approximate complex functions with simpler components
- **Dimensionality Reduction** - Reduce computational complexity while preserving accuracy

---

## 🚀 **Quick Start**

### Installation

#### Option 1: Install from PyPI (Recommended)

```bash
# Install with all backends (NumPy, PyTorch, TensorFlow)
pip install hdmrlib

# Development installation (includes testing tools)
pip install hdmrlib[dev]
```

#### Option 2: Install from Source

1. **Clone the repository:**
```bash
git clone https://github.com/hdmrlib/HDMR-Lib.git
cd HDMR-Lib
```

2. **Install in development mode:**
```bash
pip install -e .

# Or with development tools
pip install -e .[dev]
```

### GPU Acceleration (Optional)

For CUDA/GPU support:

```bash
# PyTorch with CUDA
pip install torch --index-url https://download.pytorch.org/whl/cu118  # CUDA 11.8

# TensorFlow (GPU support included)
pip install tensorflow
```

**Requirements:**
- NVIDIA GPU with CUDA capability
- CUDA Toolkit (11.8 or 12.1)
- cuDNN library

### Basic Usage

```python
import numpy as np
from hdmrlib import HDMR, EMPR, set_backend

# Create test data
tensor = np.random.rand(5, 5, 5)

# Set backend (numpy, torch, tensorflow)
set_backend('numpy')

# EMPR Decomposition
empr_model = EMPR(tensor, order=2)
empr_result = empr_model.reconstruct()
empr_components = empr_model.components  # dict of components

# HDMR Decomposition  
hdmr_model = HDMR(tensor, order=2)
hdmr_result = hdmr_model.reconstruct()
hdmr_components = hdmr_model.components

print(f"EMPR MSE: {np.mean((tensor - empr_result) ** 2):.6e}")
print(f\"Available EMPR components: {list(empr_components.keys())}\")
```

---

## 📚 **Detailed Usage Guide**

### 1. **Backend Selection**

HDMR-Lib supports multiple computational backends:

```python
from hdmrlib import set_backend, get_backend

# Available backends
set_backend('numpy')      # NumPy (default, always available)
set_backend('torch')      # PyTorch (requires torch>=2.2)  
set_backend('tensorflow') # TensorFlow (requires tensorflow>=2.14)

# Check current backend
print(f"Current backend: {get_backend()}")
```

### 2. **EMPR (Enhanced Multivariate Products Representation)**

EMPR provides efficient decomposition with configurable support vectors:

```python
import numpy as np
from hdmrlib import EMPR

# Create test tensor
x, y, z = np.meshgrid(np.linspace(0, 1, 6), np.linspace(0, 1, 6), np.linspace(0, 1, 6), indexing='ij')
tensor = 1 + 2*x + 3*y + 4*z + x*y + x*z + y*z

# Initialize EMPR with different support vectors
empr_das = EMPR(tensor, supports='das')     # Data-Adaptive Supports (recommended)
empr_ones = EMPR(tensor, supports='ones')   # Uniform supports

# Decompose at different orders
result_order1 = empr_das.reconstruct(order=1)  # Main effects only
result_order2 = empr_das.reconstruct(order=2)  # Main effects + 2-way interactions
result_order3 = empr_das.reconstruct(order=3)  # Full decomposition

# Extract components (all stored in the underlying model)
components = empr_das.components
print("Available components:", list(components.keys()))
# Example: ['g_1', 'g_2', 'g_3', 'g_1,2', 'g_1,3', 'g_2,3', 'g_1,2,3']

# Component interpretation:
# g1, g2, g3: Main effects (univariate)
# g12, g13, g23: Two-way interactions (bivariate)  
# g123: Three-way interaction (trivariate)
```

### 3. **HDMR (High Dimensional Model Representation)**

HDMR uses weighted support vectors for enhanced numerical stability:

```python
from hdmrlib import HDMR

# Initialize HDMR with different weight types
hdmr_avg = HDMR(tensor, order=2, weight='avg')        # Average weights (default)
hdmr_gauss = HDMR(tensor, order=2, weight='gaussian') # Gaussian weights
hdmr_cheb = HDMR(tensor, order=2, weight='chebyshev') # Chebyshev weights

# Custom weights
custom_weights = [np.ones((6, 1)), np.ones((6, 1)), np.ones((6, 1))]
hdmr_custom = HDMR(tensor, weight='custom', custom_weights=custom_weights)

# Decompose and analyze
result = hdmr_avg.reconstruct(order=2)
components = hdmr_avg.components

# Calculate reconstruction quality
mse = np.mean((tensor - result) ** 2)
print(f"HDMR Reconstruction MSE: {mse:.6e}")
```

### 4. **Custom Support Vectors**

For advanced users, you can provide custom support vectors:

```python
# Define custom support vectors for each dimension
custom_supports = [
    np.linspace(0, 1, 6).reshape(-1, 1),  # Linear support for dimension 1
    np.exp(np.linspace(-1, 1, 6)).reshape(-1, 1),  # Exponential for dimension 2
    np.sin(np.linspace(0, np.pi, 6)).reshape(-1, 1)  # Sinusoidal for dimension 3
]

# Use custom supports
empr_custom = EMPR(tensor, supports='custom', order=2, custom_supports=custom_supports)
result = empr_custom.reconstruct(order=2)
```

### 5. **Multi-Backend Comparison**

Compare performance across different backends:

```python
import time
from hdmrlib import set_backend

backends = ['numpy', 'torch', 'tensorflow']
tensor = np.random.rand(8, 8, 8)

for backend in backends:
    try:
        set_backend(backend)
        
        start_time = time.time()
        model = EMPR(tensor, order=2)
        result = model.reconstruct(order=2)
        elapsed = time.time() - start_time
        
        mse = np.mean((tensor - result) ** 2)
        print(f"{backend:>12}: {elapsed:.4f}s, MSE: {mse:.6e}")
        
    except Exception as e:
        print(f"{backend:>12}: Error - {e}")
```

---

## 🔬 **Advanced Examples**

### Sensitivity Analysis

Analyze which variables contribute most to function variation:

```python
# Create function with known variable importance
x1, x2, x3 = np.meshgrid(np.linspace(-1, 1, 8), np.linspace(-1, 1, 8), np.linspace(-1, 1, 8), indexing='ij')
tensor = 5*x1**2 + 2*x2 + 0.1*x3 + x1*x2  # x1 most important, x3 least

# Decompose with EMPR
model = EMPR(tensor, supports='das', order=2)
components = model.components

# Calculate component variances (sensitivity indicators)
sensitivities = {}
for comp_name, comp_tensor in components.items():
    sensitivities[comp_name] = np.var(comp_tensor)

# Sort by importance
sorted_sens = sorted(sensitivities.items(), key=lambda x: x[1], reverse=True)
print("Component sensitivities (most to least important):")
for comp, sens in sorted_sens:
    print(f"  {comp}: {sens:.4f}")
```

### Function Approximation

Use HDMR/EMPR for efficient function approximation:

```python
# Original expensive function
def expensive_function(x, y, z):
    return np.sin(np.pi*x) * np.cos(np.pi*y) * np.exp(z) + x*y*z

# Create training data
x, y, z = np.meshgrid(np.linspace(0, 1, 10), np.linspace(0, 1, 10), np.linspace(0, 1, 10), indexing='ij')
training_data = expensive_function(x, y, z)

# Train EMPR surrogate
surrogate = EMPR(training_data, supports='das', order=3)

# Test approximation quality at different orders
for order in [1, 2, 3]:
    approx = surrogate.reconstruct(order=order)
    
    # Compare on training grid
    error = np.mean((training_data - approx) ** 2)
    print(f"Order {order} approximation MSE: {error:.6e}")
```

### Performance Optimization

Optimize performance for large tensors:

```python
# For large tensors, use appropriate backend
tensor_large = np.random.rand(20, 20, 20)

# Try PyTorch backend for potential GPU acceleration
try:
    set_backend('torch')
    model_torch = EMPR(tensor_large, order=2)
    result_torch = model_torch.reconstruct(order=2)
    print("PyTorch backend successful")
except:
    print("PyTorch not available, using NumPy")
    set_backend('numpy')
    model_cpu = EMPR(tensor_large, order=2)
    result_cpu = model_cpu.reconstruct(order=2)

# For very large tensors, consider lower order approximations
model = EMPR(tensor_large, order=3)
result_fast = model.reconstruct(order=1)  # Faster, main effects only
result_accurate = model.reconstruct(order=3)  # Slower, full interactions
```

---

## 📊 **Component Interpretation**

Understanding HDMR/EMPR components:

| Component | Description | Mathematical Form | Interpretation |
|-----------|-------------|-------------------|----------------|
| `g1`, `g2`, `g3` | Main effects | f(x₁), f(x₂), f(x₃) | Individual variable impacts |
| `g12`, `g13`, `g23` | Two-way interactions | f(x₁,x₂), f(x₁,x₃), f(x₂,x₃) | Pairwise variable interactions |
| `g123` | Three-way interactions | f(x₁,x₂,x₃) | Complex multi-variable interactions |

**Decomposition Formula:**
```
F(x₁,x₂,x₃) ≈ f₀ + g₁(x₁) + g₂(x₂) + g₃(x₃) + g₁₂(x₁,x₂) + g₁₃(x₁,x₃) + g₂₃(x₂,x₃) + g₁₂₃(x₁,x₂,x₃)
```

---

## 🎯 **Best Practices**

### 1. **Backend Selection**
- **NumPy**: Default choice, good for most applications (CPU only)
- **PyTorch**: Better for integration with deep learning workflows (CPU/GPU)
- **TensorFlow**: Good for large-scale distributed computing (CPU/GPU)

### 2. **Support Vector Choice**
- **DAS (Data-Adaptive Supports)**: Usually provides best approximation quality
- **Ones**: Simpler, faster, good for well-behaved functions
- **Custom**: For domain-specific knowledge or special function properties

### 3. **Order Selection**
- **Order 1**: Fast, captures main effects only
- **Order 2**: Good balance of speed and accuracy for most applications
- **Order 3+**: Full accuracy but exponentially more expensive

### 4. **Weight Configuration (HDMR)**
- **Average**: Good default choice for most functions
- **Gaussian**: Better for smooth functions
- **Chebyshev**: Good for polynomial-like functions
- **Custom**: When you have domain knowledge about function behavior

---

## 🔧 **Troubleshooting**

### Common Issues

**1. Import Errors**
```python
# If you get import errors, ensure you're in the project directory
import sys
import os
sys.path.insert(0, os.path.dirname(__file__))
```

**2. Backend Not Available**
```python
from hdmrlib import set_backend

try:
    set_backend('torch')
except ValueError as e:
    print(f"Backend error: {e}")
    set_backend('numpy')  # Fallback to NumPy
```

**3. Memory Issues with Large Tensors**
```python
# For memory issues, reduce decomposition order or tensor size
tensor_large = np.random.rand(50, 50, 50)

# Instead of order=3, use order=2 or order=1
model = EMPR(tensor_large, order=1)
result = model.reconstruct(order=1)  # Uses less memory
```

**4. Poor Approximation Quality**
```python
# Try different support vectors or increase order
model_das = EMPR(tensor, supports='das', order=2)
model_ones = EMPR(tensor, supports='ones', order=2)

result_das = model_das.reconstruct(order=2)
result_ones = model_ones.reconstruct(order=2)

mse_das = np.mean((tensor - result_das) ** 2)
mse_ones = np.mean((tensor - result_ones) ** 2)

print(f"DAS MSE: {mse_das:.6e}, Ones MSE: {mse_ones:.6e}")
```

---

## 📈 **Performance Benchmarks**

Typical performance on various tensor sizes (NumPy backend, order=2):

| Tensor Size | Elements | Time (seconds) | Memory (MB) |
|-------------|----------|----------------|-------------|
| 5×5×5 | 125 | 0.001 | <1 |
| 10×10×10 | 1,000 | 0.01 | ~5 |
| 20×20×20 | 8,000 | 0.1 | ~50 |
| 50×50×50 | 125,000 | 2.0 | ~500 |

*Performance varies with hardware and decomposition order*

---

## 🤝 **Contributing**

We welcome contributions! To contribute:

1. Fork the repository
2. Create a feature branch: `git checkout -b feature-name`
3. Make your changes and add tests
4. Run tests: `python -m pytest tests/`
5. Submit a pull request

---

## 📝 **License**

This project is licensed under the MIT License - see the LICENSE file for details.

---

## 📚 **References**

1. Sobol, I. M. (2001). Global sensitivity indices for nonlinear mathematical models and their Monte Carlo estimates. *Mathematics and Computers in Simulation*, 55(1-3), 271-280.

2. Rabitz, H., & Aliş, Ö. F. (1999). General foundations of high‐dimensional model representations. *Journal of Mathematical Chemistry*, 25(2-3), 197-233.

3. Li, G., Wang, S. W., & Rabitz, H. (2002). Practical approaches to construct RS-HDMR component functions. *The Journal of Physical Chemistry A*, 106(37), 8721-8733.

---

## 📞 **Support**
- 🐛 Issues: [GitHub Issues](https://github.com/hdmrlib/HDMR-Lib/issues)
- 💬 Discussions: [GitHub Discussions](https://github.com/hdmrlib/HDMR-Lib/discussions)

---

**Made with ❤️ for the scientific computing community**

