Metadata-Version: 2.4
Name: fd-cate
Version: 0.1.1
Summary: Front-door CATE/ATE estimation toolkit with debiased learners.
Author: Yonghan Jung
Keywords: causal-inference,front-door,cate,ate,heterogeneous-treatment-effect
Requires-Python: >=3.9
Description-Content-Type: text/markdown
Requires-Dist: numpy
Requires-Dist: pandas
Requires-Dist: scikit-learn
Requires-Dist: joblib
Requires-Dist: xgboost
Provides-Extra: nn
Provides-Extra: docs
Requires-Dist: mkdocs-material; extra == "docs"
Requires-Dist: mkdocstrings[python]; extra == "docs"
Provides-Extra: viz
Requires-Dist: matplotlib; extra == "viz"
Requires-Dist: shap; extra == "viz"
Provides-Extra: dev
Requires-Dist: pytest; extra == "dev"
Requires-Dist: ruff; extra == "dev"
Requires-Dist: pre-commit; extra == "dev"
Requires-Dist: build; extra == "dev"
Requires-Dist: twine; extra == "dev"

# FD-CATE

Front-door CATE/ATE estimation toolkit with paper-parity defaults for debiased front-door learners.

This repository keeps the original research scripts (`FDCATE.py`, `analyze_fars_2000_fd.py`) and adds a standard-library interface (`fd_cate`) with a stable artifact contract.

## Install

```bash
python -m pip install -U pip
python -m pip install fd-cate
```

Default learner is `xgb` (XGBoost). `nn` is also supported via `nuisance_learner="nn"`.

## One-Click Quickstart

```bash
fdcate demo --outdir ./fdcate-demo
```

This single command runs:
- synthetic data generation
- model fit + artifact contract write
- optional quick benchmark (enabled by default)

Expected files:
- `./fdcate-demo/synthetic.csv`
- `./fdcate-demo/fit_out/summary.txt`
- `./fdcate-demo/fit_out/results.json`
- `./fdcate-demo/fit_out/diagnostics.json`
- `./fdcate-demo/fit_out/effects.csv`
- `./fdcate-demo/fit_out/model.pkl`
- `./fdcate-demo/benchmark_quick.json` (unless `--run-benchmark false`)

## Quickstart (Python API)

```python
from fd_cate import FDCATE
from FDCATE import simulate_fd_data_md

# synthetic example
D = simulate_fd_data_md(n=500, d=10, seed=0)

est = FDCATE(method="fd-dr", nuisance_learner="xgb", random_state=0)
est.fit(D.C, D.Y, t=D.X, m=D.Z)

tau = est.effect(D.C)
print(est.ate_)
print(est.summary())
```

## Quickstart (CLI)

```bash
# generate synthetic csv
fdcate synthetic --n 300 --d 8 --seed 42 --out synthetic.csv

# fit + write standard artifacts
fdcate fit \
  --data synthetic.csv \
  --outcome y --treat t --med m \
  --outdir out/

# diagnostics only
fdcate doctor \
  --data synthetic.csv \
  --outcome y --treat t --med m
```

Standard artifacts under `out/`:
- `summary.txt`
- `results.json`
- `diagnostics.json`
- `effects.csv`
- `model.pkl`

## Benchmark (Quick Profile + Golden Regression)

`fd-cate` now includes a deterministic quick benchmark profile for regression checks.

```bash
fdcate benchmark --n 120 --d 6 --seed 2026 --nuisance-learner xgb --out results/benchmark_quick.json
```

Multi-seed profile (recommended for robust comparisons):

```bash
fdcate benchmark \
  --profile multiseed \
  --n 120 --d 6 --seed 2026 --n-seeds 20 \
  --nuisance-learner xgb \
  --fd-r-g-solver direct \
  --fd-r-b-learner xgb \
  --out results/benchmark_multiseed.json
```

Output schema (`fdcate.benchmark`, `schema_version=0`) contains:
- `clean` RMSE for `fd-pi`, `fd-dr`, `fd-r`
- `weak-overlap` RMSE for `fd-pi`, `fd-dr`, `fd-r`
- `aggregate_mean_rmse` across the two scenarios
- with `--profile multiseed`: `per_seed` results + summary statistics (`mean/std/min/max`)

FD-R benchmarking knobs:
- `--fd-r-g-solver`: `direct` or `ratio`
- `--fd-r-b-learner`: `xgb` or `nn`
- `--no-fd-r-swap-average`: disable swapped D1/D2 averaging

CI also runs a golden snapshot regression test:
- `tests/test_benchmark_golden.py`
- golden reference file: `tests/benchmark_quick_reference.json`

## Live Demo (Toy + Benchmark)

Primary path (CLI one-click):

```bash
fdcate demo --outdir /tmp/fdcate_live_demo
```

Secondary path (legacy helper script):

```bash
bash scripts/run_demo_quick.sh
```

The demo writes:
- `/tmp/fdcate_live_demo/fit_out/summary.txt`
- `/tmp/fdcate_live_demo/fit_out/results.json`
- `/tmp/fdcate_live_demo/fit_out/diagnostics.json`
- `/tmp/fdcate_live_demo/fit_out/effects.csv`
- `/tmp/fdcate_live_demo/fit_out/model.pkl`
- `/tmp/fdcate_live_demo/benchmark_quick.json`

Manual one-liners:

```bash
fdcate synthetic --n 120 --d 6 --seed 2026 --out /tmp/fdcate_live_demo/synthetic.csv
fdcate fit --data /tmp/fdcate_live_demo/synthetic.csv --outcome y --treat t --med m --method fd-dr --nuisance-learner xgb --outdir /tmp/fdcate_live_demo/fit_out
fdcate benchmark --n 60 --d 4 --seed 17 --nuisance-learner xgb --out /tmp/fdcate_live_demo/benchmark_quick.json
```

Example terminal output preview (`fdcate demo --outdir /tmp/fdcate_live_demo`):

```text
[demo] output directory: /tmp/fdcate_live_demo
[demo] ATE=0.540874
[demo] generated files:
 - /tmp/fdcate_live_demo/synthetic.csv
 - /tmp/fdcate_live_demo/fit_out/summary.txt
 - /tmp/fdcate_live_demo/fit_out/results.json
 - /tmp/fdcate_live_demo/fit_out/diagnostics.json
 - /tmp/fdcate_live_demo/fit_out/effects.csv
 - /tmp/fdcate_live_demo/fit_out/model.pkl
 - /tmp/fdcate_live_demo/benchmark_quick.json
[demo] next: fdcate effect --model /tmp/fdcate_live_demo/fit_out/model.pkl --data /tmp/fdcate_live_demo/synthetic.csv --out /tmp/fdcate_live_demo/effects_from_model.csv
```

Final benchmark figures (FD-R full-noise setting):

![FD-CATE n-sweep at rho=2, d=30 (FD-R full-noise)](https://raw.githubusercontent.com/yonghanjung/FD-CATE/main/fdcate_nsweep_rho2_d30_fullnoise_plot.png)

![FD-CATE rho-sweep at n=2000, d=30 (FD-R full-noise)](https://raw.githubusercontent.com/yonghanjung/FD-CATE/main/fdcate_rhosweep_n2000_d30_fullnoise_plot.png)

## Model Compatibility Policy (`model.pkl`)

`model.pkl` loading is allowed only when **major.minor** package versions match.

- Example: model saved with `0.1.x` can be loaded by `0.1.y`.
- Example: model saved with `0.1.x` cannot be loaded by `0.2.x`.

## Scope (v0.1)

Supported:
- binary treatment `T ∈ {0,1}`
- binary mediator `M ∈ {0,1}`
- numeric covariates
- continuous or binary outcome (regression handling)

Not supported:
- non-binary `T`/`M`
- automatic categorical encoding pipelines

## Legacy Reproduction Scripts

The original paper-focused scripts are preserved:
- `python FDCATE.py --help`
- `python analyze_fars_2000_fd.py --help`

## Development

```bash
python -m pip install -e .[dev]
python -m pytest -q
python -m build
```

Nightly/manual slow tests are separated from PR fast gates:

```bash
python -m pytest -q -m "slow"
```

## Release (v0.1.0)

```bash
bash scripts/release_preflight.sh
```

Detailed checklist: [`RELEASE_RUNBOOK.md`](RELEASE_RUNBOOK.md)

## Troubleshooting

1. `fdcate: command not found`
- Re-open your shell after installation, or run with module form:
  - `python -m fd_cate --help`

2. XGBoost import/runtime issue
- Reinstall in a clean environment:
  - `python -m pip install -U pip`
  - `python -m pip install --force-reinstall fd-cate`

3. Permission or write-path errors
- Use a writable output directory explicitly:
  - `fdcate demo --outdir /tmp/fdcate-demo`
