diff_diff.ImputationDiD#

class diff_diff.ImputationDiD[source]#

Bases: ImputationDiDBootstrapMixin

Borusyak-Jaravel-Spiess (2024) imputation DiD estimator.

This is the efficient estimator for staggered Difference-in-Differences under parallel trends. It produces shorter confidence intervals than Callaway-Sant’Anna (~50% shorter) and Sun-Abraham (2-3.5x shorter) under homogeneous treatment effects.

The estimation procedure: 1. Run OLS on untreated observations to estimate unit + time fixed effects 2. Impute counterfactual Y(0) for treated observations 3. Aggregate imputed treatment effects with researcher-chosen weights

Inference uses the conservative clustered variance estimator from Theorem 3 of the paper.

Parameters:

anticipation (int, default=0) – Number of periods before treatment where effects may occur.
alpha (float, default=0.05) – Significance level for confidence intervals.
cluster (str, optional) – Column name for cluster-robust standard errors. If None, clusters at the unit level by default.
vcov_type (str, default="hc1") – Variance estimator family. Permanently narrow to {"hc1"} per the IF-based variance contract (Theorem 3): analytical-sandwich families {classical, hc2, hc2_bm} and conley are rejected at __init__ with methodology-rooted messages. cluster= invokes per-cluster IF summation; survey_design= invokes TSL on the combined IF. See REGISTRY.md for the cross-estimator IF-vs-sandwich taxonomy.
n_bootstrap (int, default=0) – Number of bootstrap iterations. If 0, uses analytical inference (conservative variance from Theorem 3).
bootstrap_weights (str, default="rademacher") – Type of bootstrap weights: “rademacher”, “mammen”, or “webb”.
seed (int, optional) – Random seed for reproducibility.
rank_deficient_action (str, default="warn") – Action when design matrix is rank-deficient: - “warn”: Issue warning and drop linearly dependent columns - “error”: Raise ValueError - “silent”: Drop columns silently
horizon_max (int, optional) – Maximum event-study horizon. If set, event study effects are only computed for abs(h) <= horizon_max.
aux_partition (str, default="cohort_horizon") – Controls the auxiliary model partition for Theorem 3 variance: - “cohort_horizon”: Groups by cohort x relative time (tightest SEs) - “cohort”: Groups by cohort only (more conservative) - “horizon”: Groups by relative time only (more conservative)
pretrends (bool, default=False) – If True, event study includes pre-treatment horizons for visual pre-trends assessment. Pre-period effects should be ~0 under parallel trends. Only affects event_study aggregation; overall ATT and group aggregation are unchanged.
leave_one_out (bool, default=False) – If True, apply the Borusyak-Jaravel-Spiess (2024) Supplementary Appendix A.9 leave-one-out finite-sample refinement to the conservative variance. The non-LOO auxiliary aggregate tau_tilde_g is built from the fitted tau_hat_it and thus partially overfits to the noise epsilon_it, biasing the variance downward. LOO recomputes each unit’s group aggregate excluding that unit – implemented efficiently by rescaling each treated auxiliary residual by 1 / (1 - v_ig**2 / sum_j v_jg**2) (App. A.9), which is exactly equivalent to the direct leave-one-out at the per-unit cluster sum. Yields a larger, less-downward-biased SE (Prop. A8: unbiased for an upper bound). Default False preserves R didimputation parity; the refinement is an option in the authors’ Stata did_imputation. LOO is undefined for a group with a single positive-weight unit (App. A.9 footnote 51): such groups fall back to the non-LOO residual with a UserWarning. The Prop. A8 direction (LOO >= non-LOO) is guaranteed at the default unit clustering; coarser cluster= / analytical survey_design= / n_bootstrap compositions apply the same rescale but are a library extension beyond the paper’s derivation. Replicate-weight survey designs raise NotImplementedError (their variance bypasses the influence-function path where the rescale lives).

results_#

Estimation results after calling fit().

Type:: ImputationDiDResults

is_fitted_#

Whether the model has been fitted.

Type:: bool

Examples

Basic usage:

>>> from diff_diff import ImputationDiD, generate_staggered_data
>>> data = generate_staggered_data(n_units=200, seed=42)
>>> est = ImputationDiD()
>>> results = est.fit(data, outcome='outcome', unit='unit',
...                   time='time', first_treat='first_treat')
>>> results.print_summary()

With event study:

>>> est = ImputationDiD()
>>> results = est.fit(data, outcome='outcome', unit='unit',
...                   time='time', first_treat='first_treat',
...                   aggregate='event_study')
>>> from diff_diff import plot_event_study
>>> plot_event_study(results)

Notes

The imputation estimator uses ALL untreated observations (never-treated + not-yet-treated periods of eventually-treated units) to estimate the counterfactual model. There is no control_group parameter because this is fundamental to the method’s efficiency.

References

Borusyak, K., Jaravel, X., & Spiess, J. (2024). Revisiting Event-Study Designs: Robust and Efficient Estimation. Review of Economic Studies, 91(6), 3253-3285.

Methods

`__init__`([anticipation, alpha, cluster, ...])
`fit`(data, outcome, unit, time, first_treat)	Fit the imputation DiD estimator.
`get_params`()	Get estimator parameters (sklearn-compatible).
`print_summary`()	Print summary to stdout.
`set_params`(**params)	Set estimator parameters (sklearn-compatible).
`summary`()	Get summary of estimation results.

Attributes

`n_bootstrap`
`bootstrap_weights`
`alpha`
`seed`
`anticipation`
`horizon_max`

__init__(anticipation=0, alpha=0.05, cluster=None, vcov_type='hc1', n_bootstrap=0, bootstrap_weights='rademacher', seed=None, rank_deficient_action='warn', horizon_max=None, aux_partition='cohort_horizon', pretrends=False, leave_one_out=False)[source]#

Parameters:

anticipation (int)
alpha (float)
cluster (str | None)
vcov_type (str)
n_bootstrap (int)
bootstrap_weights (str)
seed (int | None)
rank_deficient_action (str)
horizon_max (int | None)
aux_partition (str)
pretrends (bool)
leave_one_out (bool)

classmethod __new__(*args, **kwargs)#