Labs

Welcome to our Labs!

Causal Identification in DAGs using Backdoor and Swigs, Equivalence Classes, Falsifiability Tests

List all conditional independence in a DAG; these are obtained by using the graphical d- separation criterion. We then go ahead and test those restrictions assuming a linear ASEM structure. The note- book also illustrates the analysis from the next chapter.

Dosearch for Causal Identification in DAGs.

This a simple notebook for teaching that illustrates capabilites of the "dosearch" package, which is a great tool. NB. In my experience, the commands are sensitive to syntax ( e.g. spacing when -> are used), so be careful when changing to other examples.

Machine Learning Estimators for Wage Prediction

We illustrate how to predict an outcome variable 𝑌 in a high-dimensional setting, where the number of covariates 𝑝 is large in relation to the sample size 𝑛 . So far we have used linear prediction rules, e.g. Lasso regression, for estimation. Now, we also consider nonlinear prediction rules including tree-based methods.

Functional Approximations by Trees and Neural Networks

Illustrate the flexibility of these methods in approximating the function exp(4G).

The Effect of Gun Ownership on Gun-Homicide Rates

Provide anapplication of DML inference to learn predictive/causal effects of gun ownership on homicide rates across U.S. counties

Dagitty in the Analysis of Impact of 401(k) on Net Financial Wealth

Analyze graph structures that enable identification of the causal effect of 401(K) eligibility on net financial wealth.

Inference on Predictive and Causal Effects in High-Dimensional Nonlinear Models

Provide application of DML inference to learn predictive/causal effects of 401(K) eligibility on net financial wealth. (Note: The results produced in this notebook and provided in the text are slightly different than those in the original paper. The replication files are given at the following Github repository. The difference is due to our use of a single split of the sample in producing the results for this text while the baseline results are based on a method that aggregates results across multiple data splits.)

Double/Debiased Machine Learning for the Partially Linear Regression Model

This is a simple implementation of Debiased Machine Learning for the Partially Linear Regression Model, which provides an application of DML inference to determine the causal effect of countries' intitial wealth on the rate of economic growth.

Variational Autoencoders

In this notebook, we'll introduce and explore "variational autoencoders," which are a very successful family of models in modern deep learning. In particular we will: Illustrate the connection between autoencoders and classical Principal Component Analysis (PCA) Train a non-linear variational auto-encoder that uses a deep neural network

DoubleML and Feature Engineering with BERT

Provides an introduction to text embeddings via BERT and provides an application to predicting demand for toys.

Sensitivity Analysis for Unobserved Confounder with DML and Sensmakr

Analyses the sensitivity of the DML estimate in the Darfur wars example to unobserved confounders

Negative (Proxy) Controls for Unobserved Confounding

Provides an application of using proxy controls to estimate the effect of smoking on birth weight.

Inference on Predictive and Causal Effects in High-Dimensional Nonlinear Models

Estimate the Local Average Treatment Effects of 401(K) participation on net financial wealth.

Weak IV Experiments

A Simple Example of Properties of IV estimator when Instruments are Weak

Debiased ML for Partially Linear IV Model

Provides DML analysis of the impact of institutions on a country’s wealth following AER. The notebook first pro- ceeds with the analysis assuming strong identification. It then notes the weak instrument problem and performs DML analysis that is robust to weak identification.

CATE Inference

analyzes CATE of welfare experiment and for 401k experiment with Best Linear Predictors of CATE and with Random Forest and Causal Forest based methods.

CATE Inference

Analyzes Best Predictors of CATE for 401(K) conditional on income.

CATE Estimation

Analyzes CATE of welfare experiment and for 401k experiment with forests and other methods.

Regression Discontinuity

This notebook illustrates the use of Regression Discontinuity in an empirical study. We analyze the effect of the antipoverty program Progresa/Opportunidades on the consumption behavior of families in Mexico in the early 2000s.

Minimum Wage Example Notebook with DiD

This notebook implements Difference-in-Differences in an application on the effect of minimum wage changes on teen employment. We use data from Callaway (2022). The data are annual county level data from the United States covering 2001 to 2007. The outcome variable is log county-level teen employment, and the treatment variable is an indicator for whether the county has a minimum wage above the federal minimum wage.

Welcome to our Labs!

OLS and Lasso for wage prediction

The Gender Wage Gap

Exercise on Overfitting

VaccinationRCT

Covariates in RCT

Reemployment Bonus RCT

Penalized Linear Regressions: A Simulation Experiment

Case Study using Wage Data from 2015

Simulation on Orthogonal Estimation

Comparing orthogonal (partialling-out) with non-orthogonal learning.

Testing the Convergence Hypothesis

Heterogeneous Effect of Sex on Wage Using Double Lasso

Collider Bias

Causal Identification in DAGs using Backdoor and Swigs, Equivalence Classes, Falsifiability Tests

Dosearch for Causal Identification in DAGs.

Machine Learning Estimators for Wage Prediction

Functional Approximations by Trees and Neural Networks

The Effect of Gun Ownership on Gun-Homicide Rates

Dagitty in the Analysis of Impact of 401(k) on Net Financial Wealth

Inference on Predictive and Causal Effects in High-Dimensional Nonlinear Models

Double/Debiased Machine Learning for the Partially Linear Regression Model

Variational Autoencoders

DoubleML and Feature Engineering with BERT

Sensitivity Analysis for Unobserved Confounder with DML and Sensmakr

Negative (Proxy) Controls for Unobserved Confounding

Inference on Predictive and Causal Effects in High-Dimensional Nonlinear Models

Weak IV Experiments

Debiased ML for Partially Linear IV Model

CATE Inference

CATE Inference

CATE Estimation

Regression Discontinuity

Minimum Wage Example Notebook with DiD