In American Journal of Epidemiology; h5-index 65.0
"Heterogeneous treatment effects" is a term which refers to conditional average treatment effects (i.e., CATEs) that vary across population subgroups. Epidemiologists are often interested in estimating such effects because they can help detect populations who may particularly benefit from or be harmed by a treatment. However, standard regression approaches for estimating heterogeneous effects are limited by pre-existing hypotheses, test a single effect modifier at a time, and are subject to the multiple comparisons problem. The objective of this text is to offer a practical guide to honest causal forests, an ensemble tree-based learning method which can discover as well as estimate heterogeneous treatment effects using a data-driven approach. We discuss the fundamentals of tree-based methods, describe how honest causal forests can identify and estimate heterogeneous effects, and demonstrate an implementation of this method using simulated data. Our implementation highlights the steps required to simulate datasets, build honest causal forests, and assess model performance across a variety of simulation scenarios. Overall, this paper is intended for epidemiologists and other population health researchers who lack an extensive background in machine learning yet are interested in utilizing an emerging method for identifying and estimating heterogeneous treatment effects.
Jawadekar Neal, Kezios Katrina, Odden Michelle C, Stingone Jeanette A, Calonico Sebastian, Rudolph Kara, Al Hazzouri Adina Zeki
2023-Feb-24
data science, effect modifier, epidemiology, machine learning, precision medicine