PUBH7485

PUBH 7485 - Methods for Causal Inference (3 Cr.)

School of Public Health - Adm (11162) TPUB - School of Public Health

Course description

Although most of statistical inference focuses on associational relationships among variables, in many biomedical and health sciences contexts the focus is on establishing the causal effect of an intervention or treatment. Drawing causal conclusions can be challenging, particularly with observational data, because treatment assignment may be confounded. The first part of this course focuses on methods to establish the causal effect of a point exposure, i.e., situations in which treatment is given at a single point in time. Methods to estimate causal treatment effects will include outcome regression, propensity score methods (e.g., inverse probability weighting, matching), and doubly robust approaches. The second part of the course focuses on estimating the effect of a series of treatment decisions during the course of a chronic disease such as cancer, substance abuse, or a mental health disorder. Methods to estimate the effects of these time-varying treatments include marginal structural models estimated by inverse probability weighting, structural nested models estimated by G-estimation, and the (parametric) G-computation algorithm. We will then turn our attention to estimating the optimal treatment sequence for a given subject, i.e., how to determine "the right treatment, for the right patient, at the right time," using dynamic marginal structural models and methods derived from reinforcement learning (e.g., Q-learning, A-learning) and classification problems (e.g., outcome weighted learning, C-learning). PubH 8485 is appropriate for PhD students in Biostatistics and Statistics. The homework and projects will focus more on the theoretical aspects of the methods to prepare students for methodological research in this area. PubH 7485 is appropriate for master's students in Biostatistics and PhD students in other fields who wish to learn causal methods and apply them to topics in the health sciences.
This course uses R, a freely available statistical software package, to implement many of the methods we discuss. However, most of these methods can be implemented in any statistical software (e.g., SAS, Stata, SPSS), and students are free to use any software for homework assignments.
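
As a rough illustration of why confounded treatment assignment matters for a point exposure, the sketch below compares a naive difference in means with an inverse-probability-weighted estimate. It is not course material: the toy dataset, the outcome rule Y = 2A + 3X, the function names, and the choice of Python (rather than R) are all assumptions made for this example.

```python
from collections import defaultdict


def make_toy_data():
    """Build a hypothetical dataset of (x, a, y) records.

    x is a binary confounder, a is a binary treatment, y is the outcome.
    Treatment is more likely when x = 1 (30 of 40 treated) than when
    x = 0 (10 of 40 treated), and y = 2*a + 3*x, so the assumed true
    average treatment effect is 2.
    """
    rows = []
    for x, n_treated, n_control in [(0, 10, 30), (1, 30, 10)]:
        for a, count in [(1, n_treated), (0, n_control)]:
            rows.extend((x, a, 2.0 * a + 3.0 * x) for _ in range(count))
    return rows


def estimate_propensity(rows):
    """Estimate P(A = 1 | X = x) as the treated fraction within each x."""
    treated = defaultdict(int)
    total = defaultdict(int)
    for x, a, _ in rows:
        treated[x] += a
        total[x] += 1
    return {x: treated[x] / total[x] for x in total}


def naive_difference(rows):
    """Unadjusted difference in mean outcome, treated minus untreated."""
    y1 = [y for _, a, y in rows if a == 1]
    y0 = [y for _, a, y in rows if a == 0]
    return sum(y1) / len(y1) - sum(y0) / len(y0)


def ipw_ate(rows):
    """Inverse-probability-weighted estimate of the average treatment effect."""
    e = estimate_propensity(rows)
    n = len(rows)
    mean_y1 = sum(a * y / e[x] for x, a, y in rows) / n
    mean_y0 = sum((1 - a) * y / (1 - e[x]) for x, a, y in rows) / n
    return mean_y1 - mean_y0


data = make_toy_data()
print(f"naive difference: {naive_difference(data):.2f}")  # 3.50
print(f"IPW estimate:     {ipw_ate(data):.2f}")           # 2.00
```

In this constructed example the naive comparison overstates the effect because treated subjects are more likely to have x = 1, while weighting each subject by the inverse of the estimated probability of the treatment actually received recovers the assumed effect of 2. The sketch relies on x being the only confounder, which real observational data rarely guarantee.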

prereq: Background in regression (e.g., linear and logistic models) at the level of PubH 7405-7406, PubH 6450-6451, PubH 7402, or equivalent. Background in statistical theory (Stat 5101-5102 or PubH 7401) is helpful.

Minimum credits

3

Maximum credits

3

Is this course repeatable?

No

Grading basis

OPT - Student Option

Lecture

Fulfills the writing intensive requirement?

No

Typically offered term(s)

Every Fall