Simulated data with uncontrolled confounding, exposure misclassification, and selection bias
Source:R/data_uc_em_sel.R
df_uc_em_sel.Rd
Data containing three sources of bias, three known confounders, and
100,000 observations. This data is obtained by sampling with replacement
with probability = S from df_uc_em_sel_source
then removing
the columns X, U, and S. The resulting data corresponds
to what a researcher would see in the real-world: a misclassified exposure,
Xstar; missing data on a confounder U; and missing data for
those not selected into the study (S=0). As seen in
df_uc_em_sel_source
, the true, unbiased exposure-outcome
odds ratio = 2.