Simulated multivariate continuous samples from ten cohorts that form three clusters (cohorts 1-3, 4-6, and 7-10), each cluster with a different mean vector. Each cohort has 100 observations, 15 simulated covariates, 1 cohort id indicator, and 1 true cluster (distribution) indicator. Four cohorts each have two completely missing variables: cohort 3 (X3 and X10), cohort 5 (X4 and X12), cohort 7 (X9 and X11), and cohort 9 (X3 and X5).

data(cohort_na_df)

Format

An object of class data.frame with 1000 rows and 17 columns:

X1-X15

numeric, simulated continuous variables from multivariate joint distributions

cohortid

numeric, cohort id

distribution

numeric, true cluster (distribution) indicator
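
Example

The documented missingness pattern can be checked with a few lines of base R. The sketch below is illustrative only: it does not load the packaged data but builds a stand-in data frame, here called sim_df (a hypothetical name), with the same shape and the same blocks of missing values. The mean shifts and the 1/2/3 coding of distribution are assumptions made for the example, not the values used to generate cohort_na_df.

```r
# Stand-in with the documented shape (1000 rows, 17 columns).
# NOT the packaged data: means and cluster coding are assumed for illustration.
set.seed(1)
cohortid <- rep(1:10, each = 100)
distribution <- c(1, 1, 1, 2, 2, 2, 3, 3, 3, 3)[cohortid]
X <- matrix(rnorm(1000 * 15, mean = distribution), nrow = 1000, ncol = 15,
            dimnames = list(NULL, paste0("X", 1:15)))
sim_df <- data.frame(X, cohortid = cohortid, distribution = distribution)

# Blank out the variables documented as completely missing in four cohorts.
missing_map <- list(`3` = c("X3", "X10"), `5` = c("X4", "X12"),
                    `7` = c("X9", "X11"), `9` = c("X3", "X5"))
for (id in names(missing_map)) {
  sim_df[sim_df$cohortid == as.integer(id), missing_map[[id]]] <- NA
}

# For each cohort, list the covariates that have no observed values at all.
x_cols <- paste0("X", 1:15)
all_missing <- lapply(split(sim_df[x_cols], sim_df$cohortid),
                      function(d) names(d)[colSums(!is.na(d)) == 0])
all_missing <- all_missing[lengths(all_missing) > 0]
all_missing
```

With the package attached and data(cohort_na_df) called, the same check should apply by using cohort_na_df in place of sim_df in the last block.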