Learning Bayesian Networks from Correlated Data

Learning Bayesian Networks from Correlated Data Harold Bae1,* , Stefano Monti2, , Monty Montano3, , Martin H. Steinberg2, , Thomas T. Perls2, , and Paola Sebastiani4 1 Oregon

State University, College of Public Health and Human Sciences, Corvallis, 97331, U.S.A. University, Department of Medicine, Boston, 02118, U.S.A. 3 Harvard Medical School, Department of Medicine, Boston, 02115, U.S.A. 4 Boston University, Department of Biostatistics, Boston, 02118, U.S.A. 2 Boston

Supplementary Information

Score BICM BICJ BICY BICC AICM LRTM BICF LRTF AICF

ne 4656 3346 1768 582 4656 4656 4656 4656 4656

1 44 49 64 119 1598 496 83 727 1949

Errors At Each Level 2 3 4 0 0 0 0 0 0 0 0 0 2 0 0 790 294 77 96 12 1 2 0 0 212 40 3 1077 455 152

≥5 0 0 0 0 16 0 0 0 47

Tot Test 10396 10441 10576 11042 23295 14336 10745 16291 25803

Error Rates FPR FWER 0.0042 0.044 0.0047 0.049 0.0061 0.064 0.0110 0.114 0.1191 0.821 0.0422 0.397 0.0079 0.081 0.0603 0.517 0.1426 0.879

Supplementary Table 1. False Positive Rates and Family-wise Error Rates of Different Model Selection Metrics For Continuous Data When h2 = 0.25. BICM : BIC based on integrated likelihood and full sample size; BICJ , BICY , BICC : BIC with Jones’, Young and conservative effective sample size; AICM : AIC based on integrated likelihood and full sample size; LRTM : likelihood ratio test based on integrated likelihood to account for correlated data; BICF , LRTF , and AICF : traditional BIC, likelihood ratio test, and AIC. FPR is the false positive rate defined as number of errors over total number of tests ignoring correlated data; FW ER is family wise error rate, i.e., probability of one or more errors.

Score BICM BICJ BICY BICC AICM LRTM BICF LRTF AICF

ne 4656 2483 1768 582 4656 4656 4656 4656 4656

1 55 64 75 131 1597 528 193 1146 2540

Errors At Each Level 2 3 4 ≥5 2 0 0 0 3 0 0 0 3 0 0 0 10 0 0 0 758 279 86 18 111 16 0 0 16 0 0 0 447 140 23 1 1582 842 359 159

Tot Test 10475 10555 10654 11169 23135 14591 11739 19781 29816

Error Rates FPR FWER 0.0054 0.051 0.0064 0.059 0.0073 0.070 0.0126 0.121 0.1183 0.822 0.045 0.415 0.0178 0.179 0.0888 0.702 0.1839 0.936

Supplementary Table 2. False Positive Rates and Family-wise Error Rates of Different Model Selection Metrics For Continuous Data When h2 = 0.75. BICM : BIC based on integrated likelihood and full sample size; BICJ , BICY , BICC : BIC with Jones’, Young and conservative effective sample size; AICM : AIC based on integrated likelihood and full sample size; LRTM : likelihood ratio test based on integrated likelihood to account for correlated data; BICF , LRTF , and AICF : traditional BIC, likelihood ratio test, and AIC. FPR is the false positive rate defined as number of errors over total number of tests ignoring correlated data; FW ER is family wise error rate, i.e., probability of one or more errors.

2/4

BICM LRTBICM BICJ LRTBICJ BICY LRTBICY BICC LRTBICC

Strong Effect 0.797 0.810 0.815 0.824 0.838 0.835 0.877 0.875

Power Moderate Effect 0.444 0.461 0.464 0.473 0.510 0.506 0.593 0.584

Weak Effect 0.273 0.287 0.290 0.297 0.332 0.329 0.417 0.405

Supplementary Table 3. Power Comparisons of Four Variants of BIC vs. Corresponding LRTM For Continuous Data When h2 = 0.25. Results are based on 1,000 simulated datasets with 3 situations of strong, moderate, and weak covariate effects. BICM : BIC based on integrated likelihood and full sample size; BICJ , BICY , BICC : BIC with Jones’, Young and conservative effective sample size; LRTBICM , LRTBICJ , LRTBICY , and LRTBICC : likelihood ratio test based on integrated likelihood using the significance threshold obtained from empirical false positive rates.

3/4

BICM LRTBICM BICJ LRTBICJ BICY LRTBICY BICC LRTBICC

Strong Effect 0.233 0.270 0.268 0.286 0.284 0.304 0.356 0.374

Power Moderate Effect 0.102 0.124 0.122 0.138 0.138 0.150 0.189 0.192

Weak Effect 0.049 0.067 0.067 0.072 0.071 0.075 0.102 0.106

Supplementary Table 4. Power Comparisons of Four Variants of BIC vs. Corresponding LRTM For Continuous Data When h2 = 0.75. Results are based on 1,000 simulated datasets with 3 situations of strong, moderate, and weak covariate effects. BICM : BIC based on integrated likelihood and full sample size; BICJ , BICY , BICC : BIC with Jones’, Young and conservative effective sample size; LRTBICM , LRTBICJ , LRTBICY , and LRTBICC : likelihood ratio test based on integrated likelihood using the significance threshold obtained from empirical false positive rates.

Score BICM AICM LRTM BICF AICF LRTF

1 77 1661 522 107 1837 625

Errors At Each Level 2 3 4 ≥5 1 0 0 0 821 326 91 24 114 21 2 0 2 0 0 0 956 406 132 39 161 25 2 0

Tot Test 10683 23769 14646 10970 24851 15537

Error Rates FPR FWER 0.0073 0.075 0.1230 0.841 0.0450 0.413 0.0099 0.106 0.1356 0.864 0.0523 0.476

Supplementary Table 5. False Positive Rates and Family-wise Error Rates of Different Model Selection Metrics For Time-to-event Data When h2 = 0.25. BICM : BIC based on integrated likelihood and number of events as the sample size; AICM : AIC based on integrated likelihood and full sample size; LRTM : likelihood ratio test based on integrated likelihood to account for correlated data; BICF , LRTF , and AICF : traditional BIC, likelihood ratio test, and AIC. FPR is the false positive rate defined as number of errors over total number of tests ignoring correlated data; FW ER is family wise error rate, i.e., probability of one or more errors.

Score BICM AICM LRTM BICF AICF LRTF

1 79 1712 553 190 2193 906

Errors At Each Level 2 3 4 4 0 0 864 344 106 124 16 3 20 1 0 1272 597 218 282 65 14

≥5 0 24 0 0 59 2

Tot Test 10707 23858 14910 11698 27519 17727

Error Rates FPR FWER 0.0078 0.075 0.1278 0.833 0.0466 0.434 0.0180 0.171 0.1577 0.909 0.0716 0.614

Supplementary Table 6. False Positive Rates and Family-wise Error Rates of Different Model Selection Metrics For Time-to-event Data When h2 = 0.75. BICM : BIC based on integrated likelihood and number of events as the sample size; AICM : AIC based on integrated likelihood and full sample size; LRTM : likelihood ratio test based on integrated likelihood to account for correlated data; BICF , LRTF , and AICF : traditional BIC, likelihood ratio test, and AIC. FPR is the false positive rate defined as number of errors over total number of tests ignoring correlated data; FW ER is family wise error rate, i.e., probability of one or more errors.

4/4