Close
This site uses cookies

By using this site, you consent to our use of cookies. You can view our terms and conditions for more information.

Type I error rates for ANOVA, GLM, and GLMM for binary outcomes when simulating from binomial logistic model and Ratcliff diffusion model

Authors
Ms. Yiming Wang
University College London
Ms. Ruslana Tymchyk
Dr. Henrik Singmann
University College London ~ Experimental Psychology
Abstract

Logistic regression models are often recommended for analysing binary response variables, such as accuracy, that commonly arise in psychological research designs. One of the main reasons for this recommendation are simulation studies showing that binomial logistic models outperform ordinary ANOVA models when simulating from a binomial logistic model. However, such a simulation setup is at risk of circularity as the logistic model is both the data generating and winning candidate model. To overcome this limitation, we compared different candidate models when simulating from two data generating models – a binomial logistic model and the Ratcliff diffusion model. For each simulation study, we simulated a two-group between-participants design with 30 participants per condition. We also varied the number of observations per simulated participant, either 1 observation or 100 observations. We then compared the type I error rates (i.e., the proportion of false positive errors) from three popular candidate methods, linear regression (ANOVA), generalised binomial logistic regression (GLM), and generalised binomial logistic mixed models (GLMM). Our results suggested that ANOVA shows the best performance in terms of type I errors across different simulation setups and data generated by both logistic and diffusion models. For the GLM, the type I error rate was around 0.05 only for 1 observation per participant and severely anti-conservative (i.e., too high type I error) for 100 observations. GLMM yielded an acceptable type I error rate with 100 observations per participant but varied amounts of type I errors dependent on the data generation models with 1 observation per participant. When simulating from the logistic model GLMM produced acceptable type I error rates but too high type I error rates when simulating from the diffusion model. Additionally, the type I errors from GLMM with 1 observation per participant increased as the overall performance level approached the boundary of the parameter space. Overall, our results suggest that in terms of type I error rates, ANOVA generally perform better than logistic models in most cases and the performance of logistic models depend exactly on the simulation setup.

Tags

Keywords

Type I error
binomial logistic model
Ratcliff diffusion model
linear regression
generalised binomial logistic regression
Discussion
New

There is nothing here yet. Be the first to create a thread.

Cite this as:

Wang, Y., Tymchyk, R., & Singmann, H. (2023, July). Type I error rates for ANOVA, GLM, and GLMM for binary outcomes when simulating from binomial logistic model and Ratcliff diffusion model. Abstract published at MathPsych/ICCM/EMPG 2023. Via mathpsych.org/presentation/1045.