Methods: Here we present the results of several computer simulations assessing the adequacy of psychometrically harmonized scores obtained from moderated nonlinear factor analysis (MNLFA) models. Our general strategy is to (1) simulate data representing multiple samples arising from independent substance use studies which also differ from one another in measurement and in terms of relevant demographic characteristics; (2) combine these samples into one large dataset; (3) apply MNLFA to obtain scores which adjust for non-construct-related differences in measurement between persons (e.g., differences due to between-study variations in measurement); and (4) assess the accuracy of these scores.
Results: MNLFA yields scores that are highly accurate, as indexed by their correlation with population values, as long as all relevant covariates (i.e., demographic characteristics) are included in the scoring model. Additionally, the use of MNLFA scores are as predictors in subsequent regression models yields unbiased effect estimates. By contrast, bias may be severe for scores arising from latent variable methods which do not take into account between-study differences in measurement.
Conclusions: Differences between studies in measurement are critical to take into account when analyzing data which has been pooled across studies. Latent variable measurement models like MNLFA offer the opportunity to construct psychometrically harmonized scores for IDA that adjust for differences between studies due to subtle variations in measurement. These scores perform well in simulation studies but additional research is needed to confirm their advantages with real data.