Caspar J. Van Lissa

Utrecht University | Open Science Community Utrecht
Tilburg University
Funded by NWO Veni grant VI.Veni.191G.090

Conceptual introduction

Central thesis

Machine learning can help advance theory formation in developmental psychology

  • Replication crisis (Scheel 2022; Lavelle 2021)
  • Solutions improve deductive (theory-testing) research practices
    • questionable research practices (QRPs)
    • replication
    • preregistration

Many preregistered hypotheses are not supported (Scheel, Schijen, and Lakens 2021)

  • Lack of “good theories” another explanation for replication crisis

Defining theory

Theory: A model that describes the nature of relations between several phenomena in sufficient detail that it allows for the derivation of quantifiable hypotheses that can be subject to (severe) testing.

  • most psychological theories fall short of this definition
    • insufficiently precise
    • too flexible
  • Compare to Walasek, Frankenhuis, and Panchanathan (2022)

Why should statisticians care about theory?

  • It dictates what models are sensible
  • Useful as an instrument of cumulative knowledge acquisition
  • Communicating social scientific research to its consumers

Deduction vs induction

  • Lack of theory cannot be overcome by improving deductive research
  • Theory formation requires inductive (exploratory) research (Creswell and Clark 2017)

Empirical cycle

De Groot’s empirical cycle: a model of knowledge production through scientific research (de Groot 1961)

Rigorous exploration

How can we improve exploratory research?

Role of flexibility:

  • Confirmatory methods poorly suited to exploratory research
    • p-values: Type I error is 5%, but false discovery rate (FDR) is 14-50% (Vidgen and Yasseri 2016).
    • Model fit indices: manually specify models, no guarantee that best model is included

“unguided exploration” is effortful and inflates the risk of spurious results

\[ FDR = \frac{FP}{TP + FP} \]

Machine learning

Machine learning:

  • automated model building
  • checks and balances to prevent overfitting
  • maximize predictive performance and generalizability of results

Psychology can benefit from its superior predictive performance (Yarkoni and Westfall 2017)

  • E.g., personalized (mental) health care, automated assessment aids

NEW: Implications for the theory crisis have not yet been discussed

Unguided vs rigorous exploration

Phenomena detection

First step in Theory Construction Methods is identifying relevant phenomena (Borsboom et al. 2020)

  • Phenomena: stable and general features of the world
  • Few quantitative methods for phenomena detection (aside from expert opinion)

Text mining may be suitable for phenomena detection (van Lissa 2021)

  • Unsupervised learning method
  • Clusters in keywords and abstracts

Holistic approach

Every study examines only a piece of the puzzle; we never see the complete picture

Machine learning accommodates more predictors than classical methods

  • Regularization
  • Variable selection

Including all relevant predictors is important:

  • Assumption for causal interpretation
    • Multicollinearity
  • Good theory incorporates most important causes
    • Cannot assess the relative utility across studies
  • Include potential predictors from various theories, and undertheorized factors

Complex effects

Many machine learning methods accommodate:

  • non-linear effects
  • higher-order interactions, without having to specify the nature of these effects a-priori.
  • Automatic: tree-based methods
  • Manual: penalized methods

Psychological theories rarely account for complex effects

  • Machine learning can provide nuance by revealing them

Theoretical elements

Some machine learning methods incorporate theoretical elements

  • E.g.: assumption that development follows a latent growth curve (LGC).
  • SEM forests (Brandmaier et al. 2016)
  • regularized SEM (Jacobucci, Grimm, and McArdle 2016)

When the theory is a (nomological) network:

  • LASSO-penalized Gaussian graphical model (GGM) (Epskamp, Rhemtulla, and Borsboom 2017)
  • E.g.: network theory of major depression (Cramer et al. 2016)

All of these methods allow for theory guided exploration using machine learning

Person-centered approaches

Explain heterogeneity at a more fine-grained level than the whole sample

  • E.g., latent class analyses
    • “Which individuals are similar?”
    • Other unsupervised learning methods exist
  • RI-CLPM and DSEM
    • Rarely explain heterogeneity in within-person effects
    • Regularization
  • Tree-based models
    • Group individuals based on predictors to maximize homogeneity of the outcome
    • “Why are these individuals similar?”

Generalizability

When using data models to guide theory formation, generalizability is essential

Checks and balances to curtail overfitting

  • cross-validation to estimate predictive accuracy

From models to theory

Naive interpretative approach

  • Variable importance metrics
    • congruent with theoretical assumptions about important predictors?
    • any theoretically important predictors rank low?
    • any undertheorized factors rank high?
  • Marginal associations
    • non-linear effects?
    • high importance but flat marginal association?

Comparing predictive performance of simpler parametric model that represents these ‘insights’ to machine learning model

From models to theory 2

Data models -> formal theory (Haslbeck et al. 2021)

  • abductive formal theory construction (AFTC) framework
    • construct formal theory, generate data, mathematically compare to empirical data
    • Discrepancies: amend formal theory

Summary

  • There is a paucity of good theory
  • Need for exploratory research for theory formation
  • Machine learning for rigorous exploration
    • automates model building
    • incorporates checks and balances for generalizable results
  • Unsupervised learning can assist in phenomenon detection
  • Supervised learning to identify important predictors
  • Some algorithms incorporate basic theoretical elements

Applied examples

Emotion regulation in adolescence

Developmentally sensitive period (Zimmermann and Iwanski 2014)

20% develop psychopathology (Lee et al. 2014)

Potentially lifelong implications for mental health and well-being

Substantial empirical research, but no overarching theoretical framework (Buss, Cole, and Zhou 2019)

Towards integrative theory

First step: identifying relevant phenomena (Borsboom et al. 2020)

We conducted a text mining systematic review (TMSR) (van Lissa 2021)

  • Narrative reviews: small samples, confirmation bias, emphasize positive results (Littell 2008)
  • TMSR: Unlimited sample size, transparent, objective, reproducible

6653 papers on Addresses emotion regulation in population overlapping with adolescence [10-24]

<doi.org/10.1007/s40894-021-00160-7>

Open science

Baseline network

Phenomena relevant to adolescents' emotion regulation according to theory (a) and narrative reviews (b; transparent nodes indicate constructs also present in the theory).

Phenomena relevant to adolescents’ emotion regulation according to theory (a) and narrative reviews (b; transparent nodes indicate constructs also present in the theory).

  1. Theory (b) narrative reviews; transparent nodes indicate constructs also present in theory.

TMSR results

Author keywords (a) and abstracts (b)

Interpretation

  • Both analyses reflected some constructs from theoretical literature
    • Particularly related to neurodevelopment and socialization
  • Mental health-related outcomes feature prominently
    • Emotion regulation implicated in mental health problems (Lee et al. 2014),
    • This underlines the importance of research in this area
  • Substantial correspondence between keywords and abstracts, suggesting validity of method
  • Networks are sparse; few connections among constructs: fragmented literature

Undertheorized themes

  • Developmental disorders
  • Physical health
  • External stressors
  • Structural disadvantage
  • Addictive behavior
  • Identity and moral development
  • Sexual development

Next step: Identifying individual risk factors

  • Predictors of latent growth model?
    • Only linear differences, few predictors (power)
  • Multigroup latent growth model?
    • Non-linear differences, but only one moderator
  • Latent class growth model with auxilliary variables?
    • Groups with regard to trajectory, not predictors

Solution: Machine learning

  • Allows non-linear differences between trajectories
  • Performs variable selection
    • Cast wide net among potentially relevant predictors
  • Checks & balances ensure generalizability
  • Exploratory: No hypothesis

Method illustrated

Method

SEM-forest (Brandmaier et al. 2016)

  1. Bootstrap sample
  2. On each sample, estimate a SEM-model
  3. Consider \(k\) candidate predictors
  4. Identify predictor and value that maximizes \(LR\) of post-split multi-group model
  5. Average predictions across bootstrap samples

We used 1000 bootstrap samples, \(k = 9\)

Present study

RQ1: What are the most important predictors of adolescents’ trajectories of emotion regulation development?

RQ2: What is the nature of the association of the predictors with the trajectories?

Participants

  • RADAR data (https://www.uu.nl/en/research/radar)
  • 497 Dutch adolescents (283 boys; age at T1: M =13.03, SD = 0.46)
  • Collected between 2006-2011
  • Most families were medium- to high-SES (10% low-SES)
  • 87 candidate predictors
  • DV: quadratic growth model of Difficulties in emotion regulation (Gratz and Roemer 2004)

Open science

Results

Variable importance

Marginal association

Interpretation

Important predictors:

  • Personality and related constructs
  • Best parental predictor is autonomy support (bal. rel.)
    • Negative parenting more predictive than positive parenting
  • Conflict frequency and behavior
  • Empathy, particularly personal distress

Less important than expected:

  • SES
  • Bullying/victimization
  • Delinquency
  • Substance use
  • Monitoring

Reflection content

  • Most important predictors are routinely assessed (big 5) or overt (conflict)
    • Prime candidates for early risk assessment
    • Conflict resolution behavior can be taught
      • Target for intervention if association is causal
  • Emphasis on parenting (in literature and theory) might not be justified
    • Congruent with RI-CLPM showing few parenting effects in adolescence (Van Lissa et al. 2019)
  • Order of predictors mirrors bioecological model

Reflections form

  • Many predictors show non-linear effects: Emotion dysregulation only for +/-1SD
  • Some predictors show almost no marginal effect (e.g., father’s age, drug use)
    • This suggests they might be important in interactions

Conclusions

  • Proximal factors more predictive of ER development than distal predictors
  • Parenting may be less relevant than previously thought
  • Personality and conflict behavior are candidates for early diagnosis
  • Conflict skills training might be avenue for intervention

References

Borsboom, Denny, Jonas Dalege, Rogier A Kievit, and Brian D Haig. 2020. “Theory Construction Methodology: A Practical Framework for Building Theories in Psychology.” PsyArxiv. https://doi.org/10.31234/osf.io/w5tp8.

Brandmaier, Andreas M., John J. Prindle, John J. McArdle, and Ulman Lindenberger. 2016. “Theory-Guided Exploration with Structural Equation Model Forests.” Psychological Methods 21 (4): 566–82. https://doi.org/10.1037/met0000090.

Buss, Kristin A., Pamela M. Cole, and Anna M. Zhou. 2019. “Theories of Emotional Development: Where Have We Been and Where Are We Now?” In Handbook of Emotional Development, edited by Vanessa LoBue, Koraly Pérez-Edgar, and Kristin A. Buss, 7–25. Cham: Springer International Publishing. https://doi.org/10.1007/978-3-030-17332-6_2.

Cramer, Angélique O. J., Claudia D. van Borkulo, Erik J. Giltay, Han L. J. van der Maas, Kenneth S. Kendler, Marten Scheffer, and Denny Borsboom. 2016. “Major Depression as a Complex Dynamic System.” PLOS ONE 11 (12): e0167490. https://doi.org/10.1371/journal.pone.0167490.

Creswell, John W., and Vicki L. Plano Clark. 2017. Designing and Conducting Mixed Methods Research. Third Edition. SAGE Publications.

de Groot, Adriaan D. 1961. Methodologie: Grondslagen van onderzoek en denken in de gedragswetenschappen. ’s Gravenhage: Uitgeverij Mouton.

Epskamp, Sacha, Mijke Rhemtulla, and Denny Borsboom. 2017. “Generalized Network Psychometrics: Combining Network and Latent Variable Models.” Psychometrika 82: 904–27. https://doi.org/10.1007/s11336-017-9557-x.

Gratz, Kim L., and Lizabeth Roemer. 2004. “Multidimensional Assessment of Emotion Regulation and Dysregulation: Development, Factor Structure, and Initial Validation of the Difficulties in Emotion Regulation Scale.” Journal of Psychopathology and Behavioral Assessment 26 (1): 41–54. https://doi.org/10.1023/B:JOBA.0000007455.08539.94.

Haslbeck, Jonas M. B., Oisín Ryan, Donald J. Robinaugh, Lourens J. Waldorp, and Denny Borsboom. 2021. “Modeling Psychopathology: From Data Models to Formal Theories.” Psychological Methods, November. https://doi.org/10.1037/met0000303.

Jacobucci, Ross, Kevin J. Grimm, and John J. McArdle. 2016. “Regularized Structural Equation Modeling.” Structural Equation Modeling : A Multidisciplinary Journal 23 (4): 555–66. https://doi.org/10.1080/10705511.2016.1154793.

Lavelle, Jane Suilin. 2021. “When a Crisis Becomes an Opportunity: The Role of Replications in Making Better Theories.” The British Journal for the Philosophy of Science, April, 714812. https://doi.org/10.1086/714812.

Lee, Francis S., Hakon Heimer, Jay N. Giedd, Edward S. Lein, Nenad Šestan, Daniel R. Weinberger, and B. J. Casey. 2014. “Adolescent Mental Health and Obligation.” Science 346 (6209): 547–49. https://doi.org/10.1126/science.1260497.

Littell, Julia H. 2008. “Evidence-Based or Biased? The Quality of Published Reviews of Evidence-Based Practices.” Children and Youth Services Review 30 (11): 1299–1317. https://doi.org/10.1016/j.childyouth.2008.04.001.

Scheel, Anne M. 2022. “Why Most Psychological Research Findings Are Not Even Wrong.” Infant and Child Development 31 (1): e2295. https://doi.org/10.1002/icd.2295.

Scheel, Anne M., Mitchell R. M. J. Schijen, and Daniël Lakens. 2021. “An Excess of Positive Results: Comparing the Standard Psychology Literature With Registered Reports.” Advances in Methods and Practices in Psychological Science 4 (2): 25152459211007467. https://doi.org/10.1177/25152459211007467.

van Lissa, Caspar J. 2021. “Mapping Phenomena Relevant to Adolescent Emotion Regulation: A Text-Mining Systematic Review.” Adolescent Research Review, May. https://doi.org/10.1007/s40894-021-00160-7.

Van Lissa, Caspar J., Andreas M. Brandmaier, Loek Brinkman, Anna-Lena Lamprecht, Aaron Peikert, Marijn E. Struiksma, and Barbara Vreede. 2020. WORCS: A Workflow for Open Reproducible Code in Science,” May. https://doi.org/10.17605/OSF.IO/ZCVBS.

Van Lissa, Caspar J., Renske Keizer, Pol A. C. Van Lier, Wim H. J. Meeus, and Susan Branje. 2019. “The Role of Fathers’ Versus Mothers’ Parenting in Emotion-Regulation Development from Midlate Adolescence: Disentangling Between-Family Differences from Within-Family Effects.” Developmental Psychology 55 (2): 377–89. https://doi.org/10.1037/dev0000612.

Vidgen, Bertie, and Taha Yasseri. 2016. “P- Values: Misunderstood and Misused.” Frontiers in Physics 4.

Walasek, Nicole, Willem E Frankenhuis, and Karthik Panchanathan. 2022. “An Evolutionary Model of Sensitive Periods When the Reliability of Cues Varies Across Ontogeny.” Behavioral Ecology 33 (1): 101–14. https://doi.org/10.1093/beheco/arab113.

Yarkoni, Tal, and Jacob Westfall. 2017. “Choosing Prediction Over Explanation in Psychology: Lessons From Machine Learning.” Perspectives on Psychological Science 12 (6): 1100–1122. https://doi.org/10.1177/1745691617693393.

Zimmermann, Peter, and Alexandra Iwanski. 2014. “Emotion Regulation from Early Adolescence to Emerging Adulthood and Middle Adulthood: Age Differences, Gender Differences, and Emotion-Specific Developmental Variations.” International Journal of Behavioral Development 38 (2): 182–94.