Large language models as experimental systems in human psychopathology: a modelling study

AI Summary

State-of-the-art LLMs can be systematically induced into and downregulated from affective states, with GPT-4o showing large quantified changes.
Model architecture and scale influence susceptibility, with GPT-4o and Llama 4 Maverick strongest, Llama 4 Scout weakest, and significant between-model differences.
LLMs reproduce psychological phenomena including sadness-related negativity bias, enabling experimental modelling of psychopathology and preliminary therapeutic screening.

AI Summary

We aimed to assess whether large language models (LLMs) could serve as experimental systems to model affective processes relevant to human psychopathology.

By reproducing key psychological phenomena, LLMs might enable the experimental investigation of mechanisms underlying mental disorders and facilitate the preliminary screening of novel therapeutic interventions, potentially accelerating progress in a field historically constrained by the scarcity of effective model systems.

Basic summary

Lancet Digit Health. 2026 Jun 10:101014. doi: 10.1016/j.landig.2026.101014. Online ahead of print.

ABSTRACT

BACKGROUND: Despite advances in biomedical research, human psychopathology remains underserved by experimental model systems, limiting therapeutic innovation. Alternative approaches are needed to investigate the mechanisms underlying mental health conditions. We aimed to assess whether large language models (LLMs) could serve as experimental systems to model affective processes relevant to human psychopathology.

METHODS: Using standard psychological induction protocols, including imagery vignettes, we tested whether seven affective states (fear, anxiety, anger, disgust, sadness, worry, and stress) could be systematically induced in six state-of-the-art LLMs (including GPT-4o and several Llama variants) and subsequently reversed with regulation strategies. Separate prompt sequences were created for each affective state, and an additional neutral control condition was included in which no affective stimulation was applied after the initial general prompt. All affective states followed static vignette protocols without interaction, consistent with human studies. The only exception was stress, which, in line with the original Trier Social Stress Test (TSST) protocol, followed a dynamic interactive prompting procedure. The LLMs were intermittently prompted to self-assess their current affective state via visual analogue scales with a fixed numerical range from 0 to 100 (except for anxiety, which was measured with the State version of the State-Trait Anxiety Inventory). To reverse the induction of affective states, a mindfulness-based relaxation technique was used for all conditions except stress, which was followed by a standardised debriefing procedure consistent with the original TSST protocol. Each condition was repeated in five independent runs to ensure reliability. A sentence completion test was used to test for cognitive bias after induction of sadness, with responses rated for emotional valence by three independent human raters. Inter-rater reliability (Cohen’s κ) and negativity scores were calculated, and conditions were compared using t tests (Cohen’s d).

FINDINGS: Across all affective states and over five runs, mean scores for GPT-4o increased by 52·83 points (201·20%) from baseline to post affect induction, and downregulation prompts reduced scores by 48·23 points (60·98%). These patterns were broadly replicated across five additional open-weight LLMs, with significant between-model differences for all affective states (p values between 0·045 and <0·0001) except stress (p=0·063). GPT-4o and Llama 4 Maverick showed the strongest effects, whereas Llama 4 Scout showed the weakest responses, indicating that model architecture and scale influence susceptibility to affect induction. In the test for cognitive bias, sadness-related prompts elicited a consistent negativity bias in sentence completions by GPT-4o compared with neutral prompts (mean 15·00 [SD 4·26] vs 8·67 [2·66]; Cohen’s d=1·87).

INTERPRETATION: Our findings establish LLMs as promising tools for modelling affective processes relevant to human psychopathology. By reproducing key psychological phenomena, LLMs might enable the experimental investigation of mechanisms underlying mental disorders and facilitate the preliminary screening of novel therapeutic interventions, potentially accelerating progress in a field historically constrained by the scarcity of effective model systems.

FUNDING: None.

PMID:42270468 | DOI:10.1016/j.landig.2026.101014

Document this CPD