Prediction, syntax and semantic grounding in the brain and large language models

Language comprehension involves continuous anticipation of upcoming linguistic input, requiring the rapid integration of syntactic structure and semantic information. To capture the spatio-temporal dynamics of such anticipatory processes during naturalistic language comprehension, we combined electr...

Full description

Saved in:
Bibliographic Details
Main Authors: Kölbl, Nikola (Author) , Rampp, Stefan (Author) , Kaltenhäuser, Martin (Author) , Tziridis, Konstantin (Author) , Maier, Andreas (Author) , Kinfe, Thomas M. (Author) , Chavarriaga, Ricardo (Author) , Krauss, Patrick (Author) , Schilling, Achim (Author)
Format: Article (Journal)
Language:English
Published: 10 March 2026
In: Scientific reports
Year: 2026, Volume: 16, Pages: 1-17
ISSN:2045-2322
DOI:10.1038/s41598-026-41532-0
Online Access:Verlag, kostenfrei, Volltext: https://doi.org/10.1038/s41598-026-41532-0
Verlag, kostenfrei, Volltext: https://www.nature.com/articles/s41598-026-41532-0
Get full text
Author Notes:Nikola Kölbl, Stefan Rampp, Martin Kaltenhäuser, Konstantin Tziridis, Andreas Maier, Thomas Kinfe, Ricardo Chavarriaga, Patrick Krauss & Achim Schilling
Description
Summary:Language comprehension involves continuous anticipation of upcoming linguistic input, requiring the rapid integration of syntactic structure and semantic information. To capture the spatio-temporal dynamics of such anticipatory processes during naturalistic language comprehension, we combined electroencephalography (EEG) and magnetoencephalography (MEG), leveraging their complementary sensitivities and high temporal resolution. Using this combined EEG-MEG approach, we investigated word-class-specific neural responses during continuous speech perception and related these findings to word class-level predictability and representational structure in a large language model. Twenty-nine healthy participants listened to a German audio book while their neural responses were recorded. Event-related fields and event-related potentials for different word classes showed highly reproducible, characteristic spatio-temporal signatures, including significant pre-onset activity for nouns, suggesting enhanced anticipatory processing of this word class. Source-space analyses revealed activity patterns extending beyond temporal regions into areas compatible with sensorimotor cortices, suggesting a deeper semantic grounding of nouns in e.g. sensory experiences than verbs. By analyzing word class-specific predictability and representational structure in the transformer-based language model Llama, we provide a computational reference frame that complements the neural findings at the level of word classes. These findings highlight the power of simultaneous MEG-EEG recordings in unraveling the predictive, syntactic, and semantic mechanisms that underlie language comprehension.
Item Description:Gesehen am 13.04.2026
Physical Description:Online Resource
ISSN:2045-2322
DOI:10.1038/s41598-026-41532-0