Subba Reddy Oota Vijay Rowtula Satya Sai Srinath Khushbu Pahwa
Anant Khandelwal Manish Gupta Tanmoy Chakraborty Bapi S. Raju
Linguistic Properties and Model Scale in Brain Encoding: From Small to Compressed Language Models
2
Language models (LMs) predict brain activity evoked by complex
language (e.g. listening a story) to an impressive degree
Once
upon
a
time
Brain alignment of a LM ⇒ how similar its representations are to a human brains
Wehbe et al. 2014,
Jain and Huth 2018,
Gauthier and Levy 2019
Toneva and Wehbe 2019,
Caucheteux et al. 2020,
Toneva et al. 2020
Jain et al. 2020,
Schrimpf et al. 2021,
Goldstein et al. 2022
...
3
Language models (LMs) predict brain activity evoked by complex
language (e.g. listening a story) to an impressive degree
Jain and Huth. Incorporating context into language encoding models for fMRI. (NeurIPS 2018)
Toneva and Wehbe. Interpreting and improving natural-language processing (in machines) with natural language-processing (in the brain). (NeurIPS 2019)
Brain alignment of a LM ⇒ does better brain alignment demand scaling of language models?
brain alignmenti = Pearson corr(true vi, pred vi)
4
Antonello et al. 2023 NeurIPS
LMs predict brain activity, and that alignment improves as models scale, suggesting neural scaling laws
Larger models, better brain alignment, but harder to interpret and costly. How small can we go?
5
How small can we go? Three questions about scale, compression, and the brain
Use model size and precision as controlled "knobs" to probe what representations the brain needs.
6
Two knobs: model scale and numerical precision
Investigate via probing
The Moth Radio Hour
Naturalistic stimulus
Text Transcript
Listening
fMRI
Large language models (LLMs)
Quantized (SLMs & LLMs)
Lightweight models
(SLMs)
Significant Difference
Quantization affects alignment
Brain Alignment
Brain alignment shifts with scale and compression - does linguistic competence shift the same way?
7
Does linguistic competence drive brain–model alignment?
Investigate via an indirect approach
Morphology, Discourse, Semantics, Syntax, Reasoning
FlashHolmes
The Moth Radio Hour
Naturalistic stimulus
Text Transcript
Listening
fMRI
Large language models (LLMs)
Quantized (SLMs & LLMs)
Lightweight models
(SLMs)
Linguistic Competence
Significant Difference
Quantization affects alignment
Brain Alignment
Probing (66 tasks)
Accuracy
8
Datasets & Model
To quantify model predictions, we have an estimate of the explainable variance and use that to measure normalize brain alignment.
9
What is the minimal model capacity required to achieve brain alignment comparable to larger LLMs?
Result-1: SLMs vs. LLMs vs. Quantization variants & brain alignment
Is brain alignment equally robust to all compression methods?
11
How do compression methods (quantization and pruning) affect brain alignment for both SLMs and LLMs?
Result-2: Brain alignment is robust to compression, except GPTQ
Moderate pruning (10–25%) preserves alignment at both scales (1.5-3B), but aggressive 50% pruning collapses the 1.5B model
Does linguistic competence vary across scale and precision?
13
Which linguistic properties are preserved or degraded across small, large, and compressed models, and do these changes correlate with brain alignment?
Result-3: Linguistic competence and brain alignment dissociate
Conclusions for neuro-AI research field
16
Subba Reddy Oota
Linguistic Properties and Model Scale in Brain Encoding: From Small to Compressed Language Models (ICML-2026)
Manish Gupta
Bapi S. Raju
Anant Khandelwal
Khushbu Pahwa
Satya Sai Srinath
Vijay Rowtula
Tanmoy Chakraborty