PSYC 51.07: Models of Language and Communication

Lecture 18: BERT Deep Dive

Week 6, Lecture 1 - Bidirectional Encoder Representations


Winter 2026


Today's Agenda 📋

  1. 🎯 BERT Introduction: What makes it special?
  2. 🎭 Masked Language Modeling: The key training objective
  3. 🏗️ BERT Architecture: Model sizes and specifications
  4. 📊 Pre-training & Fine-tuning: The two-stage paradigm
  5. 🔬 Contextual Embeddings: Seeing polysemy in action
  6. 💻 Using BERT: Practical code examples

Goal: Deep understanding of BERT and how it revolutionized NLP


BERT: Bidirectional Encoder Representations 🎯

BERT = Encoder-only Transformer

Key Innovation: Masked Language Modeling (MLM)

Traditional Language Models:

  • Left-to-right (GPT)

  • Or right-to-left

  • See context in only one direction

Example:

  • "The cat sat on the ___"
  • Only sees left context

BERT (MLM):

  • Mask random tokens
  • Predict them from the surrounding context
  • Sees both left & right
  • Deeper understanding

Example:

  • "The cat [MASK] on the mat"
  • Sees: "The cat" AND "on the mat"
  • Predicts: "sat"

Reference: Devlin et al. (2019) - "BERT: Pre-training of Deep Bidirectional Transformers"


Why BERT Was Revolutionary 🚀

Before BERT (2018):

  • Feature-based approaches (use Word2Vec/GloVe as features)
  • Task-specific architectures
  • Limited transfer learning
  • Unidirectional or shallow bidirectional models

BERT's Contributions:

  1. Deep Bidirectionality
    • True bidirectional context at every layer
    • Not just concatenating left-to-right and right-to-left
  2. Pre-train + Fine-tune Paradigm
    • Single pre-trained model for all tasks
    • Fine-tune with minimal architecture changes
    • Democratized NLP (no need to train from scratch!)
  3. State-of-the-Art Results
    • Beat previous best on 11 NLP tasks
    • Large performance gains (sometimes 10+ points!)
    • Showed power of pre-training

Masked Language Modeling (MLM) 🎭

BERT's Pre-training Objective

Training Procedure:

  1. Take a sentence
  2. Randomly select 15% of tokens for prediction
  3. Of the selected tokens:
    • 80%: Replace with [MASK]
    • 10%: Replace with a random word
    • 10%: Keep unchanged
  4. Predict the original tokens

Example

Original: "My dog is hairy"

Masked: "My dog is [MASK]"

Labels: [-, -, -, hairy]

Prediction: Model predicts "hairy" using bidirectional context

Why the 80/10/10 split?

  • Prevents overfitting to [MASK] token
  • Forces model to maintain representations for all tokens
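
A minimal sketch of this masking step in Python (hypothetical helper, not BERT's actual implementation):

import random

def mask_for_mlm(tokens, vocab, select_prob=0.15):
    # Illustrative 80/10/10 masking; real implementations work on token ids and batches
    inputs, labels = list(tokens), [None] * len(tokens)
    for i, tok in enumerate(tokens):
        if random.random() < select_prob:         # choose ~15% of positions for prediction
            labels[i] = tok                       # loss is computed only at these positions
            r = random.random()
            if r < 0.8:
                inputs[i] = "[MASK]"              # 80%: replace with [MASK]
            elif r < 0.9:
                inputs[i] = random.choice(vocab)  # 10%: replace with a random token
            # remaining 10%: keep the original token unchanged
    return inputs, labels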

MLM Example: Step by Step 📝

Sentence: "The quick brown fox jumps over the lazy dog"

Step 1: Select Tokens (15%)
9 tokens total, mask ~1-2


tokens = ["The", "quick", "brown", "fox",
          "jumps", "over", "the", "lazy", "dog"]
# Randomly select: "quick" (idx 1), "over" (idx 5)

Step 2: Apply 80/10/10 Strategy


1"quick"  80%  [MASK]
2"over"   10%  "under" (random)

Step 3: Create Training Example


input:  "The [MASK] brown fox jumps
         under the lazy dog"
labels: [-1, "quick", -1, -1, -1,
         "over", -1, -1, -1]
# -1 = no loss computed

Step 4: Model Predicts


1P("quick" | context)  high (adjective slot)
2P("over" | context)   high (preposition slot)

Key insight: Model must understand syntax AND semantics to predict masked words!


Next Sentence Prediction (NSP) 🔗

BERT's second pre-training objective (debated usefulness)

Task: Given two sentences A and B, predict if B follows A

Positive Example (IsNext)

Sentence A: "The man went to the store."

Sentence B: "He bought a gallon of milk."

Label: IsNext ✓

Negative Example (NotNext)

Sentence A: "The man went to the store."

Sentence B: "Penguins are flightless birds."

Label: NotNext ✗

Implementation:

  • Use [CLS] token representation for classification
  • 50% real pairs, 50% random pairs
  • Binary classification task
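
A minimal sketch of running this with HuggingFace's NSP head (bert-base-uncased assumed; for illustration only):

from transformers import BertTokenizer, BertForNextSentencePrediction
import torch

tokenizer = BertTokenizer.from_pretrained('bert-base-uncased')
model = BertForNextSentencePrediction.from_pretrained('bert-base-uncased')

sent_a = "The man went to the store."
sent_b = "He bought a gallon of milk."

# Tokenizer builds [CLS] A [SEP] B [SEP] and the segment (token_type) ids
inputs = tokenizer(sent_a, sent_b, return_tensors='pt')
with torch.no_grad():
    logits = model(**inputs).logits       # shape (1, 2): [IsNext, NotNext]
print(torch.softmax(logits, dim=-1))      # coherent pairs get high IsNext probability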

Note: Later work (RoBERTa) showed NSP might not be necessary!


BERT Architecture Variants 📊

Model        Layers   Hidden size   Attention heads   Parameters
BERT-Base    12       768           12                110M
BERT-Large   24       1024          16                340M

Architecture Details (BERT-Base):

  • 12 transformer encoder layers
  • 768-dimensional hidden states
  • 12 attention heads per layer (64 dims each)
  • 3072-dimensional feed-forward intermediate size (4x expansion)
  • Maximum sequence length: 512 tokens
  • Vocabulary size: 30,000 WordPiece tokens
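
These numbers can be read off a pre-trained checkpoint's config (a quick check with HuggingFace; bert-base-uncased assumed):

from transformers import BertConfig, BertModel

config = BertConfig.from_pretrained('bert-base-uncased')
print(config.num_hidden_layers)        # 12
print(config.hidden_size)              # 768
print(config.num_attention_heads)      # 12
print(config.intermediate_size)        # 3072
print(config.max_position_embeddings)  # 512
print(config.vocab_size)               # 30522

model = BertModel.from_pretrained('bert-base-uncased')
print(sum(p.numel() for p in model.parameters()))  # roughly 110M parameters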

Special Tokens:

  • [CLS]: Classification token (first token, used for sequence-level tasks)
  • [SEP]: Separator token (between sentences)
  • [MASK]: Mask token (for MLM)
  • [PAD]: Padding token (for variable-length sequences)

BERT Input Representation 🔤

Three types of embeddings are summed:


# Example: Sentence pair for NSP
sentence_a = "My dog is cute"
sentence_b = "He likes playing"

# Tokenization
tokens = ["[CLS]", "my", "dog", "is", "cute", "[SEP]", "he", "likes", "playing", "[SEP]"]

# Three embedding types (each is a 768-dim vector):
token_emb    = [E_CLS, E_my, E_dog, E_is, E_cute, E_SEP, E_he, E_likes, E_playing, E_SEP]
segment_emb  = [E_A,   E_A,  E_A,   E_A,  E_A,    E_A,   E_B,  E_B,     E_B,       E_B  ]
position_emb = [E_0,   E_1,  E_2,   E_3,  E_4,    E_5,   E_6,  E_7,     E_8,       E_9  ]

# Final input = token + segment + position (element-wise sum)
input_embedding = token_emb + segment_emb + position_emb

Three embedding types:

  1. Token Embeddings: WordPiece vocabulary (30K learned vectors)
  2. Segment Embeddings: Which sentence (A or B)? (2 learned vectors)
  3. Position Embeddings: Learned position 0-511 (512 learned vectors)
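
With HuggingFace, the tokenizer produces the token ids and segment (token_type) ids for a sentence pair automatically (a small illustration; the exact subword split depends on the vocabulary):

from transformers import BertTokenizer

tokenizer = BertTokenizer.from_pretrained('bert-base-uncased')
enc = tokenizer("My dog is cute", "He likes playing")

print(tokenizer.convert_ids_to_tokens(enc["input_ids"]))
# e.g. ['[CLS]', 'my', 'dog', 'is', 'cute', '[SEP]', 'he', 'likes', ..., '[SEP]']
print(enc["token_type_ids"])  # 0 for sentence A positions, 1 for sentence B positions
# Position embeddings are added inside the model based on each token's index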

WordPiece Tokenization: Worked Example 🔤

How BERT handles unknown words


from transformers import BertTokenizer
tokenizer = BertTokenizer.from_pretrained('bert-base-uncased')

# Common words stay intact
tokenizer.tokenize("The cat sat on the mat")
# → ['the', 'cat', 'sat', 'on', 'the', 'mat']

# Rare/unknown words get split into subwords
tokenizer.tokenize("unbelievably")
# → ['un', '##believable', '##ly']  # "##" means continuation

tokenizer.tokenize("ChatGPT is transformative")
# → ['chat', '##g', '##pt', 'is', 'transform', '##ative']
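
Continuing with the tokenizer above, subword pieces can be mapped back and forth (illustrative round-trip):

# "##" pieces re-attach to the preceding piece when converting back to a string
print(tokenizer.convert_tokens_to_string(['un', '##believable', '##ly']))  # unbelievably

# Tokens round-trip through the vocabulary as integer ids
ids = tokenizer.convert_tokens_to_ids(['the', 'cat', 'sat'])
print(tokenizer.convert_ids_to_tokens(ids))  # ['the', 'cat', 'sat']
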
Why WordPiece?
  • No OOV problem: Any word can be represented as subwords
  • Morphology: Learns word parts (prefixes, suffixes, stems)
  • Compact vocabulary: 30K tokens cover most text
  • Trade-off: Rare words take more tokens (longer sequences)

BERT Pre-training 🏋️

Massive scale pre-training on unlabeled text

Pre-training Data:

  • BooksCorpus: 800M words (novels, fiction)
  • English Wikipedia: 2,500M words
  • Total: 3.3 billion words
  • Diverse, high-quality text

Training Details:

  • Batch size: 256 sequences (128,000 tokens)
  • Training steps: 1M steps
  • Optimization: Adam (lr=1e-4, warmup=10k steps)
  • Hardware: 4-16 Cloud TPUs
  • Training time: 4 days (BERT-Base), 4+ days (BERT-Large)
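
A hedged sketch of that optimization schedule in PyTorch (linear warmup then linear decay; the model and step counts here are placeholders, not the original training code):

import torch
from transformers import BertConfig, BertForPreTraining

model = BertForPreTraining(BertConfig())  # randomly initialized stand-in

optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4, weight_decay=0.01)

def lr_factor(step, warmup=10_000, total=1_000_000):
    if step < warmup:
        return step / warmup                            # linear warmup to lr=1e-4
    return max(0.0, (total - step) / (total - warmup))  # then linear decay toward 0

scheduler = torch.optim.lr_scheduler.LambdaLR(optimizer, lr_factor)
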
Key Insight

Pre-training learns general language understanding that transfers to many downstream tasks!


Fine-tuning BERT 🎓

Two-stage process: Pre-train then Fine-tune

Stage 1: Pre-training (done once)


# Expensive: weeks on TPUs
# Data: 3.3B words (books + Wikipedia)
# Task: MLM + NSP
# Result: General language understanding

# (pseudocode for illustration, not a real API)
model = pretrain_bert(
    data=["BooksCorpus", "Wikipedia"],
    steps=1_000_000,
    hardware="16 TPUs"
)

Stage 2: Fine-tuning (per task)


# Cheap: hours on a single GPU
# Data: 1K-100K labeled examples
# Task: Your specific task
# Result: Task-specific model

# (pseudocode for illustration; see the HuggingFace example on the next slide)
model = load_pretrained("bert-base")
model.add_classifier(num_labels=2)
model.train(
    task_data,
    epochs=3,
    lr=2e-5  # Small learning rate!
)

Benefits: Pre-training captures language; fine-tuning adapts to your task


Fine-tuning for Different Tasks 🎯

Minimal architecture changes needed!

  1. Single Sentence Classification
    • Input: [CLS] sentence [SEP]
    • Output: [CLS] representation → classifier
    • Example: Sentiment analysis
  2. Sentence Pair Classification
    • Input: [CLS] sentence A [SEP] sentence B [SEP]
    • Output: [CLS] representation → classifier
    • Example: Natural Language Inference
  3. Question Answering
    • Input: [CLS] question [SEP] passage [SEP]
    • Output: Token-level predictions for start/end positions
    • Example: SQuAD
  4. Token Classification
    • Input: [CLS] sentence [SEP]
    • Output: Each token representation → classifier
    • Example: Named Entity Recognition
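
In HuggingFace Transformers, each of these task formats corresponds to a ready-made head on top of the same encoder (a quick sketch; the label counts are arbitrary examples):

from transformers import (
    BertForSequenceClassification,  # [CLS] → label (sentiment, NLI)
    BertForQuestionAnswering,       # per-token start/end logits (SQuAD)
    BertForTokenClassification,     # per-token labels (NER)
)

# Same pre-trained encoder underneath; only the small output head differs
clf = BertForSequenceClassification.from_pretrained('bert-base-uncased', num_labels=3)
qa  = BertForQuestionAnswering.from_pretrained('bert-base-uncased')
ner = BertForTokenClassification.from_pretrained('bert-base-uncased', num_labels=9)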

Fine-tuning BERT: Code Example 💻

Using HuggingFace Transformers


from transformers import BertForSequenceClassification, Trainer, TrainingArguments

# Load pre-trained BERT with classification head
model = BertForSequenceClassification.from_pretrained(
    'bert-base-uncased',
    num_labels=2  # Binary classification
)

# Define training arguments
training_args = TrainingArguments(
    output_dir='./results',
    num_train_epochs=3,
    per_device_train_batch_size=16,
    learning_rate=2e-5,
    warmup_steps=500,
)

# Train (train_dataset / eval_dataset are tokenized datasets prepared beforehand)
trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=train_dataset,
    eval_dataset=eval_dataset,
)

trainer.train()

Reference: HuggingFace Course - Chapters 1.5, 7.3


Contextual Embeddings in Action 🔬

Remember "bank"? Let's see BERT handle it!


from transformers import BertTokenizer, BertModel
import torch

tokenizer = BertTokenizer.from_pretrained('bert-base-uncased')
model = BertModel.from_pretrained('bert-base-uncased')

# Two different contexts for "bank"
sent1 = "I deposited money at the bank"
sent2 = "We sat by the river bank"

# Get embeddings
def get_embedding(sentence, target_word):
    inputs = tokenizer(sentence, return_tensors='pt')
    outputs = model(**inputs)
    # Find position of target word
    tokens = tokenizer.tokenize(sentence)
    idx = tokens.index(target_word) + 1  # +1 for [CLS]
    return outputs.last_hidden_state[0, idx, :]

emb1 = get_embedding(sent1, "bank")  # Financial bank
emb2 = get_embedding(sent2, "bank")  # River bank

# Compare similarity
similarity = torch.cosine_similarity(emb1, emb2, dim=0)
print(f"Similarity: {similarity:.3f}")  # Low! (~0.3-0.5)
# Different contexts → Different embeddings!

Visualizing BERT's Contextual Embeddings 📊

Same word, different meanings, different vectors


# Find nearest neighbors for "bank" in each context
# (find_nearest_words is an illustrative placeholder, e.g. built on
#  sklearn.neighbors.NearestNeighbors over vocabulary embeddings)
from sklearn.neighbors import NearestNeighbors

# Financial "bank" context
neighbors_financial = find_nearest_words(emb1, vocabulary)
# → ["banks", "financial", "account", "deposit", "loan", "credit"]

# River "bank" context
neighbors_river = find_nearest_words(emb2, vocabulary)
# → ["shore", "riverside", "banks", "stream", "water", "edge"]

Concrete Measurements

Word Pair                     Word2Vec Similarity   BERT Similarity
bank (fin) vs bank (river)    1.00 (same vector!)   0.42
bank (fin) vs money           0.65                  0.78
bank (river) vs shore         0.52                  0.81

BERT captures meaning differences that static embeddings miss!


BERT's Impressive Results 📈

State-of-the-art on 11 NLP tasks when released (2018)

Task                Metric     Previous SOTA   BERT
SQuAD 2.0 (QA)      F1         66.3            83.1
MNLI (NLI)          Accuracy   80.6            86.7
SST-2 (Sentiment)   Accuracy   93.2            94.9
CoNLL-2003 (NER)    F1         92.6            92.8

Key Observations:

  • Largest gains on tasks requiring understanding (QA, NLI)
  • Improvements even on well-studied benchmarks
  • BERT-Large generally better than BERT-Base
  • Fine-tuning is simple but very effective
Impact

BERT made pre-trained transformers the standard approach in NLP. Almost all subsequent models build on BERT's ideas!


What Does BERT Learn? 🧠

Probing BERT's internal representations

Research has shown BERT captures:

  1. Syntactic Information
    • Part-of-speech tags
    • Constituent structure
    • Dependency relations
    • Lower layers encode more syntax
  2. Semantic Information
    • Word sense disambiguation
    • Semantic roles
    • Entity types
    • Middle layers encode more semantics
  3. Pragmatic Information
    • Coreference resolution
    • Discourse relations
    • Higher layers encode more pragmatics
  4. World Knowledge
    • Factual knowledge (to some extent)
    • Common sense reasoning (limited)

Reference: Tenney et al. (2019) - "BERT Rediscovers the Classical NLP Pipeline"


BERT Layer Analysis 📊

Different layers capture different linguistic properties


# Probing experiment: train linear classifiers on each layer's representations
from transformers import BertTokenizer, BertModel

tokenizer = BertTokenizer.from_pretrained('bert-base-uncased')
model = BertModel.from_pretrained('bert-base-uncased', output_hidden_states=True)

# Get hidden states for all layers
inputs = tokenizer("The quick brown fox jumps over the lazy dog", return_tensors='pt')
outputs = model(**inputs)
hidden_states = outputs.hidden_states  # 13 tensors: embedding layer + 12 transformer layers

# Results from probing studies (Tenney et al., 2019):
layer_specialization = {
    "Layers 0-2":   ["POS tagging", "Word boundaries"],      # Surface
    "Layers 3-6":   ["Parse trees", "Dependencies"],         # Syntax
    "Layers 7-9":   ["Semantic roles", "Coreference"],       # Semantics
    "Layers 10-12": ["Task-specific representations"],       # Task
}

Observations:

  • Lower layers: surface features (word forms, POS)
  • Middle layers: syntax (phrase structure, dependencies)
  • Higher layers: semantics and task-specific features

Similar to CNNs: edges → shapes → objects
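
A toy probing setup along these lines (a sketch under simplifying assumptions, reusing the tokenizer and model loaded above; the training sentences and labels are placeholders):

import torch
from sklearn.linear_model import LogisticRegression

def layer_features(sentences, layer):
    feats = []
    for s in sentences:
        enc = tokenizer(s, return_tensors='pt')
        with torch.no_grad():
            hs = model(**enc).hidden_states[layer]  # (1, seq_len, 768)
        feats.append(hs[0].mean(dim=0).numpy())     # crude fixed-size sentence vector
    return feats

# probe = LogisticRegression(max_iter=1000).fit(layer_features(train_sents, 4), train_labels)
# Comparing probe accuracy across layers shows where a property is most linearly decodable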


Discussion Questions 💭

  1. MLM vs. Autoregressive:
    • Why is MLM better for understanding tasks?
    • Can BERT generate text like GPT?
    • What are the trade-offs?
  2. The 80/10/10 Masking Strategy:
    • Why not just use 100% [MASK]?
    • What problem does the random replacement solve?
    • Could we improve this strategy?
  3. Pre-training Data:
    • Why use books and Wikipedia?
    • Would social media text work as well?
    • How does data quality affect pre-training?
  4. Fine-tuning:
    • Why does fine-tuning work so well?
    • When might fine-tuning fail?
    • How much labeled data do we need?

Looking Ahead 🔮

Today we learned:

  • BERT architecture and innovations
  • Masked Language Modeling
  • Pre-training and fine-tuning
  • Contextual embeddings
  • What BERT learns

Next lecture: BERT Variants

  • RoBERTa: Optimized BERT training
  • ALBERT: Parameter-efficient BERT
  • DistilBERT: Smaller, faster BERT
  • Others: ELECTRA, DeBERTa, and more
  • Comparative analysis and when to use which

BERT started a revolution in NLP! 🚀


Summary 🎯

Key Takeaways:

  1. BERT = Encoder-only Transformer
    • Bidirectional self-attention
    • Trained with Masked Language Modeling
  2. Pre-train + Fine-tune Paradigm
    • Expensive pre-training on unlabeled data (once)
    • Cheap fine-tuning on task-specific data (per task)
  3. Contextual Embeddings
    • Different representations based on context
    • Solves polysemy problem
  4. Hierarchical Learning
    • Lower layers: syntax
    • Higher layers: semantics
    • Learns linguistic structure automatically
  5. Revolutionary Impact
    • Established pre-training as standard
    • Democratized NLP research
    • Foundation for modern LLMs

References 📚

Essential Papers:

  • Devlin et al. (2019) - "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
    • The original BERT paper
    • Introduced MLM and NSP
  • Tenney et al. (2019) - "BERT Rediscovers the Classical NLP Pipeline"
    • Analysis of what BERT learns
    • Layer-wise linguistic properties
  • Clark et al. (2019) - "What Does BERT Look At? An Analysis of BERT's Attention"
    • Understanding BERT's attention patterns

Tutorials:

  • HuggingFace Course: Chapter 1 (Transformer Models)
  • Jay Alammar: "The Illustrated BERT, ELMo, and co."
  • BertViz: Interactive attention visualization

Questions? 🙋

Discussion Time

Topics for discussion:

  • Masked Language Modeling
  • Pre-training vs fine-tuning
  • Contextual embeddings
  • BERT architecture details
  • Implementation questions

Thank you! 🙏

Next: BERT Variants and Improvements!
