 * 2. Include this script:
 * 3. Create charts with minimal configuration - colors are auto-applied!
 */
(function() {
  'use strict';

  // ==========================================================================
  // READ COLORS FROM CSS CUSTOM PROPERTIES
  // This ensures chart colors stay in sync with the theme
  // ==========================================================================

  /**
   * Get a CSS custom property value from :root.
   * @param {string} name - Custom property name, e.g. '--chart-color-1'.
   * @param {string} [fallback=''] - Returned when the property is empty or
   *   when getComputedStyle is unavailable (non-browser environment).
   * @returns {string} Trimmed property value, or the fallback.
   */
  function getCSSVar(name, fallback = '') {
    if (typeof getComputedStyle === 'undefined') return fallback;
    const value = getComputedStyle(document.documentElement).getPropertyValue(name).trim();
    return value || fallback;
  }

  /**
   * Build palette from CSS custom properties (with fallbacks).
   * Every entry falls back to a hard-coded brand default so charts render
   * sensibly even when the theme stylesheet is missing.
   * @returns {object} Palette: brand colors, chart color cycles, semantic
   *   colors, grid/axis colors, and the chart font family.
   */
  function buildPaletteFromCSS() {
    return {
      // Primary brand colors
      dartmouthGreen: getCSSVar('--dartmouth-green', '#00693e'),
      textPrimary: getCSSVar('--text-primary', '#0a2518'),
      textSecondary: getCSSVar('--text-secondary', '#0a3d23'),

      // Chart colors (from CSS --chart-color-N variables)
      chartColors: [
        getCSSVar('--chart-color-1', '#00693e'),
        getCSSVar('--chart-color-2', '#267aba'),
        getCSSVar('--chart-color-3', '#ffa00f'),
        getCSSVar('--chart-color-4', '#9d162e'),
        getCSSVar('--chart-color-5', '#8a6996'),
        getCSSVar('--chart-color-6', '#a5d75f'),
        getCSSVar('--chart-color-7', '#003c73'),
        getCSSVar('--chart-color-8', '#d94415'),
        getCSSVar('--chart-color-9', '#643c20'),
        getCSSVar('--chart-color-10', '#c4dd88'),
        getCSSVar('--chart-color-11', '#f5dc69'),
        getCSSVar('--chart-color-12', '#424141'),
      ],

      // Background colors (semi-transparent versions)
      chartBgColors: [
        getCSSVar('--chart-bg-1', 'rgba(0, 105, 62, 0.5)'),
        getCSSVar('--chart-bg-2', 'rgba(38, 122, 186, 0.5)'),
        getCSSVar('--chart-bg-3', 'rgba(255, 160, 15, 0.5)'),
        getCSSVar('--chart-bg-4', 'rgba(157, 22, 46, 0.5)'),
        getCSSVar('--chart-bg-5', 'rgba(138, 105, 150, 0.5)'),
        getCSSVar('--chart-bg-6', 'rgba(165, 215, 95, 0.5)'),
      ],

      // Semantic colors
      positive: getCSSVar('--chart-positive', '#00693e'),
      negative: getCSSVar('--chart-negative', '#9d162e'),
      neutral: getCSSVar('--chart-neutral', '#424141'),
      highlight: getCSSVar('--chart-highlight', '#ffa00f'),

      // Grid and axis colors
      gridLight: getCSSVar('--chart-grid-light', 'rgba(0, 105, 62, 0.1)'),
      gridMedium: getCSSVar('--chart-grid-medium', 'rgba(0, 105, 62, 0.15)'),
      gridDark: getCSSVar('--chart-grid-dark', 'rgba(0, 105, 62, 0.2)'),
      axisColor: getCSSVar('--chart-axis-color', '#0a2518'),

      // Font
      fontFamily: getCSSVar('--chart-font-family', "'Avenir LT Std', 'Avenir', 'Avenir Next', -apple-system, BlinkMacSystemFont, sans-serif"),
    };
  }

  // Initialize palette (will be populated when DOM is ready)
  let CDL_PALETTE = null;
  // For convenience, expose primary chart colors array
  let CHART_COLORS = null;

  // ==========================================================================
  // FONT CONFIGURATION
  // Responsive font sizes based on typical Marp slide dimensions (1280x720)
  // ==========================================================================

  const FONT_CONFIG = {
    sizes: {
      title: 22,      // Chart title
      subtitle: 18,   // Subtitle
      legend: 16,     // Legend labels
      axisTitle: 18,  // Axis titles
      axisTicks: 16,  // Axis tick labels
      tooltip: 14,    // Tooltip text
      dataLabels: 14, // Data labels on charts
    },
    weight: {
      normal: 400,
      medium: 500,
      bold: 600,
    },
  };

  // ==========================================================================
  // HELPER FUNCTIONS
  // ==========================================================================

  /**
   * Ensure palette is initialized.
   * Lazily reads the CSS custom properties on first use and caches the result
   * in CDL_PALETTE / CHART_COLORS.
   * @returns {object} The cached palette.
   */
  function ensurePalette() {
    if (!CDL_PALETTE) {
      CDL_PALETTE = buildPaletteFromCSS();
      CHART_COLORS = CDL_PALETTE.chartColors;
    }
    return CDL_PALETTE;
  }

  /**
   * Get color for a dataset at given index.
   * Cycles through palette if more datasets than colors.
   * @param {number} index - Dataset index (any non-negative integer).
   * @returns {string} A color string from the palette cycle.
   */
  function getColor(index) {
    ensurePalette();
    return CHART_COLORS[index % CHART_COLORS.length];
  }

  /**
   * Get color with alpha transparency.
   * @param {string} color - '#rrggbb' hex or 'rgba(...)' string; anything else
   *   is returned unchanged.
   * @param {number} alpha - Alpha value to apply (0-1).
   * @returns {string} 'rgba(...)' string with the requested alpha.
   */
  function getColorWithAlpha(color, alpha) {
    // Handle hex colors
    if (color.startsWith('#')) {
      const r = parseInt(color.slice(1, 3), 16);
      const g = parseInt(color.slice(3, 5), 16);
      const b = parseInt(color.slice(5, 7), 16);
      return `rgba(${r}, ${g}, ${b}, ${alpha})`;
    }
    // Handle rgba colors
    // NOTE(review): this regex swaps the trailing number (the alpha) only;
    // assumes the input already has an alpha component, e.g. 'rgba(r, g, b, a)'
    // — a plain 'rgb(...)' string falls through unchanged. Confirm intended.
    if (color.startsWith('rgba')) {
      return color.replace(/[\d.]+\)$/, `${alpha})`);
    }
    return color;
  }

  /**
   * Generate colors for all datasets in chart data.
   * Automatically assigns colors if not specified; mutates the datasets
   * in place (only filling properties the caller left unset).
   * @param {object} data - Chart.js data object ({ labels, datasets }).
   * @param {string} chartType - Chart.js chart type ('bar', 'line', ...).
   * @returns {object} The same data object, with colors filled in.
   */
  function autoAssignColors(data, chartType) {
    if (!data || !data.datasets) return data;

    data.datasets.forEach((dataset, index) => {
      const baseColor = getColor(index);

      // Only assign colors if not already specified
      switch (chartType) {
        case 'bar':
        case 'horizontalBar':
          if (!dataset.backgroundColor) {
            dataset.backgroundColor = baseColor;
          }
          if (!dataset.borderColor) {
            dataset.borderColor = baseColor;
          }
          if (dataset.borderWidth === undefined) {
            dataset.borderWidth = 2;
          }
          break;

        case 'line':
          if (!dataset.borderColor) {
            dataset.borderColor = baseColor;
          }
          if (!dataset.backgroundColor) {
            dataset.backgroundColor = getColorWithAlpha(baseColor, 0.1);
          }
          if (dataset.borderWidth === undefined) {
            dataset.borderWidth = 3;
          }
          if (dataset.pointRadius === undefined) {
            dataset.pointRadius = 6;
          }
          if (!dataset.pointBackgroundColor) {
            dataset.pointBackgroundColor = baseColor;
          }
          if (dataset.tension === undefined) {
            dataset.tension = 0.3;
          }
          break;

        case 'scatter':
        case 'bubble':
          if (!dataset.backgroundColor) {
            dataset.backgroundColor = baseColor;
          }
          if (!dataset.borderColor) {
            dataset.borderColor = baseColor;
          }
          if (dataset.pointRadius === undefined) {
            dataset.pointRadius = 15;
          }
          if (dataset.pointHoverRadius === undefined) {
            dataset.pointHoverRadius = 18;
          }
          break;

        case 'pie':
        case 'doughnut':
        case 'polarArea':
          // For pie charts, we need multiple colors for one dataset
          if (!dataset.backgroundColor) {
            const numItems = dataset.data ? dataset.data.length : 6;
            dataset.backgroundColor = [];
            for (let i = 0; i < numItems; i++) {
              dataset.backgroundColor.push(getColor(i));
            }
          }
          if (!dataset.borderColor) {
            dataset.borderColor = '#d8d8d8'; // Slide background
          }
          if (dataset.borderWidth === undefined) {
            dataset.borderWidth = 2;
          }
          break;

        case 'radar':
          if (!dataset.borderColor) {
            dataset.borderColor = baseColor;
          }
          if (!dataset.backgroundColor) {
            dataset.backgroundColor = getColorWithAlpha(baseColor, 0.2);
          }
          if (dataset.borderWidth === undefined) {
            dataset.borderWidth = 2;
          }
          if (dataset.pointRadius === undefined) {
            dataset.pointRadius = 4;
          }
          if (!dataset.pointBackgroundColor) {
            dataset.pointBackgroundColor = baseColor;
          }
          break;

        default:
          // Generic color assignment
          if (!dataset.backgroundColor) {
            dataset.backgroundColor = baseColor;
          }
          if (!dataset.borderColor) {
            dataset.borderColor = baseColor;
          }
      }
    });

    return data;
  }

  // ==========================================================================
  // CHART.JS GLOBAL DEFAULTS
  // ==========================================================================

  /**
   * Apply theme fonts, colors and scale styling to Chart.defaults globally.
   * @returns {boolean} true if defaults were applied; false when Chart.js is
   *   not loaded yet.
   */
  function applyGlobalDefaults() {
    if (typeof Chart === 'undefined') {
      console.warn('Chart.js not loaded. chart-defaults.js requires Chart.js to be loaded first.');
      return false;
    }

    // Ensure palette is loaded from CSS
    const palette = ensurePalette();

    // Font defaults
    Chart.defaults.font.family = palette.fontFamily;
    Chart.defaults.font.size = FONT_CONFIG.sizes.axisTicks;
    Chart.defaults.color = palette.textPrimary;

    // Responsive defaults
    Chart.defaults.responsive = true;
    Chart.defaults.maintainAspectRatio = false;

    // Animation (subtle)
    Chart.defaults.animation.duration = 400;

    // Plugin defaults
    // Legend
    Chart.defaults.plugins.legend.labels.font = {
      family: palette.fontFamily,
      size: FONT_CONFIG.sizes.legend,
      weight: FONT_CONFIG.weight.normal,
    };
    Chart.defaults.plugins.legend.labels.color = palette.textPrimary;
    Chart.defaults.plugins.legend.labels.usePointStyle = true;
    Chart.defaults.plugins.legend.labels.padding = 20;

    // Title
    Chart.defaults.plugins.title.font = {
      family: palette.fontFamily,
      size: FONT_CONFIG.sizes.title,
      weight: FONT_CONFIG.weight.medium,
    };
    Chart.defaults.plugins.title.color = palette.textPrimary;

    // Tooltip
    Chart.defaults.plugins.tooltip.backgroundColor = palette.textPrimary;
    Chart.defaults.plugins.tooltip.titleFont = {
      family: palette.fontFamily,
      size: FONT_CONFIG.sizes.tooltip,
      weight: FONT_CONFIG.weight.medium,
    };
    Chart.defaults.plugins.tooltip.bodyFont = {
      family: palette.fontFamily,
      size: FONT_CONFIG.sizes.tooltip,
    };
    Chart.defaults.plugins.tooltip.cornerRadius = 4;
    Chart.defaults.plugins.tooltip.padding = 10;

    // Scale defaults (for cartesian charts)
    // These need to be applied per-scale type
    const scaleDefaults = {
      grid: {
        color: palette.gridLight,
        lineWidth: 1,
      },
      border: {
        color: palette.gridDark,
        width: 1,
      },
      ticks: {
        font: {
          family: palette.fontFamily,
          size: FONT_CONFIG.sizes.axisTicks,
        },
        color: palette.textPrimary,
      },
      title: {
        font: {
          family: palette.fontFamily,
          size: FONT_CONFIG.sizes.axisTitle,
          weight: FONT_CONFIG.weight.normal,
        },
        color: palette.textPrimary,
      },
    };

    // Apply scale defaults to linear scale
    if (Chart.defaults.scales && Chart.defaults.scales.linear) {
      if (Chart.defaults.scales.linear.grid) Object.assign(Chart.defaults.scales.linear.grid, scaleDefaults.grid);
      if (Chart.defaults.scales.linear.border) Object.assign(Chart.defaults.scales.linear.border, scaleDefaults.border);
      if (Chart.defaults.scales.linear.ticks) Object.assign(Chart.defaults.scales.linear.ticks, scaleDefaults.ticks);
      if (Chart.defaults.scales.linear.title) Object.assign(Chart.defaults.scales.linear.title, scaleDefaults.title);
    }

    // Apply scale defaults to category scale
    if (Chart.defaults.scales && Chart.defaults.scales.category) {
      if (Chart.defaults.scales.category.grid) Object.assign(Chart.defaults.scales.category.grid, scaleDefaults.grid);
      if (Chart.defaults.scales.category.border) Object.assign(Chart.defaults.scales.category.border, scaleDefaults.border);
      if (Chart.defaults.scales.category.ticks) Object.assign(Chart.defaults.scales.category.ticks, scaleDefaults.ticks);
      if (Chart.defaults.scales.category.title) Object.assign(Chart.defaults.scales.category.title, scaleDefaults.title);
    }

    // Apply scale defaults to logarithmic scale
    if (Chart.defaults.scales && Chart.defaults.scales.logarithmic) {
      if (Chart.defaults.scales.logarithmic.grid) Object.assign(Chart.defaults.scales.logarithmic.grid, scaleDefaults.grid);
      if (Chart.defaults.scales.logarithmic.border) Object.assign(Chart.defaults.scales.logarithmic.border, scaleDefaults.border);
      if (Chart.defaults.scales.logarithmic.ticks) Object.assign(Chart.defaults.scales.logarithmic.ticks, scaleDefaults.ticks);
      if (Chart.defaults.scales.logarithmic.title) Object.assign(Chart.defaults.scales.logarithmic.title, scaleDefaults.title);
    }

    // Apply scale defaults to radial scale (for radar charts)
    if (Chart.defaults.scales && Chart.defaults.scales.radialLinear) {
      if (Chart.defaults.scales.radialLinear.grid) Chart.defaults.scales.radialLinear.grid.color = palette.gridLight;
      if (Chart.defaults.scales.radialLinear.angleLines) Chart.defaults.scales.radialLinear.angleLines.color = palette.gridMedium;
      if (Chart.defaults.scales.radialLinear.pointLabels) {
        Chart.defaults.scales.radialLinear.pointLabels.font = {
          family: palette.fontFamily,
          size: FONT_CONFIG.sizes.axisTicks,
        };
        Chart.defaults.scales.radialLinear.pointLabels.color = palette.textPrimary;
      }
    }

    return true;
  }

  // ==========================================================================
  // CHART WRAPPER FOR AUTO-STYLING
  // ==========================================================================

  /**
   * Wrap the Chart constructor to automatically apply CDL styling.
   * The wrapper mutates the caller's config (data colors + type defaults)
   * before delegating to the original constructor.
   */
  function wrapChartConstructor() {
    if (typeof Chart === 'undefined') return;

    const OriginalChart = Chart;

    // Create a wrapper that auto-applies colors
    window.Chart = function(ctx, config) {
      // Auto-assign colors if not specified
      if (config && config.data) {
        config.data = autoAssignColors(config.data, config.type);
      }

      // Merge default options for specific chart types
      if (config && config.options) {
        config.options = applyChartTypeDefaults(config.type, config.options);
      }

      // Call original constructor
      return new OriginalChart(ctx, config);
    };

    // Copy static properties and methods
    // NOTE(review): Object.assign copies only own enumerable statics; the
    // setPrototypeOf call below is what makes non-enumerable class statics
    // (e.g. Chart.register) reachable through the wrapper — confirm intended.
    Object.setPrototypeOf(window.Chart, OriginalChart);
    Object.assign(window.Chart, OriginalChart);

    // Preserve the prototype chain
    window.Chart.prototype = OriginalChart.prototype;
  }

  /**
   * Apply chart-type specific defaults.
   * Shallow-copies userOptions and fills in per-type defaults only where the
   * caller did not specify them.
   * @param {string} chartType - Chart.js chart type.
   * @param {object} userOptions - Caller-supplied options object.
   * @returns {object} Options with type-specific defaults merged in.
   */
  function applyChartTypeDefaults(chartType, userOptions) {
    const options = { ...userOptions };

    switch (chartType) {
      case 'bar':
      case 'horizontalBar':
        // Bar chart defaults
        if (!options.scales) options.scales = {};
        if (!options.scales.x) options.scales.x = {};
        if (!options.scales.y) options.scales.y = {};
        // Hide x-axis grid for cleaner look
        if (options.scales.x.grid === undefined) {
          options.scales.x.grid = { display: false };
        }
        break;

      case 'line':
        // Line chart defaults
        if (!options.interaction) {
          options.interaction = { intersect: false, mode: 'index' };
        }
        break;

      case 'pie':
      case 'doughnut':
        // Pie/doughnut defaults
        if (!options.plugins) options.plugins = {};
        if (options.plugins.legend === undefined) {
          const palette = ensurePalette();
          options.plugins.legend = {
            position: 'right',
            labels: {
              font: {
                family: palette.fontFamily,
                size: FONT_CONFIG.sizes.legend,
              },
              color: palette.textPrimary,
              padding: 15,
            },
          };
        }
        break;

      case 'radar':
        // Radar chart defaults - keep as-is, scale defaults applied globally
        break;

      case 'scatter':
      case 'bubble':
        // Scatter/bubble defaults
        if (!options.scales) options.scales = {};
        if (!options.scales.x) options.scales.x = {};
        if (!options.scales.y) options.scales.y = {};
        break;
    }

    return options;
  }

  // ==========================================================================
  // CONVENIENCE FUNCTIONS FOR USERS
  // Exposed on window.CDLChart for easy access
  // ==========================================================================

  window.CDLChart = {
    // Color palette access (getters to ensure lazy initialization)
    get colors() { return ensurePalette().chartColors; },
    get palette() { return ensurePalette(); },

    // Get specific color by index
    getColor: getColor,

    // Get color with transparency
    getColorWithAlpha: getColorWithAlpha,

    // Get array of colors for a specific count
    getColors: function(count) {
      ensurePalette();
      const result = [];
      for (let i = 0; i < count; i++) {
        result.push(getColor(i));
      }
      return result;
    },

    // Font configuration
    fonts: FONT_CONFIG,

    // Quick chart creation helpers
    // These create minimal config that auto-applies all styling

    /**
     * Create a simple bar chart
     * @param {string} canvasId - Canvas element ID
     * @param {string[]} labels - X-axis labels
     * @param {number[]} data - Data values
     * @param {object} options - Optional overrides
     */
    bar: function(canvasId, labels, data, options = {}) {
      return new Chart(document.getElementById(canvasId), {
        type: 'bar',
        data: {
          labels: labels,
          datasets: [{ data: data }],
        },
        options: {
          plugins: { legend: { display: false } },
          ...options,
        },
      });
    },

    /**
     * Create a simple line chart
     * @param {string} canvasId - Canvas element ID
     * @param {string[]} labels - X-axis labels
     * @param {Array} datasets - Array of {label, data} objects
     * @param {object} options - Optional overrides
     */
    line: function(canvasId, labels, datasets, options = {}) {
      return new Chart(document.getElementById(canvasId), {
        type: 'line',
        data: {
          labels: labels,
          datasets: datasets.map(ds => ({
            label: ds.label,
            data: ds.data,
            fill: ds.fill !== undefined ? ds.fill : true,
          })),
        },
        options: options,
      });
    },

    /**
     * Create a simple pie chart
     * @param {string} canvasId - Canvas element ID
     * @param {string[]} labels - Slice labels
     * @param {number[]} data - Data values
     * @param {object} options - Optional overrides
     */
    pie: function(canvasId, labels, data, options = {}) {
      return new Chart(document.getElementById(canvasId), {
        type: 'pie',
        data: {
          labels: labels,
          datasets: [{ data: data }],
        },
        options: options,
      });
    },

    /**
     * Create a simple scatter chart
     * @param {string} canvasId - Canvas element ID
     * @param {Array} datasets - Array of {label, data: [{x, y}]} objects
     * @param {object} options - Optional overrides
     */
    scatter: function(canvasId, datasets, options = {}) {
      return new Chart(document.getElementById(canvasId), {
        type: 'scatter',
        data: {
          datasets: datasets.map(ds => ({
            label: ds.label,
            data: ds.data,
          })),
        },
        options: options,
      });
    },

    /**
     * Create a doughnut chart
     * @param {string} canvasId - Canvas element ID
     * @param {string[]} labels - Slice labels
     * @param {number[]} data - Data values
     * @param {object} options - Optional overrides
     */
    doughnut: function(canvasId, labels, data, options = {}) {
      return new Chart(document.getElementById(canvasId), {
        type: 'doughnut',
        data: {
          labels: labels,
          datasets: [{ data: data }],
        },
        options: options,
      });
    },

    /**
     * Create a radar chart
     * @param {string} canvasId - Canvas element ID
     * @param {string[]} labels - Axis labels
     * @param {Array} datasets - Array of {label, data} objects
     * @param {object} options - Optional overrides
     */
    radar: function(canvasId, labels, datasets, options = {}) {
      return new Chart(document.getElementById(canvasId), {
        type: 'radar',
        data: {
          labels: labels,
          datasets: datasets.map(ds => ({
            label: ds.label,
            data: ds.data,
          })),
        },
        options: options,
      });
    },
  };

  // ==========================================================================
  // INITIALIZATION
  // ==========================================================================

  /**
   * Apply defaults now if Chart.js is present; otherwise poll for it
   * (every 100 ms, up to 50 tries) and apply once it appears.
   * @returns {boolean} true when applied synchronously, false when deferred.
   */
  function initialize() {
    // Wait for Chart.js to be available
    if (typeof Chart !== 'undefined') {
      applyGlobalDefaults();
      wrapChartConstructor();
      console.log('CDL Chart defaults applied successfully.');
      return true;
    } else {
      // Chart.js not yet loaded - wait and retry
      let retries = 0;
      const maxRetries = 50; // 5 seconds max wait
      const checkInterval = setInterval(function() {
        retries++;
        if (typeof Chart !== 'undefined') {
          clearInterval(checkInterval);
          applyGlobalDefaults();
          wrapChartConstructor();
          console.log('CDL Chart defaults applied successfully (after waiting for Chart.js).');
        } else if (retries >= maxRetries) {
          clearInterval(checkInterval);
          console.warn('Chart.js not found after waiting. CDL Chart defaults not applied.');
        }
      }, 100);
      return false;
    }
  }

  // Initialize IMMEDIATELY - this must run BEFORE any chart creation scripts
  // Chart.js CDN should be loaded before this script
  initialize();
})();
PSYC 51.07: Models of Language and Communication - Week 4

Contextual Embeddings: ELMo, USE, BERT

Lecture 12: Beyond Static Word Representations

PSYC 51.07: Models of Language and Communication - Week 4

Winter 2026

Winter 2026
PSYC 51.07: Models of Language and Communication - Week 4

Today's Lecture 📋

  1. 🔄 From Static to Contextual
  2. 🧬 Language Models as Feature Extractors
  3. 🔮 ELMo: Embeddings from Language Models
  4. 🌐 Universal Sentence Encoder
  5. 🎭 BERT: Bidirectional Transformers
  6. 📊 Comparison & Applications

Goal: Understand how context transforms word representation

Winter 2026
PSYC 51.07: Models of Language and Communication - Week 4

The Polysemy Problem Revisited 🤔

Recall: Static embeddings assign ONE vector per word

Example: "bank"

  1. "I deposited money at the bank"
    (financial institution)
  2. "We sat by the river bank"
    (riverside)
  3. "The plane will bank left"
    (tilt/turn)

Word2Vec/GloVe: All three get the SAME vector!

Contextual embeddings: Each gets a DIFFERENT vector based on context

Cosine Similarity Demo:


1# Static embeddings (Word2Vec)
2sim(bank_sent1, bank_sent2) = 1.0  # Same!
3
4# Contextual embeddings (BERT)
5sim(bank_sent1, bank_sent2) = 0.45
6sim(bank_sent1, bank_sent3) = 0.32
7sim(bank_sent2, bank_sent3) = 0.28
8
9# "bank" (financial) is closer to
10# "bank" (river) than to "bank" (tilt)
11# because nouns are more similar!
Winter 2026
PSYC 51.07: Models of Language and Communication - Week 4

Static vs. Contextual Embeddings 🔄

Static (Word2Vec, GloVe, FastText):


1# Same vector every time
2model = Word2Vec(...)
3vec1 = model['bank']
4vec2 = model['bank']
5
6assert vec1 == vec2  # True!

Characteristics:

  • One vector per word type
  • Context-independent
  • Fast lookup (dictionary)
  • Fixed after training
  • Polysemy conflation

Contextual (ELMo, BERT):


1# Different vector per occurrence
2model = BertModel.from_pretrained('bert-base')
3
4sent1 = "river bank"
5sent2 = "money bank"
6
7vec1 = get_embedding(model, sent1, 'bank')
8vec2 = get_embedding(model, sent2, 'bank')
9
10assert vec1 != vec2  # True!

Characteristics:

  • Different vector per occurrence
  • Context-dependent
  • Requires forward pass
  • Dynamic representations
  • Handles polysemy naturally
Winter 2026
PSYC 51.07: Models of Language and Communication - Week 4

Language Models as Feature Extractors 🧬

Key Insight: Train a language model, use its internal states as embeddings

Language Modeling Task:

Predict the next word given previous words:

Worked Example

Input: "The cat sat on the ___"

Model predicts probabilities:
"mat" 0.25
"floor" 0.18
"couch" 0.12
"dog" 0.001

To predict well, the model learns: "sat on the" suggests a surface!

Why This Works:

  • To predict "mat", model must encode that "sat on" precedes surfaces
  • Hidden states capture this contextual understanding
  • We extract these rich hidden states as embeddings!
Winter 2026
PSYC 51.07: Models of Language and Communication - Week 4

ELMo: Embeddings from Language Models 🔮

The first widely-adopted contextual embedding (2018)

Key Ideas:

  1. Train deep bidirectional language model
  2. Use all layer activations
  3. Weighted combination per task
  4. Pre-train on large corpus

Architecture:

  • 2-layer biLSTM
  • Forward LM: $p(t_k \mid t_1, \ldots, t_{k-1})$
  • Backward LM: $p(t_k \mid t_{k+1}, \ldots, t_N)$
  • Character-based input (handles OOV!)

Bidirectional Processing:


1Forward:  The → cat → sat → ...
2Backward: ... ← sat ← cat ← The

Each word gets info from BOTH directions!

Layer Weighting Example:

For sentiment task, ELMo might learn:

  • Layer 0 (characters): weight = 0.1
  • Layer 1 (syntax): weight = 0.3
  • Layer 2 (semantics): weight = 0.6

Higher layers matter more for meaning!

Reference: Peters et al. (2018). "Deep contextualized word representations"

Winter 2026
PSYC 51.07: Models of Language and Communication - Week 4

ELMo: How It Works 🔍

Training:

  1. Pre-train on large corpus (1B Word Benchmark)
  2. Each position gets representation from all layers

Usage (downstream tasks):

  1. Freeze ELMo weights
  2. For each token, extract representations from all layers
  3. Learn task-specific weighted combination
  4. Concatenate with task model
Concrete Example: Sentiment Analysis

Input: "The movie was absolutely terrible"

Token Char Emb Layer 1 Layer 2 Weighted Sum
terrible [0.1, ...] [0.3, ...] [-0.8, ...] [-0.5, ...]

The final representation captures that "terrible" is strongly negative in this context!

Impact: Improved state-of-the-art on 6 NLP tasks!

Winter 2026
PSYC 51.07: Models of Language and Communication - Week 4

ELMo in Practice 💻


1from allennlp.modules.elmo import Elmo, batch_to_ids
2
3# Initialize ELMo
4options_file = "https://s3-us-west-2.amazonaws.com/allennlp/models/elmo/2x4096_512_2048cnn_2xhighway/elmo_2x4096_512_2048cnn_2xhighway_options.json"
5weight_file = "https://s3-us-west-2.amazonaws.com/allennlp/models/elmo/2x4096_512_2048cnn_2xhighway/elmo_2x4096_512_2048cnn_2xhighway_weights.hdf5"
6
7elmo = Elmo(options_file, weight_file, 2, dropout=0)
8
9# Prepare sentences
10sentences = [
11    ['I', 'deposited', 'money', 'at', 'the', 'bank'],
12    ['We', 'sat', 'by', 'the', 'river', 'bank']
13]
14
15# Convert to character ids
16character_ids = batch_to_ids(sentences)
17
18# Get embeddings
19embeddings = elmo(character_ids)
20
continued...
Winter 2026
PSYC 51.07: Models of Language and Communication - Week 4

ELMo in Practice 💻


21# embeddings['elmo_representations'] contains:
22# - List of 2 tensors (one per layer)
23# - Shape: [batch_size, seq_len, 1024]
24
25# Different vectors for "bank"!
26bank1 = embeddings['elmo_representations'][0][0, 5, :]  # first sentence
27bank2 = embeddings['elmo_representations'][0][1, 5, :]  # second sentence
28
29# Cosine similarity will be lower than for static embeddings
...continued
Winter 2026
PSYC 51.07: Models of Language and Communication - Week 4

Universal Sentence Encoder (USE) 🌐

Sentence-level embeddings for semantic similarity

Motivation:

  • Word embeddings: good for words
  • But what about sentences?
  • Average of word vectors? Too simple!
  • Need compositionality

Two Variants:

1. Transformer-based:

  • Higher accuracy
  • Slower (compute intensive)
  • Better for quality

2. Deep Averaging Network (DAN):

  • Lower accuracy
  • Much faster
  • Better for scale

Training Objectives:

  1. Unsupervised: Skip-thought
  2. Supervised: SNLI (entailment)
  3. Multi-task learning

Output:

  • 512-dimensional vector
  • Fixed-length (any sentence length)
  • Optimized for similarity tasks

Use Cases:

  • Semantic search
  • Question answering
  • Text classification
  • Clustering
  • Duplicate detection

Reference: Cer et al. (2018). "Universal Sentence Encoder"

Winter 2026
PSYC 51.07: Models of Language and Communication - Week 4

Universal Sentence Encoder in Practice 💻


1import tensorflow_hub as hub
2import numpy as np
3
4# Load model
5embed = hub.load("https://tfhub.dev/google/universal-sentence-encoder/4")
6
7# Example sentences
8sentences = [
9    "The cat sat on the mat.",
10    "A feline rested on the rug.",
11    "The dog ran in the park.",
12    "I love machine learning."
13]
14
15# Generate embeddings
16embeddings = embed(sentences)
17
18# Shape: [4, 512]
19print(embeddings.shape)
20
continued...
Winter 2026
PSYC 51.07: Models of Language and Communication - Week 4

Universal Sentence Encoder in Practice 💻


21# Compute similarity
22from sklearn.metrics.pairwise import cosine_similarity
23
24sim_matrix = cosine_similarity(embeddings)
25print(sim_matrix)
26
27# Sentences 1 and 2 should be very similar (paraphrases)
28# Sentence 3 somewhat similar (animals)
29# Sentence 4 dissimilar
30
31# Use for semantic search
32query = "cat on mat"
33query_embedding = embed([query])
34similarities = cosine_similarity(query_embedding, embeddings)[0]
35most_similar_idx = np.argmax(similarities)
36print(f"Most similar: {sentences[most_similar_idx]}")
...continued
Winter 2026
PSYC 51.07: Models of Language and Communication - Week 4

BERT: Bidirectional Encoder Representations 🎭

The model that changed everything (2018)

Key Innovations:

  1. Bidirectional context (not just left-to-right)

  2. Transformer architecture (attention)

  3. Masked language model pre-training

  4. Deeply bidirectional

Impact:

  • SOTA on 11 NLP tasks
  • Sparked the "BERT-era"
  • 1000+ variants (RoBERTa, ALBERT, DistilBERT, ...)
  • Foundation for modern LLMs

Architecture Sizes:

Model Layers Hidden size
BERT-Base 12 768
BERT-Large 24 1024

Training Data:

  • BooksCorpus (800M words)
  • English Wikipedia (2.5B words)
  • Total: 3.3B words

Training Time:

  • 4 days on 64 TPU chips
  • Or 4 weeks on 8 GPUs

Reference: Devlin et al. (2018). "BERT: Pre-training of Deep Bidirectional Transformers"

Winter 2026
PSYC 51.07: Models of Language and Communication - Week 4

Masked Language Modeling (MLM) 🎭

BERT's key training innovation

The Problem with Traditional LM:

  • Left-to-right: Only sees previous words
  • Right-to-left: Only sees next words
  • Want: See both directions simultaneously
  • But: Can't just show the answer during training!

Solution: Mask some words, predict them

Worked Example: MLM Training

Original: "The cat sat on the mat"

Step 1: Randomly select 15% of tokens → "cat" selected

Step 2: Apply masking strategy (80/10/10 rule):

  • 80% chance: "The [MASK] sat on the mat"
  • 10% chance: "The dog sat on the mat" (random word)
  • 10% chance: "The cat sat on the mat" (unchanged)

Step 3: Model sees full context both ways to predict "cat":


1← The [MASK] sat on the mat →
2
3   predict "cat"

Training Procedure:

  1. Randomly select 15% of tokens
  2. Replace 80% with [MASK], 10% with random word, 10% unchanged
  3. Predict original tokens
  4. Bidirectional context!
Winter 2026
PSYC 51.07: Models of Language and Communication - Week 4

Next Sentence Prediction (NSP) 🔗

Second pre-training task: Understand sentence relationships

Task: Given two sentences A and B, predict if B follows A in the text

Positive Example (IsNext)

Sentence A: "The cat sat on the mat."

Sentence B: "It was sleeping peacefully."

Label: IsNext ✓

Negative Example (NotNext)

Sentence A: "The cat sat on the mat."

Sentence B: "Machine learning is fascinating."

Label: NotNext ✗

Why This Helps:

  • Captures discourse relationships
  • Useful for QA, NLI, etc.
  • Models sentence-level coherence
  • Note: Later research (RoBERTa) found NSP less important than MLM
Winter 2026
PSYC 51.07: Models of Language and Communication - Week 4

BERT Architecture 🏗️

Three types of embeddings are summed for each token:

Worked Example: Input Representation

Input: "[CLS] I love NLP [SEP] It is fun [SEP]"

Token Token ID Segment Position Final Embedding
[CLS] E_CLS A 0 E_CLS + E_A + E_0
I E_I A 1 E_I + E_A + E_1
love E_love A 2 E_love + E_A + E_2
NLP E_NLP A 3 E_NLP + E_A + E_3
[SEP] E_SEP A 4 E_SEP + E_A + E_4
It E_It B 5 E_It + E_B + E_5
is E_is B 6 E_is + E_B + E_6
fun E_fun B 7 E_fun + E_B + E_7
continued...
Winter 2026
PSYC 51.07: Models of Language and Communication - Week 4

BERT Architecture 🏗️

Token Token ID Segment Position Final Embedding
[SEP] E_SEP B 8 E_SEP + E_B + E_8
...continued
  • Token Embedding: What word is this?
  • Segment Embedding: Which sentence (A or B)?
  • Position Embedding: Where in the sequence?
Winter 2026
PSYC 51.07: Models of Language and Communication - Week 4

BERT in Practice 💻


1from transformers import BertTokenizer, BertModel
2import torch
3
4# Load pre-trained BERT
5tokenizer = BertTokenizer.from_pretrained('bert-base-uncased')
6model = BertModel.from_pretrained('bert-base-uncased')
7
8# Example sentences with "bank"
9sent1 = "I deposited money at the bank"
10sent2 = "We sat by the river bank"
11
12# Tokenize
13tokens1 = tokenizer(sent1, return_tensors='pt')
14tokens2 = tokenizer(sent2, return_tensors='pt')
15
16# Get embeddings
17with torch.no_grad():
18    output1 = model(**tokens1)
19    output2 = model(**tokens2)
20
continued...
Winter 2026
PSYC 51.07: Models of Language and Communication - Week 4

BERT in Practice 💻


21# Last hidden state: [batch_size, seq_len, hidden_size]
22embeddings1 = output1.last_hidden_state
23embeddings2 = output2.last_hidden_state
24
25# Extract "bank" embedding (position varies)
26# tokens1: [CLS] i deposited money at the bank [SEP]
27bank1_embedding = embeddings1[0, 6, :]  # 768-dim vector
28
29# tokens2: [CLS] we sat by the river bank [SEP]
30bank2_embedding = embeddings2[0, 6, :]  # 768-dim vector
31
32# Different vectors for "bank"!
33from torch.nn.functional import cosine_similarity
34sim = cosine_similarity(bank1_embedding, bank2_embedding, dim=0)
35print(f"Similarity: {sim:.3f}")  # Lower than with static embeddings
...continued
Winter 2026
PSYC 51.07: Models of Language and Communication - Week 4

Fine-tuning BERT 🎯

Two ways to use BERT:

1. Feature Extraction:

  • Freeze BERT weights
  • Use embeddings as features
  • Train classifier on top
  • Faster, less data needed

1# Freeze BERT
2for param in bert_model.parameters():
3    param.requires_grad = False
4
5# Add classifier
6classifier = nn.Linear(768, num_classes)
7
8# Train only classifier
9optimizer = Adam(classifier.parameters())

2. Fine-tuning:

  • Update BERT weights
  • Add task-specific head
  • Train end-to-end
  • Better performance, more data needed

1# Keep BERT trainable
2bert_model = BertModel.from_pretrained(
3    'bert-base-uncased'
4)
5
6# Add classifier
7classifier = nn.Linear(768, num_classes)
8
9# Train everything
10optimizer = Adam(
11    list(bert_model.parameters()) +
12    list(classifier.parameters()),
13    lr=2e-5  # Small learning rate!
14)

Best Practice: Fine-tune with small learning rate (1e-5 to 5e-5) for few epochs (2-4)

Winter 2026
PSYC 51.07: Models of Language and Communication - Week 4

Contextual Embeddings Comparison 📊

| Feature        | ELMo                   | USE        | BERT      |
|----------------|------------------------|------------|-----------|
| Pre-training   | LM (forward+backward)  | Multi-task | MLM + NSP |
| Bidirectional  | Shallow                | Yes        | Deep      |
| Granularity    | Token                  | Sentence   | Token     |
| Hidden size    | 1024                   | 512        | 768/1024  |
| Parameters     | 93M                    | 256M       | 110M/340M |
| Speed          | Medium                 | Fast       | Slow      |
| OOV handling   | Characters             | Subwords   | WordPiece |
| Year           | 2018                   | 2018       | 2018      |
Recommendations
  • ELMo: Legacy systems, character-aware needs
  • USE: Sentence similarity, semantic search, fast inference
  • BERT: State-of-the-art quality, most NLP tasks (now superseded by larger models)
Winter 2026
PSYC 51.07: Models of Language and Communication - Week 4

Impact on NLP 🌟

BERT revolutionized NLP:

Before BERT (pre-2018):

  • Task-specific architectures
  • Train from scratch
  • Static embeddings (Word2Vec, GloVe)
  • Limited transfer learning
  • Moderate performance

Tasks that improved:

  • Question Answering (+1.5 F1 on SQuAD)
  • NER (+0.3 F1)
  • Sentiment Analysis (+2% accuracy)
  • NLI (+4% accuracy)
  • Many others!

After BERT (post-2018):

  • Pre-train, then fine-tune
  • Transfer learning standard
  • Contextual embeddings
  • Massive pre-trained models
  • State-of-the-art results

BERT Variants:

  • RoBERTa: Better training
  • ALBERT: Parameter sharing
  • DistilBERT: Smaller, faster
  • ELECTRA: Different pre-training
  • DeBERTa: Disentangled attention
  • And 100+ more!
Winter 2026
PSYC 51.07: Models of Language and Communication - Week 4

Real-World Applications 🚀

1. Google Search:


1Query: "can you get medicine for
2        someone pharmacy"
3
4BERT understands: picking up a
5prescription FOR someone else
6
7Before BERT: matched "medicine"
8and "pharmacy" keywords only

2. Question Answering:


1Context: "The Eiffel Tower was
2built in 1889 by Gustave Eiffel."
3
4Q: "When was the Eiffel Tower built?"
5A: "1889" ← BERT extracts this span

3. Sentiment Analysis:


1# Fine-tuned BERT
2text = "Not bad at all!"
3prediction = model(text)
4# → Positive (understands negation!)

4. Named Entity Recognition:


1Input: "Apple CEO Tim Cook announced..."
2
3Output:
4  Apple     → ORG
5  Tim Cook  → PERSON

5. Semantic Search:


1Query: "affordable laptop for students"
2Matches: "budget-friendly notebook
3          for college" ← synonyms!
Winter 2026
PSYC 51.07: Models of Language and Communication - Week 4

Discussion Question 💬

Do contextual embeddings truly "understand" language?

Consider:

  • BERT can distinguish "bank" (financial) from "bank" (riverside)
  • It achieves human-level performance on many benchmarks
  • But it's trained only on text co-occurrence patterns

Arguments For:

  • Captures complex semantic relationships
  • Generalizes to new contexts
  • Emergent linguistic capabilities
  • Handles compositional meaning

Arguments Against:

  • No grounding in physical world
  • No common sense reasoning
  • Exploits statistical shortcuts
  • Brittle to adversarial examples
  • "Stochastic parrots"?
The Grounding Problem Persists

Even with contextual embeddings, we still lack true grounding in experience, perception, and embodied cognition.

Reference: Bender & Koller (2020). "Climbing towards NLU: On Meaning, Form, and Understanding"

Winter 2026
PSYC 51.07: Models of Language and Communication - Week 4

Practical Tips 💡

  1. Choosing a Model:
    • BERT-base: Good balance, 110M params
    • DistilBERT: 40% smaller, 60% faster, 97% performance
    • RoBERTa: Better than BERT, longer training
    • Domain-specific: BioBERT, SciBERT, FinBERT, etc.
  2. Fine-tuning Best Practices:
    • Small learning rate (2e-5 typical)
    • Few epochs (2-4)
    • Batch size: 16 or 32
    • Warm-up steps
    • Gradient clipping
  3. Computational Considerations:
    • BERT-base: ~110M params, 512 max tokens
    • Needs GPU (1-4 GB VRAM minimum)
    • Batching for efficiency
    • Consider DistilBERT for production
  4. Using HuggingFace:
    • Easy access to 1000+ pre-trained models
    • Standardized API
    • Good documentation and community
Winter 2026
PSYC 51.07: Models of Language and Communication - Week 4

Summary 🎯

What we learned today:

  1. Contextual vs. Static: Different vectors per occurrence
  2. ELMo (2018):
    • BiLSTM language models
    • Character-based, handles OOV
    • Task-specific weighting
  3. Universal Sentence Encoder (2018):
    • Sentence-level embeddings
    • Two variants: Transformer & DAN
    • Optimized for semantic similarity
  4. BERT (2018):
    • Masked language modeling
    • Deep bidirectional transformers
    • Pre-train + fine-tune paradigm
    • Revolutionized NLP
  5. Impact: Established modern transfer learning in NLP
  6. Next: Dimensionality reduction techniques for visualization
Winter 2026
PSYC 51.07: Models of Language and Communication - Week 4

Key References 📚

Foundational Papers:

  • Peters et al. (2018). "Deep contextualized word representations" (ELMo)
  • Cer et al. (2018). "Universal Sentence Encoder"
  • Devlin et al. (2018). "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

BERT Variants:

  • Liu et al. (2019). "RoBERTa: A Robustly Optimized BERT Pretraining Approach"
  • Sanh et al. (2019). "DistilBERT, a distilled version of BERT"
  • Lan et al. (2019). "ALBERT: A Lite BERT for Self-supervised Learning"

Critical Perspectives:

  • Bender & Koller (2020). "Climbing towards NLU: On Meaning, Form, and Understanding"
  • Bender et al. (2021). "On the Dangers of Stochastic Parrots"

Resources:

Winter 2026
PSYC 51.07: Models of Language and Communication - Week 4

Questions? 🙋

Next Lecture:

Dimensionality Reduction: PCA, t-SNE, UMAP

Visualizing high-dimensional embeddings!

Winter 2026