A ‘hands-off’ approach �to modelling constructional change �with word embeddings
Lauren Fonteyn & Enrique Manjavacas
‘Modelling constructional variation and change’, 15-16 November 2021, Zurich
Introduction
Introduction
Aims
Outline
Case study: grammaticalization of to death ☠️
😵 she was by the godesse wounded to death. (EEBO, 1641)
🙁 That look of yours frightens me to death. (CLMET3.1, 1750)
🙂 I have a new toy and I’m tickled to death. (Gutenberg, 1913)
decrease in compositionality: literal death > endpoint, extreme
negative meaning of source persists
host-class expansion (increase in productivity/schematicity):
diversification of collocate verbs (lexical & semantic)
Grammaticalization of phrasal expression to death (Hoeksema & Jo Napoli 2008; Claridge 2011; Margerie 2011; Blanco Suárez 2017):
Case study: why?
Data: corpora
Data: corpora
Data: to death
| 1550 | 1600 | 1650 | 1700 | 1750 | 1800 | 1850 | 1900 |
CLMET3.1 |
|
|
| 39 | 45 | 182 | 100 | 26 |
COHA |
|
|
|
|
| 488 | 700 | 774 |
ECCO |
|
|
| 78 | 395 | 12 |
|
|
EEBO | 800 | 800 | 794 | 413 |
| 2 |
|
|
EVANS |
|
| 6 | 211 | 360 | 116 |
|
|
Total | 800 | 800 | 800 | 741 | 800 | 800 | 800 | 800 |
Type freq | 87 | 101 | 93 | 95 | 97 | 150 | 131 | 135 |
Data: to death
Data: to death
Data: to death
physical actions | mental verbs | ||||
burn | stab | whip | amuse | scare | vex |
beat (0.57) | strangle (0.59) | cudgel (0.69) | delude (0.73) | frighten (0.78) | afflict (0.72) |
kill (0.57) | knife (0.59) | bludgeon (0.66) | flatter (0.63) | terrify (0.73) | perplex (0.72) |
consume (0.56) | bleed (0.58) | lash (0.66) | perplex (0.61) | startle (0.67) | harass (0.71) |
scorch (0.55) | slash (0.58) | kick (0.59) | terrify (0.60) | worry (0.55) | annoy (0.69) |
shoot (0.55) | bang (0.56) | cuff (0.57) | frighten (0.60) | drive (0.54) | oppress (0.69) |
spoil (0.53) | kill (0.55) | spur (0.57) | tickle (0.58) | sweep (0.52) | fret (0.67) |
smother (0.53) | poison (0.55) | flog (0.56) | harass (0.54) | delude (0.51) | grieve (0.64) |
smoke (0.53) | bite (0.55) | bang (0.55) | tire (0.54) | astonish (0.51) | terrify (0.61) |
hunt (0.53) | cudgel (0.54) | goad (0.55) | annoy (0.52) | annoy (0.50) | pester (0.60) |
hang (0.53) | prick (0.54) | scourge (0.54) | vex (0.51) | amuse (0.50) | worry (0.58) |
Method: diachronic cluster analysis
Assumption: clusters in distributional space reproduce semantic fields
Approach: monitor changes in the semantic space using clustering metrics
Method: diachronic cluster analysis
With increasing host-class expansion, we expect
Method: diachronic cluster analysis
Operationalization of change in terms of silhouette
Silhouette captures density from two angles
Method: diachronic cluster analysis
What about number of clusters?
Strategy: hands off!
Method: diachronic cluster analysis
Mediation of type frequency
Collocate type frequency
Increased type frequency = ?
Method: diachronic cluster analysis
Fixing the shape of the semantic space …
… the optimal number depends on the sample size
Method: diachronic cluster analysis
Change
Type Frequency
Number of Clusters
?
Method: diachronic cluster analysis
Bootstrap simulation of variation in type frequency
Method: diachronic cluster analysis
The S-curve
Modeling choice -> time as monotonic effect
Compare
Taken from “Studying the History of English”
Results: diachronic cluster analysis
Results: diachronic cluster analysis
Method: sentiment analysis
Hypothesis
Approach
Method: sentiment analysis
Results: sentiment analysis
Results: sentiment analysis
Discussion
Discussion: challenges
Future work
Thank you!
Do check out all the exciting work we refer to in our reference list: