1 of 146

A broad-coverage semantic �classification of the English clause-embedding lexicon

Aaron Steven White

University of Rochester

MECORE Kickoff Workshop

University of Edinburgh

21 October 2021

2 of 146

Slides

aaronstevenwhite.io

Data + Code

megaattitude.io

3 of 146

Kyle Rawlins

Johns Hopkins University

Ellise Moon

University of Rochester

Hannah An

University of Rochester

Ben Kane

University of Rochester

Will Gantt

University of Rochester

Collaborators

4 of 146

Ben Kane

University of Rochester

Will Gantt

University of Rochester

5 of 146

Overarching Question�What are the components that clause-taking predicates' semantic values are built from?

6 of 146

Subquestion #1�Which inferences triggered by sentences containing clause-embedding are associated with lexical information?

7 of 146

Jo hated that Bo left. ⇝ Bo left.

Veridicality inference

NP V S ⇝ S

8 of 146

Subquestion #2�Given a set of inference types, which possible inferential patterns associated with lexical items are attested?

9 of 146

Jo hated that Bo left. ⇝ Bo left.

NP V S ⇝ S

Veridicality inference

Jo hated that Bo left. ⇝ Jo believed Bo left.

Doxastic inference

NP V S ⇝ NP believe S

Jo hated that Bo left. ⇝ Jo didn't want Bo to have left.

Bouletic inference

NP V S ⇝ NP not want S

10 of 146

Subquestion #2�Given a set of inference types, which possible inferential patterns associated with lexical items are attested?

11 of 146

Predicate	NP V S ⇝ S	NP V S ⇝ NP believe S	NP V S ⇝ NP want S
think	0	+	0
doubt	0	-	0
hope	0	0	+
hate	+	+	-

12 of 146

Theoretical Import�Gaps in attested patterns potentially suggest deep constraints on lexicalization.

Horn 1972, Barwise & Cooper 1981, Levin & Rappaport Hovav 1991, a.o.

13 of 146

Goals for today's talk

Which logically possible inference patterns are both attested and predictive of syntactic distribution?

14 of 146

Approach

Cluster predicates based on measures of their inferential properties.
Determine optimal # of clusters based on how well particular clusterings predict syntactic distribution.

15 of 146

NP V S ⇝ S

Veridicality inference

Doxastic inference

NP V S ⇝ NP believe S

Bouletic inference

NP V S ⇝ NP want S

NP not V S ⇝ (not) S

NP not V S ⇝ NP (not) believe S

NP not V S ⇝ NP (not) want S

(not)

Neg-raising inference

NP not V S ⇝ NP V not S

16 of 146

Roadmap

Measuring distribution
Measuring inference
Discovering inference patterns
Investigating inference patterns

17 of 146

Measuring distribution

18 of 146

MegaAcceptability dataset

Acceptability for 1,000 verbs in 50 syntactic frames focused on clause-embedding.

White & Rawlins 2016, 2020

19 of 146

think

know

wonder

love

surprise

tell

say

start

stop

...

Verbs

20 of 146

Bleaching method

Frame templates (e.g. NP __ that S) instantiated by semantically bleached fillers.

21 of 146

Someone __ed something happened

Someone __ed that something happened

Someone __ed whether something happened

Someone __ed which someone something happened

Someone __ed someone that something happened

Someone __ed someone whether something happened

Someone __ed to someone that something happened

Someone __ed to do something

Someone __ed someone to do something

...

think

know

wonder

love

surprise

tell

say

start

stop

...

Verbs

Frames

22 of 146

50,000 total items x 5 judgments per item

MegaAcceptability dataset

Acceptability for 1,000 verbs in 50 syntactic frames focused on clause-embedding.

White & Rawlins 2016, 2020

23 of 146

Question

Is bleaching a valid method for capturing the acceptability of a verb in a frame?

Validation Strategy

Compare judgments for bleached items against judgments from trained linguists.

24 of 146

Validation data

Select 30 verbs from across Hacquard & Wellwood's (2012) classification
Gather judgments for these verbs in all 50 syntactic frames from:

trained linguists
naïve speakers

25 of 146

Comparison

Correlation between judgments from LI and Sprouse et al.'s (2013) dataset

26 of 146

Sprouse Linguistic Inquiry

MegaAcceptability

Correlation

27 of 146

Conclusion

Safe to use bleaching to collect acceptabiliy judgments focused on capturing selection.

Important Point

Be cautious in using this dataset to investigate individual predicates.

28 of 146

Measuring inference

29 of 146

NP V S ⇝ S

Veridicality inference

Doxastic inference

NP V S ⇝ NP believe S

Bouletic inference

NP V S ⇝ NP want S

NP not V S ⇝ (not) S

NP not V S ⇝ NP (not) believe S

NP not V S ⇝ NP (not) want S

(not)

Neg-raising inference

NP not V S ⇝ NP V not S

30 of 146

Recipe

Validate a bleaching paradigm for collecting judgments for an inference type.
Select a set of frames of interest.
Select predicates acceptable in those frames using MegaAcceptability.
Collect judgments using the paradigm.

31 of 146

Someone was irritated that a particular thing happened.

Did that thing happen?

no maybe or maybe not yes

Veridicality task

White & Rawlins 2018

32 of 146

Someone {knew, didn't know} that a particular thing happened.

NP _ that S

Someone {was, wasn't} surprised that a particular thing happened.

NP be _ that S

Someone {needed, didn’t need} for a particular thing to happen.

NP _ for NP to VP

Someone {told, didn’t tell} a particular person to do a particular thing.

Someone {believed, didn’t believe} a particular person to have a particular thing.

NP _ NP to VP[+/-eventive]

A particular person {was, wasn’t} excited to do a particular thing.

A particular person {was, wasn’t} suspected to have a particular thing.

NP be _ to VP[+/-eventive]

A particular person {managed, didn’t manage} to do a particular thing.

A particular person {seemed, didn’t seem} to have a particular thing.

NP _ to VP[+/-eventive]

33 of 146

If I were to say I don’t think that a particular thing happened, how likely is it that I mean I think that that thing didn’t happen?

Neg-raising task

Extremely unlikely

Extremely likely

An & White 2020

34 of 146

know that a particular thing happened.

NP _ that S

A particular person {didn’t, doesn’t}

I {didn’t, don’t}

surprised that a particular thing happened.

NP be _ that S

A particular person {wasn’t, isn’t}

I {wasn’t, ‘m not}

told to do a particular thing.

believed to have a particular thing.

NP be _ to VP[+/-eventive]

A particular person {wasn’t, isn’t}

I {wasn’t, ‘m not}

managed to do a particular thing.

seemed to have a particular thing.

NP _ to VP[+/-eventive]

A particular person {didn’t, doesn’t}

I {didn’t, don’t}

35 of 146

If A knew that C happened, how likely is it that A believed that C happened?

Doxastic task

Extremely unlikely

Extremely likely

Kane et al. 2021

36 of 146

If A persudaded B that C happened, how likely is it that B believed that C happened?

Doxastic task

Extremely unlikely

Extremely likely

Kane et al. 2021

37 of 146

If A was appalled that C happened, how likely is it that A wanted C to have happened?

Bouletic task

Extremely unlikely

Extremely likely

Kane et al. 2021

38 of 146

If A apologized to B that C happened, how likely is it that B wanted C to have happened?

Bouletic task

Extremely unlikely

Extremely likely

Kane et al. 2021

39 of 146

A {knew, didn't know} that C happened.

NP _ that S

A {told, didn't tell} B that C happened.

NP _ NP that S

A {said, didn't say} to B that C happened.

NP _ to NP that S

A {was, wasn’t} surprised that C happened.

NP _ that S

A {hoped, didn't hope} that C would happen.

NP _ that S[+future]

A {promised, didn't promise} B that C would happen.

NP _ NP that S[+future]

A {predicted, didn't predict} to B that C would happen.

NP _ to NP that S[+future]

A {was, wasn’t} excited that C would happen.

NP _ that S[+future]

40 of 146

Question

Is bleaching a valid method for capturing inferences associated with verb in a frame?

41 of 146

Validation Strategy #1

Compare judgments for bleached items against judgments from trained linguists.

42 of 146

	Neg-raising	Non-neg-raising
NP __ that S	think, believe, feel, reckon, figure, guess, suppose, imagine	announce, claim, assert, report, know, realize, notice, find out
NP __ to VP	want, wish, happen, seem, plan, intend, mean, turn out	love, hate, need, continue, try, like, desire, decide

43 of 146

Non-neg-raising

Neg-raising

Mean rating of bleached example

44 of 146

Validation Strategy #1

Compare judgments for bleached items against judgments from trained linguists.

Validation Strategy #2

Compare judgments for bleached items to judgments for more contentful items.

45 of 146

Implementation

For each verb-frame pair in validation set, sample five items from corpus.

46 of 146

Mean rating of corpus example

Mean rating of bleached example

r = 0.8

(p < 0.001)

47 of 146

Validation Strategy #1

Compare judgments for bleached items against judgments from trained linguists.

Validation Strategy #2

Compare judgments for bleached items to judgments for more contentful items.

Validation Strategy #3

Compare inference judgments for bleached items to acceptability judgments for established distributional diagnostic.

48 of 146

Implementation

For each verb-frame pair in validation set, collect acceptability of strong NPI (additive either).

Jo didn’t do a particular thing, and…

…I think that Bo didn’t do that thing either.

…I don’t think that Bo did that thing either.

49 of 146

Mean rating of bleached example

Mean acceptability of strong NPI

r = 0.77

(p < 0.001)

50 of 146

Conclusion

Safe to use bleaching to collect at least these types of inference judgments.

Important Point (again)

Be cautious in using this dataset to investigate individual predicates.

51 of 146

Discovering inference patterns

52 of 146

Approach

Cluster predicate-frame pairs in inference space using a multiview mixed effects mixture model.

53 of 146

Predicate	NP V S ⇝ S	NP V S ⇝ NP believe S	NP V S ⇝ NP want S
think	0	+	0
doubt	0	-	0
hope	0	0	+
hate	+	+	-

54 of 146

know + NP _ that S

Inference patterns

Doxastic

Bouletic

maybe

yes

Veridicality

Neg-raising

55 of 146

know + NP _ that S

Inference patterns

Doxastic

Bouletic

maybe

yes

Veridicality

Neg-raising

56 of 146

Finding clusters

Fit model to raw that-clause data in MegaVeridicality, MegaNegRaising, and MegaIntensionality using variational inference.

57 of 146

Output

A distribution over inference patterns for each verb-frame pair.

58 of 146

know + NP _ that S

Inference patterns

Doxastic

Bouletic

maybe

yes

Veridicality

Neg-raising

59 of 146

Output

A distribution over inference patterns for each verb-frame pair.
Distributions over judgments for each inference type and inference pattern

60 of 146

know + NP _ that S

Inference patterns

Doxastic

Bouletic

maybe

yes

Veridicality

Neg-raising

61 of 146

Question

How many inference patterns should we assume there are?

Idea

Only as many as we need to explain syntactic distribution.

62 of 146

Implementation

Select the smallest clustering for which no larger clustering improves prediction of the judgments in MegaAcceptability.

64 of 146

Predicate

Cluster

Frame

Predicate

65 of 146

Implementation

Select the smallest clustering for which no larger clustering improves prediction of the judgments in MegaAcceptability.

Result

Optimal number of inference patterns is 15.

66 of 146

Interpretation

There are at least 15 distributionally correlated inference patterns.

Important Point #2

Enriching the distributional representation could increase the granularity of the patterns.

Important Point #1

Not all inference patterns instantiated by particular predicates will get their own inference pattern.

67 of 146

Investigating inference patterns

68 of 146

know + NP _ that S

Inference patterns

Doxastic

Bouletic

maybe

yes

Veridicality

Neg-raising

0.5

69 of 146

Predicate

Cluster

Frame

Predicate

71 of 146

Representiationals

doxastic mental states and mental processes

NP {thought, believed, suspected} that S

75 of 146

Preferentials

expressions of preference for a (future) situation.

NP {hoped, wished, demanded, recommended} that S[+/-future]

79 of 146

Positive internal emotives

positive emotional states

A was {pleased, thrilled, enthused} that C happened.

Preferentials

expressions of preference for a (future) situation.

NP {hoped, wished, demanded, recommended} that S[+/-future]

84 of 146

Negative emotive miratives

expressions of surprise with negative valence

NP was {dazed, flustered, alarmed} that S[+future].

Negative external emotives

expressions of negative emotion with behavioral correlates

NP {whined, whimpered, pouted} to NP that S[+future].

Positive external emotives

expressions of positive emotion with behavioral correlates

NP was {congratulated, praised, fascinated} that S.

Positive internal emotives

positive emotional states

NP was {pleased, thrilled, enthused} that S.

Preferentials

expressions of preference for a (future) situation.

NP {hoped, wished, demanded, recommended} that S[+future/-tense]

Negative internal emotives

negative emotional states

NP was {frightened, disgusted, infuriated} that S.

86 of 146

Representiationals

doxastic mental states and mental processes

NP {thought, believed, suspected} that S

Speculatives

communication of uncertain beliefs.

NP {ventured, guessed, gossiped} that S

Future commitment

expressions of commitment to future action or result.

NP {promised, ensured, attested} S[+future]

88 of 146

Weak communicatives

communicative acts with weak doxastic inferences about the source.

NP {reported, remarked, yelped} to NP that S

Representiationals

doxastic mental states and mental processes

NP {thought, believed, suspected} that S

Speculatives

communication of uncertain beliefs.

NP {ventured, guessed, gossiped} that S

Future commitment

expressions of commitment to future action or result.

NP {promised, ensured, attested} S[+future]

Strong communicatives

communicative acts with strong doxastic inferences about the source.

NP {confessed, admitted, acknowledged} that S

Discourse commitment

communicative acts committing the source to the content’s truth.

A {maintained, remarked, swore} that C would happen.

89 of 146

Negative emotive miratives

expressions of surprise with negative valence

A was {dazed, flustered, alarmed} that C would happen.

Negative external emotives

expressions of negative emotion with behavioral correlates

A {whined, whimpered, pouted} to B that C would happen.

Positive external emotives

expressions of positive emotion with behavioral correlates

A was {congratulated, praised, fascinated} that C happened.

Positive internal emotives

positive emotional states

A was {pleased, thrilled, enthused} that C happened.

Preferentials

expressions of preference for a (future) situation.

NP {hoped, wished, demanded, recommended} that S[+/-future]

Negative internal emotives

negative emotional states

A was {frightened, disgusted, infuriated} that C happened.

Negative emotive communicatives

communicative acts with broadly negative valence.

A {screamed, ranted, growled} to B that C would happen.

91 of 146

Weak communicatives

communicative acts with weak doxastic inferences about the source.

NP {reported, remarked, yelped} to NP that S

Representiationals

doxastic mental states and mental processes

NP {thought, believed, suspected} that S

Speculatives

communication of uncertain beliefs.

NP {ventured, guessed, gossiped} that S

Future commitment

expressions of commitment to future action or result.

NP {promised, ensured, attested} S[+future]

Strong communicatives

communicative acts with strong doxastic inferences about the source.

NP {confessed, admitted, acknowledged} that S

Deceptives

actions involving dishonesty, deceit, or pretense.

NP {lied, misled, faked, fabricated} ((to) NP) that S.

Discourse commitment

communicative acts committing the source to the content’s truth.

NP{maintained, remarked, swore} that S[+future].

93 of 146

Interpretation

There are at least 15 distributionally correlated inference patterns.

Important Point #2

Enriching the distributional representation could increase the granularity of the patterns.

Important Point #1

Not all inference patterns instantiated by particular predicates will get their own inference pattern.

94 of 146

Interpretation

There are at least 15 distributionally correlated inference patterns.

Important Point #2

Enriching the distributional representation could increase the granularity of the patterns.

Important Point #1

Not all inference patterns instantiated by particular predicates will get their own inference pattern.

95 of 146

Conclusion

96 of 146

Overarching Question�What are the components that clause-taking predicates' semantic values are built from?

97 of 146

Current Directions�How do we discover the underlying representational components?

98 of 146

Subdirection #1�Decomposition of the inference patterns themselves.

99 of 146

Correlations across inference types

Correlations across inference patterns

100 of 146

Subdirection #1�Decomposition of the inference patterns themselves.

Subdirection #2�Decomposition of the relationship between inference patterns and syntactic distribution.

101 of 146

Relationship between inference patterns and syntax

Correlations across syntactic structures

102 of 146

Subdirection #1�Decomposition of the inference patterns themselves.

Subdirection #2�Decomposition of the relationship between inference patterns and syntactic distribution.

Subdirection #3�Decomposition of the relationship between inference patterns and lexical items.

103 of 146

Correlations across predicates

Predicate

Cluster

104 of 146

Possible Unified Approach�Multi-task combinatory categorial grammar induction with structured denotation decoders

105 of 146

Gene Kim

University of Rochester

106 of 146

Thanks!

Supported by NSF-BCS-1748969

The MegaAttitude Project: Investigating selection and polysemy at the scale of the lexicon

107 of 146

Appendix A:�Further Validation of MegaAcceptability

108 of 146

Case Study�The vast majority of about-PPs are adjuncts�

Rawlins 2013, 2014

109 of 146

XP₁ V (XP₂) (XP₃) about XP₄

is acceptable

XP₁ V (XP₂) (XP₃)

is acceptable

110 of 146

NP _ed

NP _ed about XP

111 of 146

Rawlins 2014

112 of 146

NP _ed

NP _ed about XP

113 of 146

NP _ed

NP _ed about XP

114 of 146

NP _ed

NP _ed about XP

115 of 146

NP _ed

NP _ed about XP

116 of 146

NP _ed

NP _ed about XP

117 of 146

NP _ed

NP _ed about XP

118 of 146

Noise variance / acceptability variance

Proportion violations

Independence

119 of 146

NP (was) _ed

NP (was) _ed about whether S

120 of 146

NP (was) _ed about whether S

NP (was) _ed

121 of 146

NP (was) _ed about whether S

NP (was) _ed

122 of 146

NP (was) _ed about whether S

NP (was) _ed

123 of 146

Acceptability threshold

Proportion violations

124 of 146

Noise variance / acceptability variance

Proportion violations

Independence

125 of 146

Acceptability threshold

Proportion violations

126 of 146

Appendix D:�Distribution of Inference Judgments

130 of 146

Appendix C:�Validation of MegaIntensionality

131 of 146

Question

Is bleaching a valid method for capturing doxastic and bouletic inferences associated with verb in a frame?

132 of 146

Challenge

Doxastic and bouletic inferences are highly sensitive to world knowledge.

133 of 146

Jo doubts that Bo left. ⇝ Jo doesn't believe that Bo left.

Jo doubts that Bo left. ⇝ Jo wants Bo to have left.

Trump doubts that he won in 2020.

Trump wants to have won in 2020.

134 of 146

Approach

Norm scenarios for likelihood of prior belief or desire not conditioned on a previous sentence

135 of 146

Executives generally want their deals to go through.

Executives generally believe that their deals will go through.

Norming

137 of 146

Approach

Norm scenarios for likelihood of prior belief or desire not conditioned on a previous sentence
Test those normed schenarios in an inference task focused 24 verbs.

138 of 146

Executives generally want their deals to go through.

Executives generally believe that their deals will go through.

Norming

The executive knew that his deal had gone through.

Contentful

139 of 146

Approach

Norm scenarios for likelihood of prior belief or desire not conditioned on a previous sentence
Test those normed schenarios in an inference task focused 24 verbs.
Compare to bleached variants.

140 of 146

Executives generally want their deals to go through.

Executives generally believe that their deals will go through.

Norming

The executive knew that his deal had gone through.

Contentful

A knew that C happened.

Bleached

142 of 146

Appendix D:�Number of possible inference patterns

143 of 146

(3 veridicality inferences)^{2 matrix polarities}

(3 doxastic inferences)^{2 matrix polarities}

(3 bouletic inferences)^{2 matrix polarities}

2 neg-raising inferences

1,458 inference patterns

If any lexical knowledge relevant to any inference type is gradient (and continuous), there are an uncountable number of patterns.

144 of 146

Appendix E:�Principal Component Analysis

145 of 146

95% of variance

146 of 146

The polarity of veridicality and doxastic inferences under negation is anti-correlated with neg-raising.
The polarity of a belief presupposition about a recipient is correlated with the polarity of a desire presupposition.
The valence of an emotive communicative is anticorrelated with veridicality.
Bouletic inferences about the source and the target of a communication are anticorrelated with veridicality.
Desire inferences about the source in a communication are anticorrelated with belief inferences about the target.
Veridicality is correlated with belief inferences in the target of a communication but anticorrelated with desire inferences.

1 of 146

2 of 146

3 of 146

4 of 146

5 of 146

6 of 146

7 of 146

8 of 146

9 of 146

10 of 146

11 of 146

12 of 146

13 of 146

14 of 146

15 of 146

16 of 146

17 of 146

18 of 146

19 of 146

20 of 146

21 of 146

22 of 146

23 of 146

24 of 146

25 of 146

26 of 146

27 of 146

28 of 146

29 of 146

30 of 146

31 of 146

32 of 146

33 of 146

34 of 146

35 of 146

36 of 146

37 of 146

38 of 146

39 of 146

40 of 146

41 of 146

42 of 146

43 of 146

44 of 146

45 of 146

46 of 146

47 of 146

48 of 146

49 of 146

50 of 146

51 of 146

52 of 146

53 of 146

54 of 146

55 of 146

56 of 146

57 of 146

58 of 146

59 of 146

60 of 146

61 of 146

62 of 146

63 of 146

64 of 146

65 of 146

66 of 146

67 of 146

68 of 146

69 of 146

70 of 146

71 of 146

72 of 146

73 of 146

74 of 146

75 of 146

76 of 146

77 of 146

78 of 146

79 of 146

80 of 146