1 of 146

A broad-coverage semantic �classification of the English clause-embedding lexicon

Aaron Steven White

University of Rochester

MECORE Kickoff Workshop

University of Edinburgh

21 October 2021

2 of 146

Slides

aaronstevenwhite.io 

Data + Code 

megaattitude.io 

3 of 146

Kyle Rawlins

Johns Hopkins University

Ellise Moon

University of Rochester

Hannah An

University of Rochester

Ben Kane

University of Rochester

Will Gantt

University of Rochester

Collaborators

4 of 146

Ben Kane

University of Rochester

Will Gantt

University of Rochester

5 of 146

Overarching QuestionWhat are the components that clause-taking predicates' semantic values are built from?

6 of 146

Subquestion #1�Which inferences triggered by sentences containing clause-embedding are associated with lexical information?

7 of 146

Jo hated that Bo left. ⇝ Bo left.

Veridicality inference

NP   V           S     ⇝    S    

8 of 146

Subquestion #2�Given a set of inference types, which possible inferential patterns associated with lexical items are attested?

9 of 146

Jo hated that Bo left. ⇝ Bo left.

NP   V           S     ⇝    S    

Veridicality inference

Jo hated that Bo left. ⇝ Jo believed Bo left.

Doxastic inference

NP   V           S     ⇝ NP believe     S    

Jo hated that Bo left. ⇝ Jo didn't want Bo to have left.

Bouletic inference

NP   V           S     ⇝ NP    not want        S        

10 of 146

Subquestion #2�Given a set of inference types, which possible inferential patterns associated with lexical items are attested?

11 of 146

Predicate

NP V S ⇝ S

NP V S ⇝ NP believe S

NP V S ⇝ NP want S

think

0

+

0

doubt

0

-

0

hope

0

0

+

hate

+

+

-

12 of 146

Theoretical ImportGaps in attested patterns potentially suggest deep constraints on lexicalization.

Horn 1972, Barwise & Cooper 1981, Levin & Rappaport Hovav 1991, a.o.

13 of 146

Goals for today's talk

Which logically possible inference patterns are both attested and predictive of syntactic distribution?

14 of 146

Approach

  1. Cluster predicates based on measures of their inferential properties.
  2. Determine optimal # of clusters based on how well particular clusterings predict syntactic distribution.

15 of 146

NP V S S

Veridicality inference

Doxastic inference

NP V S ⇝ NP believe S

Bouletic inference

NP V S ⇝ NP want S

NP not V S ⇝ (not) S    

NP not V S ⇝ NP (not) believe S

NP not V S ⇝ NP (not) want S

(not)

(not)

(not)

Neg-raising inference

NP not V S ⇝ NP V not S

16 of 146

Roadmap

  1. Measuring distribution
  2. Measuring inference
  3. Discovering inference patterns
  4. Investigating inference patterns

17 of 146

Measuring distribution

18 of 146

MegaAcceptability dataset

Acceptability for 1,000 verbs in 50 syntactic frames focused on clause-embedding.

White & Rawlins 2016, 2020

19 of 146

think

know

wonder

love

surprise

tell

say

start

stop

...

Verbs

20 of 146

Bleaching method

Frame templates (e.g. NP __ that S) instantiated by semantically bleached fillers.

21 of 146

Someone __ed something happened

Someone __ed that something happened

Someone __ed whether something happened

Someone __ed which someone something happened

Someone __ed someone that something happened

Someone __ed someone whether something happened

Someone __ed to someone that something happened

Someone __ed to do something

Someone __ed someone to do something

...

think

know

wonder

love

surprise

tell

say

start

stop

...

x

Verbs

Frames

22 of 146

50,000 total items x 5 judgments per item

MegaAcceptability dataset

Acceptability for 1,000 verbs in 50 syntactic frames focused on clause-embedding.

White & Rawlins 2016, 2020

23 of 146

Question

Is bleaching a valid method for capturing the acceptability of a verb in a frame?

Validation Strategy

Compare judgments for bleached items against judgments from trained linguists. 

24 of 146

Validation data

  1. Select 30 verbs from across Hacquard & Wellwood's (2012) classification
  2. Gather judgments for these verbs in all 50 syntactic frames from:
    1. trained linguists
    2. naïve speakers 

25 of 146

Comparison

Correlation between judgments from LI and Sprouse et al.'s (2013) dataset

26 of 146

Sprouse Linguistic Inquiry

MegaAcceptability

Correlation

27 of 146

Conclusion

Safe to use bleaching to collect acceptabiliy judgments focused on capturing selection.

Important Point

Be cautious in using this dataset to investigate individual predicates.

28 of 146

Measuring inference

29 of 146

NP V S S

Veridicality inference

Doxastic inference

NP V S ⇝ NP believe S

Bouletic inference

NP V S ⇝ NP want S

NP not V S ⇝ (not) S    

NP not V S ⇝ NP (not) believe S

NP not V S ⇝ NP (not) want S

(not)

(not)

(not)

Neg-raising inference

NP not V S ⇝ NP V not S

30 of 146

Recipe

  1. Validate a bleaching paradigm for collecting judgments for an inference type.
  2. Select a set of frames of interest.
  3. Select predicates acceptable in those frames using MegaAcceptability.
  4. Collect judgments using the paradigm.

31 of 146

Someone was irritated that a particular thing happened.

Did that thing happen?

no      maybe or maybe not       yes

Veridicality task

White & Rawlins 2018

32 of 146

Someone {knew, didn't know} that a particular thing happened.

NP _ that S

Someone {was, wasn't} surprised that a particular thing happened.

NP be _ that S

Someone {needed, didn’t need} for a particular thing to happen

NP _ for NP to VP

Someone {told, didn’t tell} a particular person to do a particular thing

Someone {believed, didn’t believe} a particular person to have a particular thing

NP _ NP to VP[+/-eventive]

A particular person {was, wasn’t} excited to do a particular thing

A particular person {was, wasn’t} suspected to have a particular thing

NP be _ to VP[+/-eventive]

A particular person {managed, didn’t manage} to do a particular thing

A particular person {seemed, didn’t seem} to have a particular thing.

NP _ to VP[+/-eventive]

33 of 146

If I were to say I don’t think that a particular thing happened, how likely is it that I mean I think that that thing didn’t happen?

Neg-raising task

Extremely unlikely

Extremely likely

An & White 2020

34 of 146

know that a particular thing happened.

NP _ that S

A particular person {didn’t, doesn’t}

I {didn’t, don’t}

surprised that a particular thing happened.

NP be _ that S

A particular person {wasn’t, isn’t}

I {wasn’t, ‘m not}

told to do a particular thing

believed to have a particular thing

NP be _ to VP[+/-eventive]

A particular person {wasn’t, isn’t}

I {wasn’t, ‘m not}

managed to do a particular thing

seemed to have a particular thing.

NP _ to VP[+/-eventive]

A particular person {didn’t, doesn’t}

I {didn’t, don’t}

35 of 146

If A knew that C happened, how likely is it that A believed that C happened?

Doxastic task

Extremely unlikely

Extremely likely

Kane et al. 2021

36 of 146

If A persudaded B that C happened, how likely is it that B believed that C happened?

Doxastic task

Extremely unlikely

Extremely likely

Kane et al. 2021

37 of 146

If A was appalled that C happened, how likely is it that A wanted C to have happened?

Bouletic task

Extremely unlikely

Extremely likely

Kane et al. 2021

38 of 146

If A apologized to B that C happened, how likely is it that B wanted C to have happened?

Bouletic task

Extremely unlikely

Extremely likely

Kane et al. 2021

39 of 146

A {knew, didn't know} that C happened.

NP _ that S

A {told, didn't tell} B that C happened.

NP _ NP that S

A {said, didn't say} to B that C happened.

NP _ to NP that S

A {was, wasn’t} surprised that C happened.

NP _ that S

A {hoped, didn't hope} that C would happen.

NP _ that S[+future]

A {promised, didn't promise} B that C would happen.

NP _ NP that S[+future]

A {predicted, didn't predict} to B that C would happen.

NP _ to NP that S[+future]

A {was, wasn’t} excited that C would happen.

NP _ that S[+future]

40 of 146

Question

Is bleaching a valid method for capturing inferences associated with verb in a frame?

41 of 146

Validation Strategy #1

Compare judgments for bleached items against judgments from trained linguists. 

42 of 146

Neg-raising

Non-neg-raising

NP __ that S

think, believe, feel, reckon, figure, guess, suppose, imagine

announce, claim, assert, report, know, realize, notice, find out

NP __ to VP

want, wish, happen, seem, plan, intend, mean, turn out

love, hate, need, continue, try, like, desire, decide

43 of 146

Non-neg-raising

Neg-raising

Mean rating of bleached example

44 of 146

Validation Strategy #1

Compare judgments for bleached items against judgments from trained linguists. 

Validation Strategy #2

Compare judgments for bleached items to judgments for more contentful items.

45 of 146

Implementation

For each verb-frame pair in validation set, sample five items from corpus.

46 of 146

Mean rating of corpus example

Mean rating of bleached example

r = 0.8

(p < 0.001)

47 of 146

Validation Strategy #1

Compare judgments for bleached items against judgments from trained linguists. 

Validation Strategy #2

Compare judgments for bleached items to judgments for more contentful items.

Validation Strategy #3

Compare inference judgments for bleached items to acceptability judgments for established distributional diagnostic.

48 of 146

Implementation

For each verb-frame pair in validation set, collect acceptability of strong NPI (additive either).

Jo didn’t do a particular thing, and…

…I think that Bo didn’t do that thing either.

…I don’t think that Bo did that thing either.

49 of 146

Mean rating of bleached example

Mean acceptability of strong NPI

r = 0.77 

(p < 0.001)

50 of 146

Conclusion

Safe to use bleaching to collect at least these types of inference judgments.

Important Point (again)

Be cautious in using this dataset to investigate individual predicates.

51 of 146

Discovering inference patterns

52 of 146

Approach

Cluster predicate-frame pairs in inference space using a multiview mixed effects mixture model.

53 of 146

Predicate

NP V S ⇝ S

NP V S ⇝ NP believe S

NP V S ⇝ NP want S

think

0

+

0

doubt

0

-

0

hope

0

0

+

hate

+

+

-

54 of 146

know + NP _ that S

1

2

3

4

5

6

7

8

9

10

11

12

Inference patterns

1

0

1

0

1

0

Doxastic

Bouletic

no

maybe

yes

Veridicality

Neg-raising

55 of 146

know + NP _ that S

1

2

3

4

5

6

7

8

9

10

11

12

Inference patterns

1

0

1

0

1

0

Doxastic

Bouletic

no

maybe

yes

Veridicality

Neg-raising

56 of 146

Finding clusters

Fit model to raw that-clause data in MegaVeridicality, MegaNegRaising, and MegaIntensionality using variational inference.

57 of 146

Output

  1. A distribution over inference patterns for each verb-frame pair.

58 of 146

know + NP _ that S

1

2

3

4

5

6

7

8

9

10

11

12

Inference patterns

1

0

1

0

1

0

Doxastic

Bouletic

no

maybe

yes

Veridicality

Neg-raising

59 of 146

Output

  1. A distribution over inference patterns for each verb-frame pair.
  2. Distributions over judgments for each inference type and inference pattern

60 of 146

know + NP _ that S

1

2

3

4

5

6

7

8

9

10

11

12

Inference patterns

1

0

1

0

1

0

Doxastic

Bouletic

no

maybe

yes

Veridicality

Neg-raising

61 of 146

Question

How many inference patterns should we assume there are?

Idea

Only as many as we need to explain syntactic distribution.

62 of 146

Implementation

Select the smallest clustering for which no larger clustering improves prediction of the judgments in MegaAcceptability.

63 of 146

Cluster

64 of 146

Predicate

Cluster

Frame

Predicate

65 of 146

Implementation

Select the smallest clustering for which no larger clustering improves prediction of the judgments in MegaAcceptability.

Result

Optimal number of inference patterns is 15.

66 of 146

Interpretation

There are at least 15 distributionally correlated inference patterns.

Important Point #2

Enriching the distributional representation could increase the granularity of the patterns.

Important Point #1

Not all inference patterns instantiated by particular predicates will get their own inference pattern.

67 of 146

Investigating inference patterns

68 of 146

know + NP _ that S

1

2

3

4

5

6

7

8

9

10

11

12

Inference patterns

1

0

1

0

1

0

Doxastic

Bouletic

no

maybe

yes

Veridicality

Neg-raising

0

0.5

1

69 of 146

Predicate

Cluster

Frame

Predicate

70 of 146

71 of 146

Representiationals

doxastic mental states and mental processes

NP {thought, believed, suspected} that S

72 of 146

73 of 146

74 of 146

75 of 146

Preferentials 

expressions of preference for a (future) situation.

NP {hoped, wished, demanded, recommended} that S[+/-future]​

76 of 146

77 of 146

78 of 146

79 of 146

Positive internal emotives

positive emotional states

A was {pleased, thrilled, enthused} that C happened.

Preferentials 

expressions of preference for a (future) situation.

NP {hoped, wished, demanded, recommended} that S[+/-future]​

80 of 146

81 of 146

82 of 146

83 of 146

84 of 146

Negative emotive miratives

expressions of surprise with negative valence

NP was {dazed, flustered, alarmed} that S[+future].

Negative external emotives

expressions of negative emotion with behavioral correlates

NP {whined, whimpered, pouted} to NP that S[+future].​

Positive external emotives

expressions of positive emotion with behavioral correlates

NP was {congratulated, praised, fascinated} that S.

Positive internal emotives

positive emotional states

NP was {pleased, thrilled, enthused} that S.

Preferentials 

expressions of preference for a (future) situation.

NP {hoped, wished, demanded, recommended} that S[+future/-tense]​

Negative internal emotives

negative emotional states

NP was {frightened, disgusted, infuriated} that S.​

85 of 146

86 of 146

Representiationals

doxastic mental states and mental processes

NP {thought, believed, suspected} that S

Speculatives 

communication of uncertain beliefs.

NP {ventured, guessed, gossiped} that S

Future commitment

expressions of commitment to future action or result.

NP {promised, ensured, attested} S[+future]

87 of 146

88 of 146

Weak communicatives

communicative acts with weak doxastic inferences about the source.

NP {reported, remarked, yelped} to NP that S

Representiationals

doxastic mental states and mental processes

NP {thought, believed, suspected} that S

Speculatives 

communication of uncertain beliefs.

NP {ventured, guessed, gossiped} that S

Future commitment

expressions of commitment to future action or result.

NP {promised, ensured, attested} S[+future]

Strong communicatives

communicative acts with strong doxastic inferences about the source.

NP {confessed, admitted, acknowledged} that S​

Discourse commitment

communicative acts committing the source to the content’s truth.

A {maintained, remarked, swore} that C would happen.

89 of 146

Negative emotive miratives

expressions of surprise with negative valence

A was {dazed, flustered, alarmed} that C would happen.

Negative external emotives

expressions of negative emotion with behavioral correlates

A {whined, whimpered, pouted} to B that C would happen.​

Positive external emotives

expressions of positive emotion with behavioral correlates

A was {congratulated, praised, fascinated} that C happened.

Positive internal emotives

positive emotional states

A was {pleased, thrilled, enthused} that C happened.

Preferentials 

expressions of preference for a (future) situation.

NP {hoped, wished, demanded, recommended} that S[+/-future]​

Negative internal emotives

negative emotional states

A was {frightened, disgusted, infuriated} that C happened.​

Negative emotive communicatives

communicative acts with broadly negative valence.

A {screamed, ranted, growled} to B that C would happen.​

90 of 146

91 of 146

Weak communicatives

communicative acts with weak doxastic inferences about the source.

NP {reported, remarked, yelped} to NP that S

Representiationals

doxastic mental states and mental processes

NP {thought, believed, suspected} that S

Speculatives 

communication of uncertain beliefs.

NP {ventured, guessed, gossiped} that S

Future commitment

expressions of commitment to future action or result.

NP {promised, ensured, attested} S[+future]

Strong communicatives

communicative acts with strong doxastic inferences about the source.

NP {confessed, admitted, acknowledged} that S​

Deceptives

actions involving dishonesty, deceit, or pretense.

NP {lied, misled, faked, fabricated} ((to) NP) that S.​

Discourse commitment

communicative acts committing the source to the content’s truth.

NP{maintained, remarked, swore} that S[+future].

92 of 146

93 of 146

Interpretation

There are at least 15 distributionally correlated inference patterns.

Important Point #2

Enriching the distributional representation could increase the granularity of the patterns.

Important Point #1

Not all inference patterns instantiated by particular predicates will get their own inference pattern.

94 of 146

Interpretation

There are at least 15 distributionally correlated inference patterns.

Important Point #2

Enriching the distributional representation could increase the granularity of the patterns.

Important Point #1

Not all inference patterns instantiated by particular predicates will get their own inference pattern.

95 of 146

Conclusion

96 of 146

Overarching QuestionWhat are the components that clause-taking predicates' semantic values are built from?

97 of 146

Current Directions�How do we discover the underlying representational components?

98 of 146

Subdirection #1�Decomposition of the inference patterns themselves. 

99 of 146

Correlations across inference types

Correlations across inference patterns

100 of 146

Subdirection #1�Decomposition of the inference patterns themselves. 

Subdirection #2�Decomposition of the relationship between inference patterns and syntactic distribution. 

101 of 146

Relationship between inference patterns and syntax

Correlations across syntactic structures

102 of 146

Subdirection #1�Decomposition of the inference patterns themselves. 

Subdirection #2�Decomposition of the relationship between inference patterns and syntactic distribution. 

Subdirection #3�Decomposition of the relationship between inference patterns and lexical items. 

103 of 146

Correlations across predicates

Predicate

Cluster

104 of 146

Possible Unified Approach�Multi-task combinatory categorial grammar induction with structured denotation decoders

105 of 146

Gene Kim

University of Rochester

106 of 146

Thanks!

Supported by NSF-BCS-1748969

The MegaAttitude Project: Investigating selection and polysemy at the scale of the lexicon

107 of 146

Appendix A:�Further Validation of MegaAcceptability

108 of 146

Case Study�The vast majority of about-PPs are adjuncts�

Rawlins 2013, 2014

109 of 146

XP1 V (XP2) (XP3) about XP4

is acceptable

XP1 V (XP2) (XP3)

is acceptable

X

110 of 146

NP _ed  

NP _ed about XP  

111 of 146

Rawlins 2014

112 of 146

NP _ed  

NP _ed about XP  

113 of 146

NP _ed  

NP _ed about XP  

114 of 146

NP _ed  

NP _ed about XP  

115 of 146

NP _ed  

NP _ed about XP  

116 of 146

NP _ed  

NP _ed about XP  

117 of 146

NP _ed  

NP _ed about XP  

118 of 146

Noise variance / acceptability variance 

Proportion violations

Independence

119 of 146

NP (was) _ed  

NP (was) _ed about whether S  

120 of 146

NP (was) _ed about whether S  

NP (was) _ed  

121 of 146

NP (was) _ed about whether S  

NP (was) _ed  

122 of 146

NP (was) _ed about whether S  

NP (was) _ed  

123 of 146

Acceptability threshold 

Proportion violations

124 of 146

Noise variance / acceptability variance 

Proportion violations

Independence

125 of 146

Acceptability threshold 

Proportion violations

126 of 146

Appendix D:�Distribution of Inference Judgments

127 of 146

128 of 146

129 of 146

130 of 146

Appendix C:�Validation of MegaIntensionality

131 of 146

Question

Is bleaching a valid method for capturing doxastic and bouletic inferences associated with verb in a frame?

132 of 146

Challenge

Doxastic and bouletic inferences are highly sensitive to world knowledge.

133 of 146

Jo doubts that Bo left. ⇝ Jo doesn't believe that Bo left.

Jo doubts that Bo left. ⇝ Jo wants Bo to have left.

Trump doubts that he won in 2020.

Trump wants to have won in 2020.

134 of 146

Approach

  1. Norm scenarios for likelihood of prior belief or desire not conditioned on a previous sentence

135 of 146

Executives generally want their deals to go through.

Executives generally believe that their deals will go through.

Norming

136 of 146

137 of 146

Approach

  1. Norm scenarios for likelihood of prior belief or desire not conditioned on a previous sentence
  2. Test those normed schenarios in an inference task focused 24 verbs. 

138 of 146

Executives generally want their deals to go through.

Executives generally believe that their deals will go through.

Norming

The executive knew that his deal had gone through.

Contentful

139 of 146

Approach

  1. Norm scenarios for likelihood of prior belief or desire not conditioned on a previous sentence
  2. Test those normed schenarios in an inference task focused 24 verbs. 
  3. Compare to bleached variants.

140 of 146

Executives generally want their deals to go through.

Executives generally believe that their deals will go through.

Norming

The executive knew that his deal had gone through.

Contentful

A knew that C happened.

Bleached

141 of 146

142 of 146

Appendix D:�Number of possible inference patterns

143 of 146

(3 veridicality inferences)2 matrix polarities

x

(3 doxastic inferences)2 matrix polarities

x

(3 bouletic inferences)2 matrix polarities

x

2 neg-raising inferences

=

1,458 inference patterns

If any lexical knowledge relevant to any inference type is gradient (and continuous), there are an uncountable number of patterns.

144 of 146

Appendix E:�Principal Component Analysis

145 of 146

95% of variance

146 of 146

  1. The polarity of veridicality and doxastic inferences under negation is anti-correlated with neg-raising.
  2. The polarity of a belief presupposition about a recipient is correlated with the polarity of a desire presupposition.
  3. The valence of an emotive communicative is anticorrelated with veridicality.
  4. Bouletic inferences about the source and the target of a communication are anticorrelated with veridicality.
  5. Desire inferences about the source in a communication are anticorrelated with belief inferences about the target.
  6. Veridicality is correlated with belief inferences in the target of a communication but anticorrelated with desire inferences.