1 of 18

Hierarchical Overlapping Belief Estimation by Structured Matrix Factorization

Chaoqi Yang, Jinyang Li, Ruijie Wang, Shuochao Yao, Huajie shao, Dongxin Liu, Shengzhong Liu, Tianshi Wang, Tarek F. Abdelzaher

12 / 08 / 2020, Virtual

2020 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM)

2 of 18

Content

  • Common limitation
  • Motivate with an example
  • Present our BSMF model
    • Problem formulation
    • Propose the model (contribution 1)
    • Implementation (contribution 2)
    • Improve BSMF
      • Social convolution module
      • Message interpolation module
  • Experiment
    • Synthetic dataset
    • Three real-world dataset

2

3 of 18

Common limitation

  • Separate a large amount of data into disjoint groups
    • Belief separation (belief)
    • Opinion mining (opinion)
    • Community detection (community)
    • Sentiment analysis (sentiment groups)

3

One topic/event

group1

group2

group3

group4

disjoint

4 of 18

An example on COVID

4

No mask

Reopen everything

Let nature selection work

Plan for herd immunity

Wuhan, China

locked down

Photograph: Leon Neal/AFP via Getty Images

https://www.wsj.com/articles/chinese

-lockdown-redux-11589843868

no action

action

https://en.wikipedia.org/wiki/Social_distancing

Wear mask

Social distancing

quarantine

All classes online

Hybrid class model

(about class)

5 of 18

Belief structure

  • Structure for the example

5

  • Notations
    • Source group:
    • Each group has a common belief
    • Subset:
    • has specific belief , which is independent to . However, is is a precondition of .
  • Venn diagram

Source

Belief

6 of 18

(Unsupervised) belief separation problem

  • Input data
    • A binary matrix of size
      • Who says what
    • Claim text (tweets)
    • Conceptual belief structure
  • Output
    • A clustering of
    • A clustering of

6

7 of 18

The belief model

  • : the probability that source endorses claim
  • : the probability that source
  • : the probability that claim

7

For a specific source and a specific claim , will the source endorse the claim?

8 of 18

Belief structed matrix

8

Residual part

group

belief

Source

Belief

8

9 of 18

Belief structured NMF (BSMF)

  • We design (affiliation) source-group matrix and (affiliation) claim-belief matrix , and assume , , then we can get the following

  • Let the matrix be the matrix of probability

9

10 of 18

Belief structured NMF

10

x

x

source

claim

claim

source

group

belief

group

belief

11 of 18

Tri-factorization implementation

  • Loss function (we want to estimate )

  • Derivative

  • Learning rate for GD (we generalize standard multiplicative update rules)

11

12 of 18

Improvement for initial

  • Motivation
    • is sparse
  • Social convolution
    • Assumption: similar sources will endorse similar claims
    • Extract social relations (adjacency matrix)
      • Smooth by graph convolution
    • Extension: GNN model
  • Message interpolation
    • Assumption: the same source might endorse similar claims
    • Extract the similarity by bag-of-word (BOW)
      • Smooth by interpolation
    • Extension: BERT, Transformer
  • Details in our paper
  • Additional theoretical analysis

12

13 of 18

Synthetic dataset

  •  

13

 

 

 

 

14 of 18

Real-world dataset

  • Two datasets
    • crawled by Apollo Social Sensing Toolkit

  • Baselines
    • Random, DBSCAN, Sentiment140, H-Ncut, NMF, NMTF

14

Apollo Social Sensing Toolkit http://apollo2.cs.illinois.edu/

15 of 18

Results (Eurovision2016)

15

16 of 18

Results2 (global�warming)

16

17 of 18

Conclusion

  • identify hierarchical belief structure
  • design a belief structure matrix factorization (BSMF) method
  • generalize the multiplicative update rules
  • conduct informative experiments

17

18 of 18

Thanks for Listening��Q & A

18