Introduction to the Models and Tools of Network Analysis�Part 3: Selection: Latent Factor Selection Models in R
Ken Frank
1
Overview
2
Selection: How Actors Choose Others with whom to Interact�
Examples of Research Questions
How do farmers decide to whom to provide help?
How do bankers decide to whom to loan money?
How do social service agencies choose other agencies to refer clients to?
Theoretical Mechanisms (see Frank and Fahrbach, 1999)
Balance seeking/homophily -- seeking to interact with others like yourself
Information seeking Goal oriented , Reduce uncertainty , Power oriented , Better understanding , Curiosity, Inoculate
Transaction costs: Constraints on exposure
Frank, K.A., Muller, C., Mueller, A.S., 2013. The Embeddedness of Adolescent Friendship Nominations: The Formation of Social Capital in Emergent Network Structures. American Journal of Sociology, Vol 119(1):216-253.
Status seeking – talk to popular others (preferential attachment)
Zerubavel, N., Bearman, P. S., Weber, J., & Ochsner, K. N. (2015). Neural mechanisms tracking popularity in real-world social networks. Proceedings of the National Academy of Sciences, 112(49), 15072-15077
3
We see the World in Networks
Selection of Network Partners: Graphical Representation
7
t
t-1
Examples of Selection
Frank, K.A., Muller, C., Mueller, A.S., 2013. The Embeddedness of Adolescent Friendship Nominations: The Formation of Social Capital in Emergent Network Structures. American Journal of Sociology, Vol 119(1):216-253. Media hits: Atlantic; Huffington Post; Huffington Post (Op-ed); US News and World Report; Yahoo; Health Day; RedOrbit; The Times of India; MSUToday: Psych Central: Positions and Promotions; Deccan Chronicle: wood radio: Fox Chicago: local news channels (south Carolina): Science Daily; National Science Foundation
Spillane, J., Kim, Chong Min, Frank, K.A. 2012. “Instructional Advice and Information Providing and Receiving Behavior in Elementary Schools: Exploring Tie Formation as a Building Block in Social Capital Development.” American Educational Research Journal. Vol 49 no. 6 1112-1145
Crosnoe, Robert, Anna Strassman-Mueller, and Frank, K.A. 2008. “Gender, Body Size, and Social Relations in American High Schools.” Social Forces 86: 1189-1216.
Frank, K.A. & Yasumoto, J. 1998. "Linking Action to Social Structure within a System: Social Capital Within and Between Subgroups." American Journal of Sociology, Volume 104, No 3, pages 642-686
Wilhelm, A.G., Chen, I.C., Smith, T.M. and Frank, K.A., 2016. Selecting Expertise in Context Middle School Mathematics Teachers’ Selection of New Sources of Instructional Advice. American Educational Research Journal, 53(3), pp.456-491.
8
Selection Model
9
Interactions between i and i’ between t-1 and t
The Logistic Regression Model
The "logit" model :��ln[p/(1-p)] = β0 + β1X �
10
11
w
W=1
W=0
P(w|x)
Interpretation of Ogive
12
Introducing the Odds Ratio for the Logistic Transformation
13
Example Interpretation of coefficient β1
14
Odds=p/(1-p)
Odds of tie for same gender = (20/25)/(5/25)=4
Odds of tie for different gender = (10/15)/(5/15)=2
Change in odds due to same gender =4/2=2
Can also be calculated as (AD)/(BC)=(20x5)/(10x5)=2
Logistic regression coefficient=LN(2)=.693
Change in odds =e β0 + β1/e β0=eβ1 =e.693 =2
| | tie | |
| | No | yes |
Same gender | no | A 5 | B 10 |
| yes | C 5 | D 20 |
| total | 10 | 30 |
Selection Exercise
A) Write a model for whether two actors talked as a function of whether they are of different race and whether they are of different gender.
wii’ represents whether i and i’ talked,
yi represents the gender of i (0 if male, 1 if female), and
zi represents the race of i (0 if white, 1 if African American)
(You’ll need one term for effects associated with gender, and another for race)
15
Selection Exercise
B) Assume that Bob and Lisa are African American and that Jane and Bill are white. Bill and Bob are male and Lisa and Jane are female.
Calculate the independent variables based on difference of race and gender for Bob with each of his interaction partners:
(Bob, Lisa): different gender = _______; different race = _________
(Bob, Jane): different gender =_______; different race = _________
(Bob, Bill): different gender = _______; different race =__________
16
Selection Exercise
C) Assuming the values of the θ’s are negative and that the effect of race is stronger than that of gender, who is Bob most likely to talk to?
D) Include a term capturing the interaction of race and gender
17
Estimation of Selection Model
Use the example of wii’ being whether one teacher helped another (1 if helped, 0 if not)
Naive: logistic regression:
18
19
1 2 3 4 5 6
1|0 1 1 0 0 0
2|1 0 1 0 0 0
3|0 1 0 1 0 1
4|0 0 0 0 1 1
5|0 0 0 1 0 0
6|0 0 1 1 0 0
ROW COLUMN WEIGHT
1 2 1
1 3 1
1 4 0
1 5 0
1 6 0
2 1 1
2 3 1
2 4 0
2 5 0
2 6 0
3 1 0
3 2 1
3 4 1
3 5 0
Nominator Nominee W(help)
.
.
.
Matrix
Edgelist
Logistic Regression?
Just model the 1 or 0 as an outcome?
Estimation of Selection Model
Use the example of wii’ being whether one teacher helped another (1 if helped, 0 0 if not)
Naive: logistic regression:
Likelihood function: p(A and B) = p(A)×p(B) if A and B are independent. NO!
Helpii’ is not independent of Helpii” !
20
The p1 Approach
21
| Wi’i =0 | Wi’i =1 |
Wii’ =0 | Cell A (reciprocity) | Cell B |
Wii’ =1 | Cell C | Cell D (reciprocity) |
Model as 4 cells, A,B,C,D instead of just Wii’ =0
Holland, Paul W. and S. Leinhardt. 1981. "An Exponential Family of Probability Distributions for Directed Graphs." Journal of American Statistical Association 76(373):33-49.
Estimation via p*�
22
Visual representations of p2 model�control for dependencies associated with nominator and nominee�
23
Van Duijn, M.A.J. (1995). Estimation of a random effects model for directed graphs. In: Snijders, T.A.B. (Ed.) SSS '95. Symposium Statistische Software, nr. 7. Toeval zit overal: programmatuur voor random-coefficient modellen [Chance is omnipresent: software for random coefficient models], p. 113-131. Groningen, iec ProGAMMA.
SOFTWARE http://stat.gamma.rug.nl/stocnet/
Lazega, E. and van Duijn, M (1997). “Position in formal structure, personal characteristics and choices of advisors in a law firm: a logistic regression model for dyadic network data.” Social Networks, Vol 19, pages 375-397.
Hoff, P.D., 2005. Bilinear mixed-effects models for dyadic data. Journal of the american Statistical association, 100(469), pp.286-295.
https://www.stat.washington.edu/~pdhoff/
θ1
β
γ01
ai
bi’
β
β
Cross-nesting
24
Nominees nested within nominators
Nominator nested within nominees
nominator effects ai
Nominee effects bi’
Selection and Influence
25
Influence
selection
0 1 2 3
Time
Change in Behavior
Change in Relations
Behavior |
Relations |
Leenders, R. (1995). Structure and influence: Statistical models for the dynamics of actor attributes, network structure and their interdependence. Amsterdam: Thesis Publishers.
Causality
26
Peter Hoff’s additive, multiplicative, and latent factor models
27
See also: Sweet, T. M., Thomas, A. C., & Junker, B. W. (2013). Hierarchical Network Models for Education Research Hierarchical Latent Space Models. Journal of Educational and Behavioral Statistics, 38(3), 295-318.
Latent space adjusted approach
Assume latent social position in a n-dim space that can determine probability of interaction
28
From Hoff et al. (2002)
Note here we use actors i and j instead of i and i’
reflection
Latent Space as Proxy for Unobserved Attributes with Tie to k
29
i
k
j
Latent space
L
M
Observed network
Shadow network
ci
cj
ck
Latent Space as Proxy for Unobserved Attributes: No Tie to k
30
i
k
j
Latent space
L
M
Observed network
Shadow network
ci
cj
ck
Hoff’s notation
31
Tie
i 🡪 j
Dyadic covariate
Row covariate
Sender
nominator
Column covariate
Receiver
nominee
Row random effect
Column random effect
Similarity on latent factors
Wii’
Latent Space vs Latent Factor
32
-5 -4 -3 -2 -1 0 1 2 3 4 5
3
2
1
0
-1
-2
-3
Dim 1
Dim 2
Sqrt(1.22 +-.52)=1.3
Selection of Network Partners: Graphical Representation
33
VDE
VCE
VBE
Shared paths (as in ERGM) =2
E
Selection of Network Partners: Graphical Representation
34
uDE
uCE
uBE
Shared nominations as in ERGM=2
E
Latent factor instead of latent space
35
36
Page 11
37
Software
38
Latent space Selection Model
40
Absolute value of difference
in attributes
Represents the effect of difference in attribute
+Plotted Distance between i and i’
Latent factor
41
Reciprocity: Wii’ (as yij) Wi’i (as yji) Modeled Simultaneously (Lazega and Van Duijn 1997)
42
Selection Model (p2)
43
Pair Level (i,i’)
Sender Level (i) or nominator
Receiver Level (i’) or nominee
ui ~N(0,τu)
Vi’ ~N(0,τv)
Difference
In attribute
reciprocity
Sender
attribute
Receiver
attribute
Sender
variance
Receiver
variance
Toy Data
44
Network (w)
Attribute
attr1 attr2
2
2
3
2
1
2
total
Total 1 2 3 3 1 2
In-degree
Out-degree
Hoff’s notation
45
Tie
i 🡪 j
Dyadic covariate
Row covariate
Sender (out-degree)
nominator
Column covariate
Receiver (in-degree)
nominee
Row random effect
Column random effect
Similarity on latent factors
Toyfit1: �Latent factor selection models �line 62: toyfit1 <-ame(matnet, Xr=allatt, print=F)
46
Regression coefficients:
pmean psd z-stat p-val
intercept 0.278 0.366 0.758 0.448
attr1.row -0.072 0.402 -0.179 0.858
attr2.row 0.117 0.552 0.211 0.833
Variance parameters:
pmean psd
va 0.365 0.346
cab 0.025 0.157 (covar of a,b)
vb 0.261 0.171
rho 0.522 0.231 (reciprocity)
ve 0.326 0.144
Pmean is posterior mean effect
Psd is approximate posterior standard error
For a 1 unit increase in attr1 the log odds of a tie increase .117
47
W
|Yi-Yi’ |
toyfit3 <-ame(matnet, Xd=abattr1, print=F)�summary(toyfit3)�
48
Regression coefficients:
pmean psd z-stat p-val
intercept 0.785 0.352 2.230 0.026
.dyad -0.168 0.080 -2.106 0.035
[difference on attribute 1 is stat sig]
Variance parameters:
pmean psd
va 0.243 0.166
cab 0.019 0.119
vb 0.262 0.200
rho 0.423 0.270
ve 0.268 0.123
Two dimensions of latent space�toyfit6 <-ame(matnet, Xd=abattr1, Xr=attr1, Xc=attr2, R=2, print=F, family="nrm")�summary(toyfit6)
49
2 dimensions latent space
fit
No latent space�toyfit6nl <-ame(matnet, Xd=abattr1, Xr=attr1, Xc=attr2, print=F, family="nrm")�summary(toyfit6nl)
50
Selection Exercise 2
1) Start by modeling with just attribute 1 as a row (sender covariate)
2) Changing data
51
Exercise for P2
53
Using statnet
54
55
Basic statnet setup
56
Basic ERGM in statnet �from statnet tutorial
57
58
59
60
61
Concerns about Causality
62
Outcome: advice from i’ to i
Prior advice from i’ to i
Similarity of behavior between i’ and i
Influence
selection
63
. For every pair of school staff i and j, if i turned to j for advice about instruction the i → j relationship was assigned a value of 1 and 0 otherwise.
64
65
Selection Application�Transition from Social Exchange to Systemic Exchange Via Quasi-Ties�video : (42:25-48:00)
66
Frank, K.A. 2009 Quasi-Ties: Directing Resources to Members of a Collective �American Behavioral Scientist. 52: 1613-1645
p2 extended model
67
Quasi-tie
Interaction of Close Colleagues and Identification of the Potential Provider on the Provision of Help: Evidence of a Quasi-Tie
68
Cross Nested Multilevel Poisson Regression (i.e., p2 social network model)�of Extent (# of days per year) to which i’ Helped i
69
Quasi-tie
Examples of Selection
Frank, K.A., Muller, C., Mueller, A.S., 2013. The Embeddedness of Adolescent Friendship Nominations: The Formation of Social Capital in Emergent Network Structures. American Journal of Sociology, Vol 119(1):216-253. Media hits: Atlantic; Huffington Post; Huffington Post (Op-ed); US News and World Report; Yahoo; Health Day; RedOrbit;The Times of India; MSUToday: Psych Central: Positions and Promotions; Deccan Chronicle: wood radio: Fox Chicago: local news channels (south Carolina): Science Daily;National Science Foundation
Spillane, J., Kim, Chong Min, Frank, K.A. 2012. “Instructional Advice and Information Providing and Receiving Behavior in Elementary Schools: Exploring Tie Formation as a Building Block in Social Capital Development.” American Educational Research Journal. Vol 49 no. 6 1112-1145
Crosnoe, Robert, Anna Strassman-Mueller, and Frank, K.A. 2008. “Gender, Body Size, and Social Relations in American High Schools.” Social Forces 86: 1189-1216.
Frank, K.A. & Yasumoto, J. (1998). "Linking Action to Social Structure within a System: Social Capital Within and Between Subgroups." American Journal of Sociology, Volume 104, No 3, pages 642-686
70
Selection Answers
71
Selection Answers
72
Selection Answers
73
Note: variable is 1 if different gender, 0 if same gender.
Could also make it: 1 if same gender, 0 if different gender
Selection answers
74
C
D
For Bob and Jane: .4-.2(1)-.5(1)= -.3
If θ0=.4 and θ1= -.2 and θ2= -.5 then
Bob : Lisa
Scenarios for the Network analyst
For each of the scenarios below,
identify the theoretical processes at work
write down what model or tool you would employ to evaluate the theory.
describe what data you would collect to apply the model or tool to
describe what estimation procedure/tool you would use.
Sally is concerned that her daughter is experimenting with alcohol and thinks it is because her daughter’s friends are experimenting. Sally wonders generally if adolescents tend to drink more if their friends drink alcohol.
Michael wants to understand the social structure of his synagogue (church). He has an idea that there are certain sets of people who interact with each other, and, if he could understand what those sets of people are, he might better be able to tailor programs of the synagogue to be more effective.
How could Michael use the information above track the diffusion of new beliefs or behaviors in his synagogue?
Pennie wants to know under what conditions one social service agency would allocate resources to another. Is it because they have a history of doing so, they share clients, they deal with similar issues, etc.
What clustering among social service agencies might emerge as a result of the processes above?
75
Reflection