ERGM et al
MGT 780 SPRING 2022
STEVE BORGATTI
(C) 2022 STEPHEN P BORGATTI
1
25-Apr-22
26 APR
Agenda
(C) 2022 STEPHEN P BORGATTI
2
25-Apr-22
Problem statement
(C) 2022 STEPHEN P BORGATTI
3
25-Apr-22
Tie dependencies
(C) 2022 STEPHEN P BORGATTI
4
25-Apr-22
Node attributes & dyadic covariates
(C) 2022 STEPHEN P BORGATTI
5
25-Apr-22
Model assumptions
(C) 2022 STEPHEN P BORGATTI
6
25-Apr-22
Evolutionary process
(C) 2022 STEPHEN P BORGATTI
7
25-Apr-22
Formal model
(C) 2022 STEPHEN P BORGATTI
8
25-Apr-22
What is ERGM?
(C) 2020 STEPHEN P BORGATTI
9
25 April 2022
Converting problem to a logistic regression
(C) 2020 STEPHEN P BORGATTI
10
25 April 2022
Thanks to Filip Agneessens
Can cross out the Ks
(C) 2020 STEPHEN P BORGATTI
11
25 April 2022
Change in graph statistics that result from tie present vs absent
Odds <- change statistics
(C) 2020 STEPHEN P BORGATTI
12
25 April 2022
The “logit” of a probability is the log of the associated odds ratio
Logits to probabilities
(C) 2022 STEPHEN P BORGATTI
13
25-Apr-22
So, a model written in terms of odds can ultimately be converted back to probabilities
Interpreting parameters
(C) 2022 STEPHEN P BORGATTI
14
25-Apr-22
Simplest ERGM
(C) 2020 STEPHEN P BORGATTI
15
25 April 2022
Estimate Std. Error MCMC % p-value
edges -1.609 0.245 NA <1e-04 ***
Using PAGM data
Logit(Xij =1) = b0(Δedges)
Adding closure parameter
(C) 2022 STEPHEN P BORGATTI
16
25-Apr-22
Padgm results
(C) 2020 STEPHEN P BORGATTI
17
25 April 2022
a
b
c
d
Logit(Xij =1) = b0(Δedges) + b1(Δtriangles)
The conditional log-odds of two actors having a tie is:
-1.764*(change in the number of ties) + 0.091*(change in number of triangles)
Estimate Std. Error MCMC % z value Pr(>|z|)
edges -1.76367 0.33136 0 -5.323 <0.0001 ***
triangle 0.09115 0.15760 0 0.578 0.563 NS
---
Note: your results will vary
Degree variance (centralization)
(C) 2022 STEPHEN P BORGATTI
18
25-Apr-22
(C) 2020 STEPHEN P BORGATTI
19
25 April 2022
Parameter | Name | Interpretation | Image |
Arc | Edges | This is a baseline propensity for tie formation | |
Reciprocity | Mutual | This is often positive, suggesting that reciprocated ties are very likely to be observed for positive affect networks | |
Simple connectivity | twopath | This measures the extent to which actors who send ties also receive them. It controls for the correlation between in and out degree. It is often negative. | |
Popularity spread | gwidegree | popularity spread. indicates a network with high in-degree nodes (i.e., centralized)—gwidegree is opposite | |
Activity spread | gwodegree | popularity spread. indicates a network with high out-degree nodes (i.e., centralized)—gwiegree is opposite | |
Triangulation | gwesp | A positive effect indicates there is a high degree of closure, or multiple clusters of triangles in the data | |
Cyclic closure | ctriple | A negative effect indicates tendencies against cyclic triads (sometimes this is interpreted as a tendency against generalized exchange or generalized reciprocity) | |
Multiple connectivity | gwdsp | 2-path in the networks. A negative estimate in conjunction with positive triangulation indicates that 2-paths tend to be closed (i.e., triangles) | |
Directed effects
(C) 2022 STEPHEN P BORGATTI
20
25-Apr-22
Exogenous effects
(C) 2022 STEPHEN P BORGATTI
21
25-Apr-22
Adding a node characteristic (exogenous var)
(C) 2020 STEPHEN P BORGATTI
22
25 April 2022
Estimate Std. Error MCMC % p-value
edges -2.59493 0.53606 NA <1e-04 ***
nodecov.wealth 0.01055 0.00467 NA 0.026 *
Adding a node characteristic (exogenous var)
(C) 2020 STEPHEN P BORGATTI
23
25 April 2022
Estimate Std. Error MCMC % p-value
edges -2.59493 0.53606 NA <1e-04 ***
nodecov.wealth 0.01055 0.00467 NA 0.026 *
ERGM Goodness of fit
(C) 2020 STEPHEN P BORGATTI
24
25 April 2022
ERGM Goodness of fit – cont.
(C) 2020 STEPHEN P BORGATTI
25
25 April 2022
ERGM in R
(C) 2022 STEPHEN P BORGATTI
26
25-Apr-22
(C) 2022 STEPHEN P BORGATTI
27
25-Apr-22
Padgm example
(C) 2022 STEPHEN P BORGATTI
28
25-Apr-22
Padgm triangles
(C) 2022 STEPHEN P BORGATTI
29
25-Apr-22
Goodness of fit
(C) 2022 STEPHEN P BORGATTI
30
25-Apr-22
Reflections on ERGM
(C) 2022 STEPHEN P BORGATTI
31
25-Apr-22
General comments
(C) 2022 STEPHEN P BORGATTI
32
25-Apr-22
Endogenous effects & exogenous covariates
(C) 2020 STEPHEN P BORGATTI
33
25 April 2022
Emergence & self-organization
(C) 2020 STEPHEN P BORGATTI
34
25 April 2022
Parameters and social processes
(C) 2022 STEPHEN P BORGATTI
35
25-Apr-22
The triangle parameter
(C) 2022 STEPHEN P BORGATTI
36
25-Apr-22
Triangle parameter – cont.
(C) 2022 STEPHEN P BORGATTI
37
25-Apr-22
Other models
(C) 2022 STEPHEN P BORGATTI
38
25-Apr-22
How does ergm relate to qap?
(C) 2022 STEPHEN P BORGATTI
39
25-Apr-22
When using a dyadic covariate in ergm …
(C) 2022 STEPHEN P BORGATTI
40
25-Apr-22
QAP | ERGM |
|
|
Longitudinal extensions to ERGM
(C) 2020 STEPHEN P BORGATTI
41
25 April 2022
SAOMs
(C) 2020 STEPHEN P BORGATTI
42
25 April 2022
SAOM process
(C) 2020 STEPHEN P BORGATTI
43
25 April 2022
(C) 2020 STEPHEN P BORGATTI
44
25 April 2022
| MR-QAP | ERGM | LERGM | SAOM | REM |
Summary description | Multiple regression in which cases are dyads and dependent variable is presence/absence or strength of tie. Significance assessed via permutation test | Odds of a tie modeled as a function of the contribution of that tie to graph statistics reflecting prevalence of micro-configurations | Odds of a tie modeled as a function of the contribution of that tie to graph statistics reflecting prevalence of micro-configurations | Actors make choices to add/drop ties to maximize utility function |
|
What is modeled | Presence/absence or strength of ties | - Probability of observed network given set of network statistics (e.g., counts of micro-configurations)�- Presence/absence of ties as equilibrium outcome of tie-wise change process | Presence/absence of ties | - Presence/absence of ties�- (optional) Change in node-level attributes (e.g., behaviors) | Occurrence of relational events |
Kinds of effects | Any variables that can be expressed in dyadic form, including both endogenous and exogeneous variables, as well as attribute-based variables�2. Endogeneous variables (e.g., whether a tie completes a transitive triple) | 1. Exogenous covariates (e.g., similarity of interests)�2. Endogenous variables (e.g., # of transitive triples that would be added if tie were added) | 1. Exogenous covariates (e.g., similarity of interests)�2. Endogenous variables (e.g., # of transitive triples that would be added if tie were added) | 1. Exogenous covariates (e.g., similarity of interests)�2. Endogenous variables (e.g., # of transitive triples that would be added if tie were added) |
|
Meaning of parameters | Positive parameter for X indicates that larger values of X are associated with greater probability or strength of tie | 1. Exogenous variables: change in odds of tie given unit increase in X�2. Endogenous variables: Positive parameter for X (e.g., transitivity) indicates that odds of a tie is higher to the extent its presence contributes to change in X (e.g., change in # of transitive triples) | 1. Exogenous variables: change in odds of tie given unit increase in X�2. Endogenous variables: Positive parameter for X (e.g., transitivity) indicates that odds of a tie is higher to the extent its presence contributes to change in X (e.g., change in # of transitive triples) | 1. Exogenous variables: change in odds of tie given unit increase in X�2. Endogenous variables: Positive parameter for X (e.g., transitivity) indicates that odds of a tie is higher to the extent its presence contributes to change in X (e.g., change in # of transitive triples) |
|
Approach to change | Implicit. Model gives effect of X on Y. Hence, we predict change in Y given a change in X. Variants can model Y(t)-Y(t-1) or control for Y(t-1) when modeling Y(t) | Implicit. Model gives effect of X on Y. Hence, we predict change in Y given a change in X. TERGM variant can control for Y at t-1 | Explicit. Continuous time framework updates dyadic dependencies as each dyad changes. Ordering of tie changes makes a difference | Explicit. Continuous time framework updates dyadic dependencies as each dyad changes. Ordering of tie changes makes a difference |
|
Approach to non-independence of observations in dyadic data | Control for sources of dependence via permutation method | Explicitly model sources of dependence between dyads | Explicitly model sources of dependence between dyads | Explicitly model sources of dependence between dyads; dependencies can be asymmetric (dependency of i-->j on u-->v does not imply the reverse) |
|
Goodness of fit | Dyadic. Difference, across all dyads, between observed Yij and predicted Yij | Whole network. Do networks simulated from the model have the same overall network statistics as the observed network? | Whole network. Do networks simulated from the model have the same overall network statistics as the observed network? | Whole network. Do networks simulated from the model have the same overall network statistics as the observed network? |
|
Temporal variants | - Can include lagged versions of both Xs and Y as controls�- Can explicitly model Y(t) - Y(t-1) | - Include lagged versions of both Xs and Y as controls (aka TERGM)�- Model tie formation and dissolution independently (aka STERGM) | Not applicable | Not applicable | Not applicable |