JavaScript isn't enabled in your browser, so this file can't be opened. Enable and reload.

1 of 78

Multilevel modeling and ego network dynamics

Brea L. Perry

Indiana University

2 of 78

MLM for social networks

3 of 78

MLM for social networks

Like students in schools or obs over time in people

4 of 78

MLM for social networks

When to use MLM for ego SNA: Formal requirements

DV is an alter or tie-level variable (level-1)

If you are interested in predicting a characteristic of ego (e.g., health, employment outcomes, movement participation), MLM is not appropriate
IVs can be alter, tie, network, or ego-level variables (level-1 or 2)

Personal networks of egos do not overlap (or overlap is negligible)
Ego observations are independent of one another

5 of 78

MLM research questions

What affects formation of ties to alters with particular attributes?

What kinds of egos are more likely to form cross-racial ties?

What affects alter behavior or contributions?

What kinds of ego and network characteristics affect alter provision of instrumental support?

What affects characteristics of dyads/relationships?

What ego and alter characteristics affect likelihood of sexual activity between members of a dyad?
What ego, alter, and tie characteristics affect level of conflict in a dyad?

6 of 78

What is MLM?

Same as other terms you have heard:

General Linear Mixed Model
Random Effects Model
Fixed Effects Model
Variance Components Model
Hierarchical Linear Model
Growth Curve Model (if using longitudinal data)

7 of 78

Two major parts of any model

Part I: Model for the means

AKA fixed effects part of the model (i.e., fixed parameters)
What you are used to caring about for testing hypotheses
How the expected outcome for a given observation varies on average as a function of values of predictor variables

8 of 78

Two major parts of any model

Part II: Model for the variances

AKA random effects and residuals (i.e., stochastic or varying parameters)
What you are used to making assumptions about
How residuals are distributed and related across observations (persons, groups, time, etc.) are the primary way that multilevel models differ from general linear models (e.g., regression)

9 of 78

Dependency

Model dependency or autocorrelation across observations

10 of 78

Why use MLM?: Dependency

Where does dependency come from? Two sources:

Mean differences across sampling units (e.g., egos)

Represented by a random intercept in our models

Differences in effects of predictors across sampling units

Longitudinal: individual differences in growth or change
Clustered: group differences in slopes of predictors
Represented by random slopes in our models

11 of 78

Why use MLM?: Dependency

Ignoring multi-level structure has implications for statistical validity

Depresses standard errors, makes it easier to find significance when there really is none
Can affect parameter estimates when extreme

Multilevel model accounts for clustering (non-independence) and allows you to explicitly model it rather than just control for it

12 of 78

Review of General Linear Model (GLM)

The error term is a random parameter
Try to minimize error by adding covariates that explain as much variance as possible

13 of 78

What we’re doing with a GLM

Each red arrow represents error - the line of fitted values does not go through every data point, so there is error in the estimate

We want to draw a regression line that minimizes error…this is what OLS does

e_i

e_i = Observed value – Fitted value

14 of 78

Review of GLM

Problem is that error terms for observations clustered within an individual are probably correlated
Violates OLS assumptions

15 of 78

Random-intercept model

Explicitly model the error dependence by splitting up the error term into level-1 and level-2 components
Have a random intercept for level-2 person j that is constant across all level-1 observations
Have an error term for each obs i clustered within person j

zeta

epsilon

16 of 78

Random-intercept model

Think about this as simply splitting up a pile of variance (i.e., differences from the mean) into two types:

Between-cluster (BC) variation

Level-2 “INTER-cluster differences” (zeta)
How (and why) do ego nets differ from other ego nets?

Within-cluster (WC) variation

Level-1 “INTRA-cluster differences” (epsilon)
How (and why) do alters within the same ego differ from other alters in the same network, on average?

17 of 78

Random intercept model

We are just making piles of variance, not reducing overall variance

OLS

Random-intercept MLM

18 of 78

Random-intercept model

psi

theta

19 of 78

Intraclass correlation

Rho is a measure of between-cluster heterogeneity OR as within-cluster homogeneity (two sides of the same coin)
Typically call it the intraclass correlation, which is a measure of within-cluster correlation

rho

20 of 78

Intraclass correlation

21 of 78

Intraclass correlation

ICC is a standardized way of expressing how much we need to worry about dependency due to cluster mean differences

Bigger ICC 🡪 more messed up standard errors

22 of 78

Sexual contact example

Suppose I want to predict how often couples have sex…

23 of 78

Sexual contact example

Two egos Jane and Joe and five sex partners (dyads)

What can we say about Jane and Joe’s sex lives?
What can we say about within and between variation?

Variation within

Variation between

24 of 78

Sexual contact example

Both Jane and Joe get their own random intercept

Jane’s regression line

Joe’s regression line

y = # sexual contacts

x = number of instruments

25 of 78

Sexual contact example

Egos get their own random intercept based on their alter/tie observations

Every ego gets their own regression line

Intercept is “random” (varies)
Slope is constant

y = # sexual contacts

x = number of instruments

26 of 78

Sexual contact example

What if the effects of x on y within egos are different?

Jane’s regression line

Joe’s regression line

y = # sexual contacts

x = number of instruments alter plays

27 of 78

Random-coefficient model

The random-coefficient linear regression model:

28 of 78

Sexual contact example

Jane’s regression line

Joe’s regression line

y = # sexual contacts

x = number of instruments alter plays

29 of 78

Sexual contact example

Egos get their own random intercept and slope based on their alters

Every ego gets their own regression line

Intercept is “random” (varies)
Slope is “random” (varies)

y = # sexual contacts

x = number of instruments alter plays

30 of 78

Random coefficient model

We are still just making piles of variance, not reducing overall variance

OLS

Random-intercept MLM

Random-coefficient MLM

31 of 78

Random effects, between effects, and fixed effects

32 of 78

Cluster confounding: A threat to RE models

The RE model assumes that Level-1 covariates are uncorrelated with the random intercept
Problematic because all Level-1 variables contain information about observations and groups (or individuals and individual change over time)
Another major problem is that the process of pooling assumes that within-cluster and between-cluster effects are equal

33 of 78

Cluster confounding example

Let’s return to our example of the effects of instruments on sexual frequency in sexual networks…
In typical random effects model, we are assuming that the within and between effects are the same:

Purely between-cluster effect 🡪 Do egos with higher mean levels of instrument-playing sex partners have higher mean levels of sexual activity?
Purely within-cluster effect 🡪 Do egos have more sex with partners in their network that play more instruments relative to other partners that play fewer?

34 of 78

Sexual contact example

Number of instruments

Three egos:

Jane

Joe

Jalissa

OLS regression line (completely pooled)

Sexual activity

35 of 78

Sexual contact example

Number of instruments

Sexual activity

BE regression line (aggregate model using ego means)

Three egos:

Jane

Joe

Jalissa

36 of 78

Sexual contact example

Number of instruments

Sexual activity

FE regression line (no pooling)

Three egos:

Jane

Joe

Jalissa

37 of 78

Sexual contact example

Number of instruments

OLS regression line (completely pooled)

Sexual activity

FE regression line (no pooling)

BE regression line (aggregate model using ego means)

Three egos:

Jane

Joe

Jalissa

38 of 78

Sexual contact example

Number of instruments

RE regression line (partially pooled)

Sexual activity

FE regression line (no pooling)

BE regression line (aggregate model using ego means)

Three egos:

Jane

Joe

Jalissa

39 of 78

Cluster confounding example

Interpreting the weighted average is problematic in this example

What is the comparison group?
Coefficient is biased
Significance tests will tell us there is no effect
Meaningful and interesting effects are masked

40 of 78

There are solutions…

Add a “contextual effect”
Split alter variance into within and between
Use a fixed effects or between effects model (depending on your research question and data properties)

41 of 78

Contextual effects

Add a contextual effect of alter/tie-level variables by including the aggregated (usually mean) version of the variable

This tests whether the within and between effect are equal. If the contextual effect is non-significant, you don’t have to worry about biased estimates due to cluster confounding
Does the network context matter?

42 of 78

Contextual effects

In sexual contact example, you would add mean number of instruments across all alters within a network when modeling the effect of number of instruments each alter plays

Need to separate out the effects of ego being attracted to alters who play instruments (mean) from having Prince as a sexual partner (value for each alter)
The ego or network context could affect the true parameter value of the alter-level effect

43 of 78

Decomposing variance

44 of 78

Decomposing variance

In sexual contact example…

At Level-2, the variable is equal to the mean number of instruments played by alters in a network
At Level-1, the variable is equal to each alter’s deviation from the network mean number of instruments

45 of 78

Random vs. fixed effects models

We can think about the difference between random effects and fixed effects models as variations in the degree of pooling
Pooling = Degree to which each cluster mean is “pulled” toward the overall grand mean
As ICC 🡪 0, RE looks more like OLS; As ICC 🡪1, looks more like FE

No Pooling Partial Pooling Complete Pooling

OLS 🡪 Ignore clusters and assume they don’t matter

Random effects 🡪 Borrow information from the grand mean

Fixed effects 🡪 Within-cluster mean and variation are all that matter

46 of 78

MLM in Stata

What does the output/interpretation look like?

YOU CAN DO THIS!!!

47 of 78

MLM in Stata

48 of 78

MLM in Stata

Suppose we want to look at the effects of ego and alter gender on the number of support functions provided by an alter to an ego

49 of 78

Empty RI model

Step 1: Run an “empty” model (without covariates) to assess the random parts of the model

50 of 78

Empty RI model

If significant, you need MLM

51 of 78

Empty RI model

52 of 78

RI model with predictors

Model for the means is just business as usual

Most people ignore the model for the variances

Step 2: Add covariates and interpret as usual

53 of 78

Add a contextual effect

. *Random intercept model with contextual effect

. xtmixed support egofem altfem netfem10 || EGOID: , mle variance

Performing EM optimization:

Performing gradient-based optimization:

Mixed-effects ML regression Number of obs = 19,409

Group variable: EGOID Number of groups = 1,050

Obs per group:

min = 2

avg = 18.5

max = 67

Wald chi2(3) = 40.11

Log likelihood = -24624.611 Prob > chi2 = 0.0000

------------------------------------------------------------------------------

support | Coef. Std. Err. z P>|z| [95% Conf. Interval]

-------------+----------------------------------------------------------------

egofem | .0189512 .0216578 0.88 0.382 -.0234973 .0613998

altfem | .0761756 .0126522 6.02 0.000 .0513778 .1009735

netfem10 | -.0217038 .0073214 -2.96 0.003 -.0360534 -.0073541

_cons | .8996575 .0332023 27.10 0.000 .8345822 .9647329

------------------------------------------------------------------------------

Random-effects Parameters | Estimate Std. Err. [95% Conf. Interval]

-----------------------------+------------------------------------------------

EGOID: Identity |

var(_cons) | .0431032 .0037524 .0363419 .0511224

-----------------------------+------------------------------------------------

var(Residual) | .7121663 .0074291 .6977535 .7268769

------------------------------------------------------------------------------

LR test vs. linear model: chibar2(01) = 377.53 Prob >= chibar2 = 0.0000

Step 3: Add a contextual effect and interpret

*Could also decompose variance but not if your predictor is binary (interpretation doesn’t make sense)

54 of 78

Ego network dynamics

55 of 78

Social network dynamics

Most research provides only a “snapshot” of networks
However, networks are constantly changing
Key is to disentangle “real” change from methodological artifact

56 of 78

What we know about social network dynamics

Structural properties of networks tend to remain stable over time

E.g., homeostasis in networks; social signatures

BUT lots of “turnover” or “churn” in the individuals that make up a network

Loss of ties does not mean networks are getting smaller – may just be replacement

57 of 78

What we know about social network dynamics

Toronto, Ontario residents: only 27% of ties persist over a decade (Wellman et al. 1997)
Caregivers’ support networks: 30% of ties persist over a decade (Suitor & Keeton 1997)
Recent widows: 22% of ties persist over one year (Morgan et al. 1997)

(Cornwell et al. 2021)

58 of 78

What we know about social network dynamics

At least three mechanisms of membership turnover:

Selective activation of latent ties (e.g., job search)
Internal dynamics (e.g., conflict)
External dynamics (e.g., moving away)

59 of 78

What we know about social network dynamics

Personal network dynamics operate similarly to biological evolution, which is to say that they undergo gradual, random ebbs and flows in membership punctuated by intense rapid shifts that correspond to significant events (Wellman)

Divorce
Major acute illness
Natural disaster
Migration
Retirement
Parenthood

60 of 78

What we know about social network dynamics

Networks are comprised of two basic components:

a smaller and more stable core

Densely-knit, mostly kin, highly supportive

a larger set of temporary or sporadic ties (the periphery)

Most turnover occurs in periphery

61 of 78

What we know about social network dynamics

Periphery is a problem for cross-sectional network studies

People engage in periods of brief and sporadic periods of meaningful contact (e.g. old friend visits)
The likelihood of these sometimes-inactive relationships being present in a snapshot of a network is essentially random
When peripheral ties are not captured, they are assumed to be absent rather than inactive
Instability does not mean real change

62 of 78

How to measure network change

Problem 1: Real change or methodological artifact?

Respondents forget to name alters from previous waves 5-10% of the time
Respondents deliberately underreport alters in subsequent waves because they know each alter = more work (i.e., panel conditioning)
Respondents give different names or spellings in subsequent waves (i.e., error)

63 of 78

How to measure network change

Problem 2: Determining what alter-level changes underlie network-level change

Suppose the mean freq of contact with network members decreases from W1 to W2. This can be due to…

Ego decreasing contact with alters who were present at both W1 and W2
Loss of alters with whom ego had frequent contact
Addition of new alters with whom ego has infrequent contact

64 of 78

How to measure network change

Solution: Real change or methodological artifact?

In each follow-up wave of a study…

Have egos name their current alters
Show them their roster from the previous wave or waves
Have them match alters across waves
Ask them why they didn’t name any dropped alters, and add if they report forgetting

65 of 78

How to measure network change

66 of 78

Measures of network change

Network-level measures that capture network turnover (pooled across two or more waves)

N/Prop alters dropped
N/Prop alters added
N/Prop stable alters

67 of 78

Measures of network change

N dropped or added

N unique alters pooled

Network turnover

(Perry & Pescosolido 2012)

68 of 78

How to analyze network change

If goal is to describe network-level change:

Simple comparison of ego network characteristics over time

E.g., Avg degree at W1 compared to avg at W2

Measure of difference between two waves

E.g., W2 degree – W1 degree

69 of 78

How to analyze network change

Source: Cornwell et al. 2014

Comparing mean characteristics of networks across waves…

70 of 78

How to analyze network change

If goal is to describe alter-level change:

Distinguish alters dropped, maintained, or added across W1 and W2

Can present number or percent of each

E.g., 35% of alters dropped, 35% maintained, 30% added

Compare characteristics of each

E.g., 75% of maintained alters are “very close” compared to 35% of dropped alters

71 of 78

How to analyze network change

Source: Cornwell et al. 2014

Comparing characteristics of stable, lost, and new alters…

72 of 78

How to analyze network change

If goal is to predict network-level change

Use two-level longitudinal multilevel models or growth (curve) models where obs over time at network level are nested in egos
Model the effects of time and (often) time-squared

E.g., how does network size change (non-linearly) with age

Decompose variance into within and between effects (the latter is growth/change)
Interact time with other predictors to test whether networks change at different rates as a function of other variables

E.g., Does the rate of change in network size with age depend on gender?

73 of 78

How to analyze network change

If goal is to predict alter-level change

Use three-level longitudinal multilevel models or growth (curve) models where obs over time are nested in alters are nested in egos
Model the effects of time and (often) time-squared

E.g., how does time in the illness career affect whether an alter drops out of the network

Interact time with other predictors

E.g., Does the likelihood of an alter dropping as ego moves through the illness career depend on how the alter is related to ego?

74 of 78

Questions?

75 of 78

Three important considerations when choosing FE or RE

1. Amount of variation within clusters compared to variation across clusters

If variation is primarily within clusters, use random effects

Can assess using ICC, and rule of thumb is ICC < 0.50

If variation is primarily between clusters, solution is more complicated (see next slide)

ICC >.50 indicates a “sluggish” variable (not much within cluster variation, OR obs within cluster are highly correlated)

76 of 78

Three important considerations when choosing FE or RE

2. Amount of data (obs per cluster and number of clusters) if you have a sluggish DV

With little data, use random effects
With lots of data, use fixed effects

77 of 78

Cross-level interactions

Change in effect of alter gender when ego gender=1

Effect of alter gender when ego gender= 0

. xtmixed support altfem##egofem c.netfem10##egofem || EGOID: altfem , mle covariance(unstructured) variance

Mixed-effects ML regression Number of obs = 19,409

Group variable: EGOID Number of groups = 1,050

Wald chi2(5) = 103.07

Log likelihood = -24593.322 Prob > chi2 = 0.0000

-----------------------------------------------------------------------------------

support | Coef. Std. Err. z P>|z| [95% Conf. Interval]

------------------+----------------------------------------------------------------

1.altfem | -.0234568 .018998 -1.23 0.217 -.0606921 .0137786

1.egofem | -.0578108 .0740929 -0.78 0.435 -.2030302 .0874086

altfem#egofem |

1 1 | .191793 .0255892 7.50 0.000 .1416391 .2419468

netfem10 | -.020383 .0112081 -1.82 0.069 -.0423506 .0015845

egofem#c.netfem10 |

1 | -.0033288 .0147954 -0.22 0.822 -.0323272 .0256697

_cons | .9356164 .0481098 19.45 0.000 .841323 1.02991

-----------------------------------------------------------------------------------

------------------------------------------------------------------------------

Random-effects Parameters | Estimate Std. Err. [95% Conf. Interval]

-----------------------------+------------------------------------------------

EGOID: Unstructured |

var(altfem) | .0022581 .0066916 6.78e-06 .7518713

var(_cons) | .0358574 .0049609 .0273411 .0470265

cov(altfem,_cons) | .0067692 .0045788 -.002205 .0157434

-----------------------------+------------------------------------------------

var(Residual) | .7093482 .007568 .6946691 .7243374

------------------------------------------------------------------------------

LR test vs. linear model: chi2(3) = 385.69 Prob > chi2 = 0.0000

Change in effect of network gender comp. when ego gender=1

Effect of network gender comp. when ego gender= 0

78 of 78

Cross-level interactions

When ego is a man, there is no significant effect of an alter being a woman (b=-0.02) on number of support functions. However, when ego is a woman, women alters are expected to provide 0.17 more support functions than men alters.
Interaction at Level-2 is not significant