EGO NETWORK ANALYSIS��SOCIAL NETWORKS AND HEALTH�DUKE UNIVERSITY, 2019
Brea L. Perry
Professor of Sociology
Indiana University Network Science Institute
ROADMAP
SOCIOCENTRIC NETWORKS
EGOCENTRIC NETWORKS
MAJOR ADVANTAGES OF EGO NETWORKS
Practical advantages: Flexibility in data collection
MAJOR ADVANTAGES OF EGO NETWORKS
Broader inference
MAJOR ADVANTAGES OF EGO NETWORKS
Theoretical advantages: Unboundedness
EXAMPLE NAME GENERATOR
Multiple name generator strategy from the Social Factors and HIV Risk (SFHR) project (Friedman et al. 2006) |
In the past 30 days, who are the people who you…
|
EGO NETS: DISADVANTAGES
DISADVANTAGES (MAYBE NOT)
Jeffrey Smith’s simulation approach constructs full networks that are consistent with each piece of information extracted from the ego network sample
Smith, Jeffrey A. 2015. “Global Network Inference from Ego Network Samples: Testing a Simulation Approach.” The Journal of Mathematical Sociology 39:125-162.
Smith, Jeffrey A. 2012. “Macrostructure from Microstructure: Generating Whole Systems from Ego Networks.” Sociological Methodology 42:155-205.
Smith, Jeffrey A. and Jessica Burrow. 2018. “Using Ego Network Data to Inform Agent Based Models of Diffusion.” Sociological Methods & Research. doi: 10.1177/0049124118769100
Comparison of diffusion curves from true networks and sampled-based estimates using Add Health
“Across all analyses, the diffusion curves based on the sampled data are very similar to the curves based on the true, complete network.”
COMMON MEASURES IN EGOCENTRIC NETWORK ANALYSIS
WHAT ARE WE TRYING TO OPERATIONALIZE?
MEASUREMENT
Ego network measures are based on:
EGO-ALTER TIES
Degree
EGO-ALTER TIES
Degree
EGO-ALTER TIES
Multiplexity
EGO-ALTER TIES
Multiplexity
EGO-ALTER TIES
Tie strength
EGO-ALTER TIES
Tie strength
EGO-ALTER TIES
Other relationship characteristics
ALTER ATTRIBUTES
Composition
ALTER ATTRIBUTES
Composition (categorical)
ALTER ATTRIBUTES
Composition (continuous)
ALTER ATTRIBUTES
Ego-alter similarity
1) Preference - people tend to socialize and form bonds with others like them (homophily)
ALTER ATTRIBUTES
Ego-alter similarity
2) Availability - people tend to socialize and form bonds with people they come into contact with (shared foci of activity)
ALTER ATTRIBUTES
Ego-alter similarity
3) Influence - people become more similar over time through repeated social interactions
ALTER ATTRIBUTES
Ego-alter similarity
ALTER ATTRIBUTES
Ego-alter similarity (categorical)
Nexternal-Ninternal
network size
ALTER ATTRIBUTES
Ego-alter similarity (categorical)
ALTER ATTRIBUTES
Ego-alter similarity (categorical)
1) Calculate expected value
2) Calculate chi-square
3) Normalize so value (phi) ranges from 0-1
ALTER ATTRIBUTES
Ego-alter similarity (continuous)
ALTER ATTRIBUTES
Ego-alter similarity (continuous)
ALTER ATTRIBUTES
Heterogeneity or “range”
ALTER ATTRIBUTES
Heterogeneity (categorical)
ALTER ATTRIBUTES
Heterogeneity (categorical)
HETEROGENEITY
HETEROGENEITY
Heterogeneity (categorical)
HETEROGENEITY
HETEROGENEITY
Heterogeneity (continuous)
ALTER-ALTER TIES
ALTER-ALTER TIES
Burt’s structural holes
ALTER-ALTER TIES
Burt’s structural holes
Three actor network with no structural holes
Three actor network with one structural hole
1
2
A
B
C
A
B
C
ALTER-ALTER TIES
Coleman’s closure
ALTER-ALTER TIES
Density
ALTER-ALTER TIES
Density
Sparsely-knit, with 3 of 42 possible ties present
Density = (2*3)/(7*(7-1)) = 0.14
ALTER-ALTER TIES
Effective size
ALTER-ALTER TIES
Effective size
Effective size = 3
Effective size = actual size – redundancy = 3-2 = 1
A
B
C
D
A
B
C
D
ALTER-ALTER TIES
Network size = 7
Alters 1 and 5 are isolates
Alters 2, 4, 6, 7 are connected to one other alter
Alter 3 is connected to two alters
Mean ties per alter = (0+0+1+1+1+1+2)/7 is 0.9
Effective size = 7 – 0.9 = 6.1
ALTER-ALTER TIES
Efficiency
MANAGING
EGOCENTRIC DATA
CONVENTIONAL DATA STRUCTURE
CONVENTIONAL STRUCTURE MODIFIED FOR NETWORKS
CONVENTIONAL STRUCTURE MODIFIED FOR NETWORKS
ID | age | female | aage1 | atie1 | aclose1 | aage2 | atie2 | aclose2 |
1 | 28 | 0 | 18 | 4 | 2 | 22 | 3 | 1 |
2 | 36 | 1 | 45 | 1 | 1 | 46 | 1 | 3 |
3 | 21 | 0 | 33 | 3 | 1 | 63 | 1 | 2 |
4 | 45 | 1 | 27 | 2 | 3 | 43 | 5 | 2 |
5 | 51 | 1 | 31 | 1 | 1 | 19 | 3 | 1 |
CONVENTIONAL STRUCTURE MODIFIED FOR NETWORKS
ID | afrnd1-2 | afrnd1-3 | afrnd1-4 | afrnd2-3 | afrnd2-4 | afrnd3-4 |
1 | 0 | 0 | 1 | 1 | 0 | 0 |
2 | 0 | 1 | 1 | 0 | 1 | 0 |
3 | 1 | 0 | 0 | 0 | 1 | 0 |
4 | 1 | 1 | 0 | 1 | 1 | 1 |
5 | 1 | 1 | 1 | 0 | 1 | 0 |
LONG-FORM (“TIDY”) DATA
LONG-FORM (“TIDY”) DATA
egoID | alterID | age | female | race | aage | atie | aclose | asmoker |
1 | 1 | 28 | 0 | 2 | 46 | 1 | 4 | 0 |
1 | 2 | 28 | 0 | 2 | 52 | 4 | 1 | 1 |
1 | 3 | 28 | 0 | 2 | 19 | 3 | 3 | 1 |
2 | 1 | 45 | 1 | 1 | 23 | 2 | 2 | 0 |
2 | 2 | 45 | 1 | 1 | 47 | 3 | 1 | 1 |
3 | 1 | 53 | 0 | 3 | 61 | 2 | 1 | 1 |
3 | 2 | 53 | 0 | 3 | 33 | 1 | 2 | 0 |
3 | 3 | 53 | 0 | 3 | 39 | 1 | 3 | 0 |
LONG-FORM (“TIDY”) DATA
egoID | alterID | age | female | aage | atie | know1-2 | know1-3 | know1-4 | know2-3 |
1 | 1 | 28 | 0 | 46 | 1 | 1 | 0 | NA | 1 |
1 | 2 | 28 | 0 | 52 | 4 | 1 | 0 | NA | 1 |
1 | 3 | 28 | 0 | 19 | 3 | 1 | 0 | NA | 1 |
2 | 1 | 45 | 1 | 23 | 2 | 0 | NA | NA | NA |
2 | 2 | 45 | 1 | 47 | 3 | 0 | NA | NA | NA |
3 | 1 | 53 | 0 | 61 | 2 | 0 | 1 | NA | 1 |
3 | 2 | 53 | 0 | 33 | 1 | 0 | 1 | NA | 1 |
3 | 3 | 53 | 0 | 39 | 1 | 0 | 1 | NA | 1 |
TRANSFORMING DATA
MEASUREMENT AND AGGREGATION
EGOCENTRIC NETWORK ANALYSIS
Egocentric network analysis poses problems:
EGOCENTRIC NETWORK ANALYSIS
Two strategies for dealing with these complications:
AGGREGATION TO EGO LEVEL
EGO NETWORKS IN MULTIVARIATE REGRESSION
NETWORKS AS INDEPENDENT VARIABLES
Social integration
COMMON MODEL VIOLATIONS IN NETWORK RESEARCH
PARALLEL PLAY
Regression with ego nets in R
Suppose we are interested in knowing
how personal network density is
associated with happiness…
DESCRIBING DENSITY
> describe(data$shdensity)
data$shdensity
n missing distinct Info Mean Gmd .05
1167 367 40 0.981 1.09 0.3413 0.500
.10 .25 .50 .75 .90 .95
0.700 0.917 1.050 1.333 1.500 1.500
lowest : 0.000 0.100 0.167 0.200 0.250, highest: 1.350 1.400 1.417 1.450 1.500
LOGISTIC REGRESSION
glm(formula = vhappy ~ shdensity + female + educyrs + married,
family = binomial(link = "logit"), data = data)
Deviance Residuals:
Min 1Q Median 3Q Max
-1.0629 -0.8829 -0.7396 1.4137 1.9189
Coefficients:
Estimate Std. Error z value Pr(>|z|)
(Intercept) -2.16586 0.43834 -4.941 7.77e-07 ***
shdensity 0.45417 0.22304 2.036 0.041723 *
female -0.03881 0.13171 -0.295 0.768280
educyrs 0.03972 0.02237 1.775 0.075845 .
married 0.49404 0.13519 3.654 0.000258 ***
> exp(coef(model4))
(Intercept) shdensity female educyrs married
0.1146515 1.5748642 0.9619374 1.0405213 1.6389277
Win for Burt or Coleman?
PARALLEL PLAY
Interactions with ego nets in R
Suppose we are interested in knowing
whether the effect of personal network density on happiness is moderated by marital status…
INTERACTIONS
glm(formula = vhappy ~ shdensity * married + female + educyrs,
family = binomial(link = "logit"), data = data)
Estimate Std. Error z value Pr(>|z|)
(Intercept) -2.85703 0.54524 -5.240 1.61e-07 ***
shdensity 1.07553 0.35851 3.000 0.00270 **
married 1.61387 0.51641 3.125 0.00178 **
female -0.02274 0.13226 -0.172 0.86348
educyrs 0.04007 0.02244 1.786 0.07411 .
shdensity:married -1.01564 0.44908 -2.262 0.02372 *
> exp(coef(model5))
shdensity married female educyrs shdensity:married
2.93154622 5.02223440 0.97751515 1.04088731 0.36217219
> # Effect of density for married individuals
2.93154622*0.36217219
[1] 1.061725
MULTILEVEL MODELS
TWO MAJOR PARTS OF ANY MODEL
Part I: Model for the means
TWO MAJOR PARTS OF ANY MODEL
Part II: Model for the variances
WHEN AND WHY TO USE AN MLM FOR EGO NETWORK RESEARCH
MLM FOR SOCIAL NETWORKS
When to use MLM for ego SNA: Formal requirements
MLM FOR SOCIAL NETWORKS
Why to use MLM for ego SNA
MLM RESEARCH QUESTIONS
DEPENDENCY
Alter obs nested in same ego are not independent
DEPENDENCY
RANDOM INTERCEPT MODEL
We are just making piles of variance, not reducing overall variance
OLS
Random intercept MLM
RANDOM INTERCEPT MODEL
zeta
epsilon
INTRACLASS CORRELATION
rho
psi
theta
INTRACLASS CORRELATION
INTRACLASS CORRELATION
RANDOM INTERCEPT MODEL
WHAT RELATIONSHIP FACTORS AFFECT LIBIDO?
WHAT RELATIONSHIP FACTORS AFFECT LIBIDO?
Variation within
Variation within
Variation between
COMMUNICATION AND LIBIDO
Both Jane and Joe get their own random intercept
Jane’s regression line
Joe’s regression line
y = # sexual contacts
x = quality of communication
3
2
1
0
COMMUNICATION AND LIBIDO
Ego’s get their own random intercept based on their alter/tie observations
Every ego gets their own regression line
y = # sexual contacts
x = quality of communication
3
2
1
0
PARALLEL PLAY
Random intercept
model in R
RUNNING MLM IN R
Suppose we want to look at the effects of ego and alter gender on the number of support functions provided by an alter to an ego
EMPTY RI MODEL
SD of the residuals
SD of the random intercepts
Distribution of residuals (standardized)
y intercept
EMPTY RI MODEL
We usually prefer to report the variance rather than the SD of random components, and we need the variance to calculate ICC
EMPTY RI MODEL
RI MODEL WITH PREDICTORS
CLUSTER CONFOUNDING AND CONTEXTUAL EFFECTS
CLUSTER CONFOUNDING: A MAJOR THREAT TO RE MODELS
CONTEXTUAL EFFECTS
PARALLEL PLAY
Contextual effects
in R
CONTEXTUAL EFFECTS IN R
CONTEXTUAL EFFECTS IN R
THE RANDOM
COEFFICIENT MODEL
RANDOM COEFFICIENT MODEL
COMMUNICATION AND LIBIDO
Jane’s regression line
Joe’s regression line
y = # sexual contacts
x = quality of communication
3
2
1
0
RANDOM COEFFICIENT MODEL
The random coefficient linear regression model:
COMMUNICATION AND LIBIDO
Jane’s regression line
Joe’s regression line
y = # sexual contacts
x = quality of communication
2
1
0
COMMUNICATION AND LIBIDO
Egos get their own random intercept and slope based on their alters
Every ego gets their own regression line
y = # sexual contacts
x = quality of communication
3
2
1
0
RANDOM COEFFICIENT MODEL
We are still just making piles of variance, not reducing overall variance
OLS
Random intercept MLM
Random coefficient MLM
PARALLEL PLAY
Random coefficient
Model in R
RC MODEL IN R
RC MODEL WITH PREDICTORS
I perform a nested Likelihood Ratio test using stored estimates to determine whether the random slopes are significantly different from zero…
If p-value is less than .05, I reject the null hypothesis that the random coefficients are equal to zero and use random coefficient model
RC MODEL
SD of the residuals
SD of the random intercepts
SD of the random slopes
RC MODEL
Correlation between random slopes and random intercepts
Correlation of .13 between random slopes and intercepts suggests that in ego networks that provide more support functions, on average (intercept), the effect of alter gender (slope) is larger compared to networks that support less.
RC MODEL
Intraclass correlation
CROSS-LEVEL
INTERACTIONS
CROSS-LEVEL INTERACTIONS ARE COOL!
PARALLEL PLAY
Cross-level interactions
Model in R
CROSS-LEVEL INTERACTIONS IN R
CROSS-LEVEL INTERACTIONS IN R
Change in effect of alter gender when ego gender=1
Effect of alter gender when ego gender= 0
Change in effect of network gender comp. when ego gender=1
Effect of network gender comp. when ego gender= 0
CROSS-LEVEL INTERACTIONS IN R
not significant
EGO NETWORK
DYNAMICS
WHAT WE KNOW ABOUT SOCIAL NETWORK DYNAMICS
WHAT WE KNOW ABOUT SOCIAL NETWORK DYNAMICS
Networks are comprised of two basic components:
WHAT WE KNOW ABOUT SOCIAL NETWORK DYNAMICS
Periphery is a problem for cross-sectional network studies
HOW TO MEASURE NETWORK CHANGE
Problem 1: Real change or methodological artifact?
HOW TO MEASURE NETWORK CHANGE
Problem 2: Determining what alter-level changes underlie network-level change
Suppose the mean freq of contact with network members decreases from W1 to W2. This can be due to…
HOW TO MEASURE NETWORK CHANGE
Solution: Real change or methodological artifact?
HOW TO MEASURE NETWORK CHANGE
MEASURES OF NETWORK CHANGE
Measures that capture network turnover
MEASURES OF NETWORK CHANGE
N dropped or added
N unique alters pooled
Network turnover, Perry & Pescosolido (2012)
HOW TO ANALYZE NETWORK CHANGE
If goal is to describe change:
HOW TO ANALYZE NETWORK CHANGE
If goal is to describe change:
HOW TO ANALYZE NETWORK CHANGE