Chapter 11 �Link Prediction
Summary
Reading
Link Prediction
Goal
Understanding how networks evolve
Problem definition
Given a snapshot of a network at time t, (accurately) predict the edges that will appear in the network during the interval (t, t+1)
Liben‐Nowell, David, and Jon Kleinberg. "The link‐prediction problem for social networks."
Journal of the American society for information science and technology 58.7 (2007): 1019-1031.
Examples of uses of
Link Prediction
Monitor terrorist networks – deducing possible interactions/missing links between terrorists (without direct evidence)
Suggest interactions or collaborations that haven’t yet been exploited within an organization
Friendship prediction (i.e., Facebook)
Link Prediction
Link prediction is used to predict future possible links in the network (e.g., Facebook).
Or, it can be used to predict missing links due to incomplete data (e.g., Food-webs)
RESEARCH ARTICLE
Link Prediction in Criminal Networks: A Tool for Criminal Intelligence Analysis
Giulia Berlusconi1, Francesco Calderoni1*, Nicola Parolini2, Marco Verani2,�Carlo Piccardi3*
1 Università Cattolica del Sacro Cuore and Transcrime, Milano, Italy, 2 MOX, Department of Mathematics, Politecnico di Milano, Milano, Italy, 3 Department of Electronics, Information and Bioengineering, Politecnico di Milano, Milano, Italy
* francesco.calderoni@unicatt.it (FC); carlo.piccardi@polimi.it (CP)
False Positive prediction issue�(i.e., we are likely to predict edges that will never appear)
Link Prediction
Task Complexity
Scientists who are close in the network �(i.e., have common colleagues)
→ will likely collaborate in the future
Concretizing an Intuition…
Goal:
score is a measure of proximity
Link Prediction
Workflow
Link Prediction
Unsupervised
Supervised
Define a set of proximity measures unrelated to the particular network
Extract knowledge from the network in order to improve prediction accuracy
Approaches
Unsupervised Link Prediction
Unsupervised Link Prediction
Unsupervised measurements rely on different structural properties of networks
Neighborhood measures
Path-based measures
Ranking
Liben‐Nowell, David, and Jon Kleinberg. "The link‐prediction problem for social networks." Journal of the American society for information science and technology 58.7 (2007): 1019-1031.
How many friends we have to share in order to become friends?
Common Neighbors:�the more friends we share, �the more likely we will become friends
Jaccard:�the more similar our friends circles are, �the more likely we will become friends
Unsupervised Link Prediction
Neighborhood measures
How many friends we have to share in order to become friends?
Adamic Adar:�the more selective our mutual friends are, �the more likely we will become friends
Preferential Attachment:�the more friends we have, �the more likely we will become friends
Unsupervised Link Prediction
Neighborhood measures
How distant are we?
Graph Distance:�(negated) length of the shortest path between two nodes
Katz:�weighted sum over all the paths between two nodes
Unsupervised Link Prediction
Path-based measures
How similar are we?
SimRank:�two nodes are similar to the extent that their neighborhoods are similar
Unsupervised Link Prediction
Ranking
Unsupervised Link Prediction
Limits
Measure comparison
Supervised Link Prediction
Supervised Link Prediction
The process is now organized in 4 steps:
A natural way to do it: �build a “classifier” over a set of network features.
Learn a Classifier (i.e., a Decision Tree) over unsupervised LP scores to generalize the assumption they made on the network growth model
Staking Unsupervised Scores
Supervised Link Prediction
Frequent Pattern
Mining
GERM
Evolution rules can be extracted from the network history and used to identify/predict recurrent patterns
Berlingerio, Michele, et al. "Mining graph evolution rules." joint European conference on machine learning and knowledge discovery in databases (2009).
Supervised Link Prediction
Network Embedding
Idea
Graphs can be mapped into vector spaces
NB: Different “mappings” facilitate the solution of different classes of problems
Supervised Link Prediction
Limits
Embedding is not The Answer, �only a different way to reason on graphs...
Results: Higher performances w.r.t. unsupervised approaches
Given a predictor p is there a way to decide if it is a “good” one?
First Step:
verify that p outperforms the random predictor. �
Random Predictor
each edge has the same probability to
appear in the network
if ratio > 1 then p is meaningful
Evaluation
Evaluation: Comparing Predictors
We need to analyze either the performances ratio, ROC and/or Precision Recall curve.
Precision Vs. Recall
ROC (Receiver operating characteristic)
Note:
Evaluation: ROC and PR curve
Accuracy could be improved extending simple models with more complex (even semantic) informations:
Link Prediction needs to be revised while in some scenarios:
Link Prediction
Something more...
Key Messages
Predict new links that will arise in a network is not easy:
Case Study:
Interaction Prediction in Dynamic Networks �
Case Study�Interaction Prediction
Link Prediction goal:
Predict ties that are not present in actual network configuration.
Ties are persistent structures that once appeared cannot disappear (i.e., friendship...)
Interaction Prediction goal:
Predict interactions that will occur (either for the first time or not) among nodes already observed in the network.
Interactions are volatile structures that can occur multiple times and whose value can vanish as time goes by (i.e., telephone calls...)
Rossetti et al. "Interaction prediction in dynamic networks exploiting community discovery." IEEE/ACM ASONAM, 2015.
Case Study�Interaction Prediction
Given a set G = {G0 , . . . Gt , . . . Gτ } of ordered network observations, with t ∈ T = {0...τ}, the interaction prediction problem aims to predict new interactions that will took place at time τ + 1 thus composing Gτ+1.
Idea:
Case Study�Interaction Prediction
Step 1: �For each temporal snapshot t ∈ T compute a partition �Ct = {Ct,0, . . . , Ct,k} of Gt using a community discovery algorithm.
Step 2: �For each t ∈ T compute a set of measures F for each nodes pair (u,v) belonging to at least a community in Ct
Step 3: �For each node pair (u, v) and feature f ∈ F build a time series Su,v and apply a forecasting techniques in order to obtain its future expected value fu,v
Step 4: �Use the set of expected values fu,v to predict future intra-community interactions.
Case Study�Interaction Prediction
Step 1: Community Discovery (CD)
Each CD algorithm proposes its own Community Definition.
Case Study�Interaction Prediction
Step 2: Features
On the identified communities we compute three set of features:
Case Study�Interaction Prediction
Step 3: Time Series Forecast
For each time series we apply several forecasting model in order to extract the expected future value.
Case Study�Interaction Prediction
Step 4: Classification
Once learned the features we design two different experiments:
Case Study�Interaction Prediction: Balanced Scenario
Very high accuracy and AUC
CD approaches contribution to IP is topology sensitive
Case Study�Interaction Prediction: Balanced Scenario (cont’d)
Which feature set is the most predictive?
False Positive Filtering (FSF) �vs.�No Filtering (SF)
All Forecast with Filtering �vs. �No Filtering
Case Study�Interaction Prediction: Unbalanced Scenario
Negative class:
Very hard baselines
Case Study�Interaction Prediction: What about weak links?
High accuracy is guaranteed by focusing the prediction on intra-community interactions.
Inter-Community Interaction Prediction
Focus on the predicting the presence/absence of at least a new interaction across two communities
Idea
Case Study�Interaction Prediction
Even though Interaction prediction is a complex problem it is possible to reach high accuracy through:
Moreover, each type of datasets demands a specific CD algorithm:
Chapter 10 �Conclusion
Take Away Messages
What’s Next
Chapter 12:�Dynamic Community Discovery��
Suggested Readings