Module 6
Big Data Analytics Applications
Contents
Recommendation Systems: Introduction
Few common applications of recommendation systems
Product Recommendations | Movie Recommendations | News Articles |
|
|
|
Utility Matrix
Utility Matrix (Contd..)
Utility Matrix (Contd..)
Approach 1 | Approach 2 |
|
|
Long tail
Long tail (Contd..)
Content-Based systems*
*Not explicitly mentioned in syllabus, however there is no harm to study something beyond syllabus!
Content-Based systems
Content-Based systems
E.g.: Discovering Features of Documents
E.g.: Discovering Features of Documents
Collaborative-Filtering System�
Ways of measuring similarity
Alternative measures to consider�
Alternative measures to consider
Alternative measures to consider
Alternative measures to consider
Alternative measures to consider
Inferences that can be drawn:
Clustering Users and Items
Clustering Users and Items
Exercise: Figure is a utility matrix, representing the ratings, on a 1–5 star scale, of eight items, a through h, by three users A, B, and C. Compute the following from the data of this matrix
Q.1. Treat ratings of 3, 4, and 5 as 1 and 1, 2, and blank as 0. Compute the
Jaccard and cosine distance between each pair of users
| a | b | c | d | e | f | g | h |
A | 1 | 1 | | 1 | | | 1 | |
B | | 1 | 1 | 1 | | | | |
C | | | | 1 | | 1 | 1 | 1 |
Sr. No. | | Ans |
1 | Jaccard Similarity between A&B | 0.4 |
2 | Jaccard Distance between A&B | 0.6 |
3 | Jaccard Similarity between B&C | 0.1667 |
4 | Jaccard Distance between B&C | 0.8333 |
5 | Jaccard Similarity between A&C | 0.4 |
6 | Jaccard Distance between A&C | 0.6 |
7 | Cosine Distance between A&B | 0.5773 |
8 | Cosine Distance between B&C | 0.2886 |
9 | Cosine Distance between A&C | 0.5 |
Exercise: Figure is a utility matrix, representing the ratings, on a 1–5 star scale, of eight items, a through h, by three users A, B, and C. Compute the following from the data of this matrix
Q.2. Normalize the matrix by subtracting from each nonblank entry the average value for its user and compute the cosine distance between each pair of users
| a | b | c | d | e | f | g | h |
A | 0.67 | 1.67 | | 1.67 | -2.33 | | -0.33 | -1.33 |
B | | 0.67 | 1.67 | 0.67 | -1.33 | -0.33 | -1.33 | |
C | -1 | | -2 | 0 | | 1 | 2 | 0 |
Sr. No. | | Ans |
1 | Cosine Distance between A&B | 0.5841 |
2 | Cosine Distance between B&C | -0.739 |
3 | Cosine Distance between A&C | -0.1151 |
Mining Social-Network Graphs
Introduction
Social Networks as Graphs
Social Networks as Graphs
Types of Social-Network
Telephone Networks | Email Networks | Collaboration Networks |
|
|
|
Clustering of Social Graphs�
Clustering of Social Graphs using ‘Betweenness’�
Examples done as numerical*
Clustering of Social Graphs using Girvan-Newman Algorithm
Examples done as numerical*