Expert Users:
A Hybrid Approach to Clickstream Analytics
Elizabeth Haubert
OpenSource Connections
April 10, 2018
Outline
Test what?
UI
API
Data
RESULTS
QUERIES
Implicit Feedback
Query Features | Session Features | User Features |
Laboratory Benchmarks
The Cranfield (TREC) Model
The Philosophy of Information Retrieval Evaluation (2002) Ellen Voorhees.
https://www.nist.gov/publications/philosophy-information-retrieval-evaluation
Evaluation Metrics
Without Judgements
With Judgements
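The slide does not name specific metrics. As a minimal sketch of what "with judgements" makes possible, the following computes two widely used judgement-based metrics, precision@k and nDCG@k. The function names, the sample grades, and the choice to treat unjudged documents as grade 0 are all assumptions made for illustration.

```python
import math

def precision_at_k(ranked_doc_ids, judgements, k):
    """Fraction of the top-k results with a positive grade.
    Unjudged documents count as grade 0 (an assumption)."""
    top_k = ranked_doc_ids[:k]
    return sum(1 for d in top_k if judgements.get(d, 0) > 0) / k

def dcg(grades):
    """Discounted cumulative gain of a list of grades in rank order."""
    return sum(g / math.log2(rank + 1) for rank, g in enumerate(grades, start=1))

def ndcg_at_k(ranked_doc_ids, judgements, k):
    """DCG of the actual ranking divided by the DCG of the ideal ranking."""
    actual = [judgements.get(d, 0) for d in ranked_doc_ids[:k]]
    ideal = sorted(judgements.values(), reverse=True)[:k]
    ideal_dcg = dcg(ideal)
    return dcg(actual) / ideal_dcg if ideal_dcg > 0 else 0.0

# Hypothetical judgements for one query: doc id -> grade.
judgements = {"doc1": 2, "doc2": 0, "doc3": 1}
ranking = ["doc2", "doc1", "doc3", "doc4"]

p2 = precision_at_k(ranking, judgements, 2)
n3 = ndcg_at_k(ranking, judgements, 3)
print(p2, round(n3, 3))
```

Without judgements, neither function can be evaluated, so only judgement-free signals (for example, result counts or latency) remain.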
Laboratory to Practice
How Many?
1,000,000 docs / 50 Topics
= 20,000 docs / Topic
Rank: top 100 / 20,000
= 0.5% of the docs in each topic
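The arithmetic above can be checked with a few lines of Python. The variable names are illustrative, and the sketch assumes, as the slide's numbers suggest, that only the top 100 ranked documents per topic are examined.

```python
total_docs = 1_000_000   # collection size from the slide
topics = 50              # number of topics from the slide
rank_depth = 100         # top-ranked docs examined per topic

docs_per_topic = total_docs // topics
examined_fraction = rank_depth / docs_per_topic

print(f"{docs_per_topic:,} docs per topic")
print(f"{examined_fraction:.1%} of each topic's docs")
```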
Queries Instead of Topics
Dataset | Queries | Docs | Judgement Scale | Year |
LETOR 3 (TREC-GOV) | 575 | 568 k | 2 | 2008 |
LETOR 3 (TREC-OHSUMED) | 106 | 16 k | 3 | 2008 |
LETOR 4 | 2,476 | 85 k | 3 | 2009 |
Yahoo! | 36,251 | 883 k | 5 | 2010 |
Microsoft | 31,531 | 3,771 k | 5 | 2010 |
LETOR: Benchmark Dataset for Research on Learning to Rank for Information Retrieval (2007)
https://pdfs.semanticscholar.org/dbcd/79bd7edcdcbb5912a50796fc3c2746729eb5.pdf
Laboratory to Practice
A Note on Sampling
Laboratory to Practice
TREC Interactive Track History
Human Judgements
When you must:
Collecting Judgements By Survey
Were these results helpful?
Please rate this document:
Inferring Judgements
Query Chaining
Doc1: { "Title": "Caring for cats",
        "Body": "Feed cats. Take videos." }
Doc2: { "Title": "Why CAT videos are funny",
        "Body": "Because they are goofy" }
Doc3: { "Title": "Mouser videos",
        "Body": "A mouser caught a rat." }
Doc4: { "Title": "Puss in boots",
        "Body": "Story about a cat." }
Click 1
Click 2
Score: -2, 0, -1, +1
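The slide does not state the rule that turns clicks into scores, so the following is only a sketch of one commonly used heuristic for inferring judgements from clicks: a clicked result is preferred over every unclicked result ranked above it. It covers a single query, not a full query chain. The function name and the choice of clicked documents are hypothetical, and the output is not meant to reproduce the scores shown on the slide.

```python
def skip_above_preferences(ranking, clicked):
    """Return (preferred, over) pairs: each clicked doc is preferred
    over every unclicked doc that was ranked above it."""
    clicked = set(clicked)
    prefs = []
    for pos, doc in enumerate(ranking):
        if doc in clicked:
            prefs.extend(
                (doc, above) for above in ranking[:pos] if above not in clicked
            )
    return prefs

ranking = ["Doc1", "Doc2", "Doc3", "Doc4"]  # result order for one query
clicked = ["Doc2", "Doc4"]                  # hypothetical clicks

prefs = skip_above_preferences(ranking, clicked)
for preferred, over in prefs:
    print(f"{preferred} > {over}")
```

Pairwise preferences like these say nothing about documents the user never saw, which is one reason inferred judgements tend to be sparse.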
Implicit Feedback
Query Features | Session Features | User Features |
Sanity Check
Sparsity
Summary
<query, document, judgement> tuples
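As a minimal sketch of the tuple format, the following stores judgements as named tuples and measures how sparse they are relative to all possible query-document pairs. The class name, field names, queries, and grades are hypothetical and chosen only for illustration.

```python
from typing import NamedTuple

class Judgement(NamedTuple):
    query: str
    doc_id: str
    grade: int  # higher means more relevant; the scale is an assumption

# Hypothetical judgements over 2 queries and 4 documents.
judgements = [
    Judgement("cat videos", "Doc1", 1),
    Judgement("cat videos", "Doc2", 2),
    Judgement("mouser", "Doc3", 2),
]

def coverage(judgements, n_queries, n_docs):
    """Fraction of all query-document pairs that have a judgement."""
    judged_pairs = {(j.query, j.doc_id) for j in judgements}
    return len(judged_pairs) / (n_queries * n_docs)

cov = coverage(judgements, n_queries=2, n_docs=4)
print(f"{cov:.1%} of query-document pairs are judged")
```

Counting distinct pairs rather than rows keeps the figure honest when the same pair is judged more than once.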