VIVA: An End-to-End System for Interactive Video Analytics
Francisco Romero, Johann Hauswald, Aditi Partap,
Daniel Kang, Matei Zaharia, Christos Kozyrakis
1
Growing demand for video analytics
2
“How many cars passed by on Monday?”
“Find clips of Jake Tapper interviewing an angry Bernie Sanders”
“Show me Ronaldo’s headers when he played for Juventus”
Example – analyzing TV news
3
“Find clips of Jake Tapper interviewing an angry Bernie Sanders”
Not obvious which is best!
Train specialized model to detect Tapper and Sanders
Detect frames with Tapper and Sanders
Detect “angry Sanders” frames
Detect angry faces in frames
Detect Sanders from angry face frames
Detect Tapper in frames with “angry Sanders”
Query result
…
Existing work focuses on specific components
4
System | Component | Query |
NoScope, Focus, … | Query optimizer/execution | Selection |
MIRIS | Query execution | Tracking |
BlazeIt | Query optimizer | Aggregation, limit |
TASTI | Index | Proxy-based |
Users today must manually combine these techniques/systems!
Video query challenges
Goal: enable videos to be searched like structured data
Challenges:
Expensive hardware: A100 is $4.10/hr
Slow: as low as 3 frames per second
5
Spark SQL
VIVA
Under active development!
6
VIVA
Heterogeneous Serverless Backends
SELECT time_window FROM news_analysis
WHERE tapper_angry_sanders = TRUE
Open research directions
7
VIVA
Query frontend
Relational hint explorer
Mixed-data optimizer
Accelerator-based execution engine
Structured table
Embedding cache
Serverless backends
Video file manager
Model
(re)-training service
Stop by my poster to learn more!