SpotWeb: Running Latency-sensitive Distributed Web Services on Transient Cloud Servers
Ahmed Ali-Eldin, Jonathan Westin, Bin Wang, Prateek Sharma*, Prashant Shenoy UMass Amherst, *Indiana University
Introduction
Data Center
Request Server
Provide Server
(fixed rate)
On-demand Server
Introduction
Data Center
Request Server
Provide Server
(fixed rate)
On-demand Server
Introduction
Data Center
Request Server
Provide Server
(fixed rate)
Transient Server
On-demand Server
Transient Server
Latency-sensitive Applications
Spotweb’s Objective
Spotweb’s Approach
SpotWeb System Architecture
SpotWeb System Architecture
MPO Optimizer
SpotWeb System Architecture
MPO Optimizer
SpotWeb Load-Balancer
SpotWeb Load-Balancer
Transiency-aware Load-balancer
SpotWeb Load-Balancer
SpotWeb System Architecture
MPO Optimizer
SpotWeb System Architecture
SpotWeb System Architecture
SpotWeb System Architecture
MPO Optimizer
SpotWeb System Architecture
MPO Optimizer
SpotWeb System Architecture
MPO Optimizer
SpotWeb System Architecture
MPO Optimizer
SpotWeb System Architecture
MPO Optimizer
Server Portfolio?
Portfolio A
stock A
stock B
The Key Insight
Choosing an uncorrelated mix of assets can insulate the overall portfolio
if stock A experiences large declines,
it cannot affect stock B
Server Portfolio?
Portfolio
A x 2
Server A)
10 req/s
2 c/hour
Server B)
100 req/s
15 c/hour
1. → 25 req/s
Portfolio
A x 3
6 c/hour
Server Portfolio?
Portfolio
A x 2
Server A)
10 req/s
2 c/hour
Server B)
100 req/s
15 c/hour
2. → 115 req/s
Portfolio
A x 2
B x 1
19 c/hour
Optimization Method - Single Point portfolio Optimization (SPO)
Optimization Method - Single Point portfolio Optimization (SPO)
Portfolio A
High Expected Returns
High Risk
Cost : 50$
Portfolio B
Low Expected Returns
Low Risk
Cost : 50$
Optimization Method - Single Point portfolio Optimization (SPO)
Portfolio A
High Expected Returns
High Risk
Cost : 50$
Portfolio B
Low Expected Returns
Low Risk
Cost : 50$
Optimization Method - Multi-period Portfolio Optimization (MPO)
Optimization Method - Multi-period Portfolio Optimization (MPO)
t
t
H
Optimization Method - Multi-period Portfolio Optimization (MPO)
Optimization Method - in the SpotWeb
Optimization Method - in the SpotWeb
= (fraction of predicted workload) x (cost of service per request)
= (predicted cost of service per ‘total request’)
Optimization Method - in the SpotWeb
= Risk cost for penalty in case of interrupt
Optimization Method - in the SpotWeb
= Risk cost of server-to-server impact in portfolio
Experimental Evaluation
Transiency-aware load balancing Efficacy
Transiency-aware load balancing Efficacy
Transiency-aware load balancing Efficacy
Intelligent over-provisioning
Intelligent over-provisioning
Benefits of Price-awareness
Benefits of Price-awareness
Exploiting Future Workload Knowledge
Impact of Prediction Accuracy
Scalability of SpotWeb
Discussion
Conclusion
Q&A