Redivis: A Scalable Web Platform for Business Research
Alex Storer�Ian Mathews �Erin Delaney
Stanford GSB – Director of Data, Analytics & Research Computing�Redivis – CEO �Redivis – Head of Design
PEARC 2024
Doesn’t Business School Just Do MBAs?
Research is a key facet of Stanford’s Graduate School of Business
Academic Departments
PEARC 2024
What distinguishes Business Data?
Licensed Data
Diverse Data
Private Data
PEARC 2024
What distinguishes Business Data?
Licensed Data
Diverse Data
Private Data
“To examine rent control’s effects on tenant migration and neighborhood choices, we make use of new panel data which provide address-level migration decisions and housing characteristics for the majority of adults living in San Francisco in the early 1990s.”
Diamond, Rebecca, Tim McQuade, and Franklin Qian. 2019. "The Effects of Rent Control Expansion on Tenants, Landlords, and Inequality: Evidence from San Francisco." American Economic Review, 109 (9): 3365–94.
DOI: 10.1257/aer.20181289
PEARC 2024
A recurring problem
“To examine rent control’s effects on tenant migration and neighborhood choices, we make use of new panel data which provide address-level migration decisions and housing characteristics for the majority of adults living in San Francisco in the early 1990s.”
PEARC 2024
What about HPC?
My experience: This is hard!
PEARC 2024
What about the Cloud?
My experience: This is hard!
PEARC 2024
What would we like?
PEARC 2024
What would we like?
These are key features of Redivis!
PEARC 2024
Introducing Redivis
PEARC 2024
Redivis by Example – Data Discovery
PEARC 2024
Redivis by Example – Dataset Access
URL traffic for a representative panel of users, dozens of TB of data
PEARC 2024
Redivis by Example – Table Exploration
Easily scroll through a single 2.5TB table
- Quick Summary Statistics
- SQL Queries on this table
- Integrated Data Dictionary
– 11.5 billion web site visits from the year 2020
PEARC 2024
Redivis by Example – Data Analysis
Find the visits to CDC.gov in 2020 and investigate the relationship to Household Income
- Beginner Friendly Interface
- Compiles to Standard SQL
- 27 seconds to execute!
PEARC 2024
Redivis by Example – SQL Queries
Write SQL Queries Directly
- Within a project
- Using the API
PEARC 2024
Redivis by Example – Notebook Interface
Notebook Interface
- Pick your VM size
- Easy integration with Queries
- Same network controls as source data
- Project members can collaborate
- Code history is saved automatically
PEARC 2024
Redivis by Example – Project Interface
Shareable projects show:
- included data and tables
- transforms and notebooks in a directed graph
PEARC 2024
Redivis by Example – Administrator View
Easy to read view of who did what on Redivis
PEARC 2024
Conclusion
8 years of Iterative Development → 10 minute Presentation 😤
Stanford GSB Data, Analytics, and Research Computing: https://gsbresearchhub.stanford.edu/
Key Redivis Features – More at redivis.com
PEARC 2024