Measurement of Partisan Segregation of 180 million U.S. voters using advanced GIS Data Science
Project Overview
Objective: Measure partisan segregation at individual level for 180 Million U.S. voters
This work is partially sponsored by NSF Awards #1841403 and OmniSci
Challenges: Big geospatial data processing
| Traditional method | Challenges |
K-Nearest Neighbor search
|
|
|
Partisan exposure calculations |
|
|
[Images: Nature Human Behaviour]
Solution: Available Computing Resources
+
Solution: Two-Layered Approach for KNN Calculations
1. Geohash based Spatial Clustering
2. R-tree based Index Search
Solution: Accelerated GPU based processing of partisan exposure
180 Billion relations
180 million�voters
1000 neighbors
Solution: Novelty of our approach
KNN calculations @ 200,000 distances/sec
Partisan weights @ 800M distances/sec
Accelerated GPU processing
Big geospatial data using Data Science
Extremely fast I/O on big data
Cost and Time Effective
Results: Partisan exposure of individual US voters
[Images: Nature Human Behaviour]
Results: Publications and news coverage
References
[1] Brown J. & Enos R., The measurement of partisan sorting for 180 million voters, Nature Human Behavior, 2021 https://www.nature.com/articles/s41562-021-01066-z.epdf
[2] Badger B., Quealy K. & Katz. J, A Close-Up Picture of Partisan Segregation, Among 180 Million Voters, The New York Times, 2021 https://www.nytimes.com/interactive/2021/03/17/upshot/partisan-segregation-maps.html
[3] Kakkar D., Lewis B., Singh R., OmniSci Virtual Summit, 2020 https://www.youtube.com/watch?v=3DlOeWqDMSs
[4] Kakkar D., Lewis B., Scaling geospatial processes on Harvard’s high-performance cluster, Harvard DataFest, 2020 https://drive.google.com/file/d/1FEnh-okCNLuthtyQtoBldyid7D6Sb-F_/view?usp=sharing
[5] Introduction to Cluster Computing on FASRC: https://www.rc.fas.harvard.edu/wp-content/uploads/2019/12/Intro-to-Cannon.pdf
[6] About Postgis: https://postgis.net/
[7] OmniSci Overview: https://docs.omnisci.com/latest/4_distributed.html