1 | |||||
|---|---|---|---|---|---|
2 | CSED/DATA 516 - Final Project Presentations | ||||
3 | December 5th 2017 | ||||
4 | |||||
5 | |||||
6 | Presentation # | Time | Title | Presenter | Link to slides |
7 | |||||
8 | 1 | 5:00 | Data Ingestion in Amazon Redshift & Runtime between GMM and K-Mean in Spark with various Partitions | Zicong Liang | |
9 | 2 | 5:10 | Parallel Processing Performance in Amazon Redshift/EMR and Apache Spark | Rex Thompson | |
10 | 3 | 5:20 | Analysis of Amazon Redshift Parallel Ingestion / Analysis of GMM Training Times Using Spark | Dane Jordan | |
11 | 4 | 5:30 | Analysis of Query Run-Times on Distributed Platforms:Redshift, Hive and Spark | Samir Patel | |
12 | 5 | 5:40 | Diana Chenyu Zhang | ||
13 | 6 | 5:50 | Data Ingestion on Amazon Redshift & Compare python Spark | Runlai Zeng | |
14 | 7 | 6:00 | SQLWorkbenchJ vs Pycharm | Jahnavi Jasti | |
15 | BREAK | ||||
16 | 8 | 6:20 | Exploration of Query Optimization in Redshift and SparkSQL | Deepa Agrawal | |
17 | 9 | 6:30 | Factors affect Data Ingestion and Query Execution on Amazon Redshift ; Apache Spark: RDD vs Dataframe | Angel Wang | |
18 | 10 | 6:40 | Comparing Performance of Data Ingestion with Redshift and Cluster Model Training with Spark | Erin Orbits | |
19 | 11 | 6:50 | Experiment with Redshift, scikit-learn and MLlib | Jingyan Yang | |
20 | 12 | 7:00 | Redshift Data Ingestion & Spark Data Processing | Haowen Ni | |
21 | 13 | 7:10 | Analysis of Parallel Data Ingestion on Amazon Redshift & Building A Recommendation System Using Elastic MapReduce and Spark | Mobing Zhuang | |
22 | 14 | 7:20 | Yawen Li | TBD | |
23 | BREAK | ||||
24 | 15 | 7:40 | Impact of Dist/Sort Keys in Redshit, GMM vs KMeans vs Bisecting KMeans in Spark | Rajiv Veeraraghavan | |
25 | 16 | 7:50 | Performance of different types of queries on Redshift & Comparison of GMM performance on MLlib and Scikit-learn | Becky Wang | |
26 | 17 | 8:00 | Scalable methods for recommending relevant products to users | Niharika Sharma | |
27 | 18 | 8:10 | Timing Tests with AWS | Gary Gregg | |
28 | 19 | 8:20 | Scaling Redshift clusters in production+Video Game Vizualization is Spark | Abhishek Varma | |
29 | 20 | 8:30 | Redshift, Spark and the Hive | Michael Grant | |
30 | 21 | 8:40 | Pyspark vs Python | Khyati Parekh | |
31 | END |