NSDC Project Fall 2023: Galaxy Classification
Gianna Pedroza, Aidan Nguyen, Tishi Avvaru, Benjamin Yu
The Dataset: Galaxy Zoo
In the original Galaxy Zoo project, volunteers classified images of Sloan Digital Sky Survey galaxies as belonging to one of six categories - elliptical, clockwise spiral, anticlockwise spiral, edge-on , star/don't know, or merger.
The Dataset: Galaxy Zoo Table
The table gives classifications of galaxies with the fraction of the vote in each of the six categories is given flags identifying systems as classified as spiral, elliptical or uncertain.
The Galaxy Zoo project collected simple classifications of nearly 900,000 galaxies drawn from the Sloan Digital Sky Survey with classifications given by hundreds of thousands of volunteers.
Goals:
Cleaning the Data:
Visualization:
Model 1: K-Nearest Neighbors
Model 2: Logistic Regression
Model 3: Naive Bayes
Model 4: Neural Network
Comparisons:
| Model 1 | Model 2 | Model 3 | Model 4 |
Accuracy | 87% | 86% | 79% | 90.35% |
Precision | 80% 68% 95% | 81% 59% 96% | 79% 46% 93% | |