Speaker Introduction
Rajneesh Tiwari
Co-founder : Bulian AI, CueNex
CueNex
Bulian AI
Low-code, API-first Synthetic Data
Decisioning made easy!
Mars Spectrometry: Gas Chromatography�
8th place solution – Rajneesh Tiwari
Did Mars ever have environmental conditions that could have supported life?
Data Generation: �Gas Chromatography Mass-Spectrometry
Data Sample
Example of a mass spectrum. This mass spectrum shows a large peak at m/z=147.0 and smaller peaks at 73.0, 233.0 and 40.0. Plotted data is for sample S0002 taken at time=9.9513.
Example visualization for sample S0002 showing intensities over time for all ions by mass, as a chromatogram. Each m/z is plotted as a separate time series, with m/z values of 147, 73, 233, 40, and 44 highlighted. In contrast to the previous mass spectrum showing a time snapshot, we can see that these different masses peak at different times in the analysis run
Construct Mean/Max Intensity Spectrogram (scale: Time X Mass)
Feature Creation: Spectrogram like images and tabular features
Clean up
Mean/max intensity by time & rounded mass
Aggregate
Minimum abundance subtracted for all observations
Remove b/g intensity
Min-max scale intensities (0-1)
Min-max Scale
Divide time in 0.25/0.5 sec buckets
Construct time-buckets
Spectrograms
Tabular Features
Model Pipeline – Vision Models
Spectrograms
K-folds
convnext_base_in22ft1k
convnext_tiny in22ft1k
maxvit_tiny_rw_224
CAIT-s24-224
coatnet_1_rw_224
coatnet_0_rw_224
vit_small_r26_s32_224
vit_small_patch32_224
volo_d1_224
nf_regnet_b0
densenet121
eca_nfnet_l0
eca_ecaresnet50t
Nf_regnet_b1
resnet50d
seresnext50d
2D Backbones
LSTM
1D LSTM
Concat
Multi-label predictions
2D Backbones
Model Pipeline: Tabular Models
Tabular Features
K-folds
Catboost
XGBOOST
LogReg
Random Forest
LightGBM
Lasso
Multi-label predictions
Multi-label tabular models
Greedy Hill-Climbing Ensemble Weight Optimization
Catboost
XGBOOST
LogReg
Random Forest
LightGBM
Lasso
convnext_base_in22ft1k
convnext_tiny in22ft1k
maxvit_tiny_rw_224
CAIT-s24-224
coatnet_1_rw_224
coatnet_0_rw_224
vit_small_r26_s32_224
vit_small_patch32_224
volo_d1_224
nf_regnet_b0
densenet121
eca_nfnet_l0
eca_ecaresnet50t
Nf_regnet_b1
resnet50d
seresnext50d
Greedy Hill Climbing via Optuna
Optimized Weights
(OOF)
Ensemble Predictions (Preds)
8th place (Pvt LB)
2D Backbone + LSTM
Tabular Models
Thanks ☺