X International conference�“Information Technology and Implementation” (IT&I-2023)�Kyiv, Ukraine
Vadym Pakholchuk
Ph. D. in Finance
Tetiana Zatonatska
doctor of economic sciences, professor
Alim Syzov
Candidate of economic science, associate professor
Daryna Vorontsova
student
1
Dedicated to the tenth anniversary of the Faculty of Information Technology
IMPLEMENTATION OF DATA SCIENCE METHODS IN ARMED FORCES BUDGETING: CHALLENGES AND OPPORTUNITIES�
Information Technology and Implementation, November 20, 2023, Taras Shevchenko National University of Kyiv, Kyiv, Ukraine
Data science in modern budgeting
Domain
Computer Science
Math
Public Finance
Defense Finance
What areas of Data Science should be mentioned?
Clustering
Sentiment
Forecasting
Classification
NLP
Data collection and analysis
Cascade modeling
Base model assumption
SMA
Times Series Models
ML Models
Simple Rolling mean forecast
Exponential smoothing
Double exponential smoothing
Triple exponential smoothing
Classic TS SARIMA models
ARIMA (p, d, q) x (P, D, Q) S =
ARIMA (0, 0, 0) x (3, 1, 2)12
Meta Prophet as a SOTA model in time series forecasting
This model includes g(t) - a non-linear trend model that allows for considering trends in the context of forecast horizons, s(t) - seasonality, and h(t) - holidays and important events, as well as an error term e
Good old linear regression
Boosting is an old new fashion
Explainability always matter
SHAP-values for linear regression
SHAP-values for boosting
Building the best mix of models
Metrics | Holt-Winter | SARIMA | Prophet | Regression | XGBoost |
MAE | 606972 | 1682813 | 812210 | 595491 | 567103 |
MAPE | 8.42% | 24.50% | 12.49% | 7.81% | 7.16% |
The Problem with High-Scale Forecasting
Source: https://cloud.google.com/bigquery/docs/reference/standard-sql/bigqueryml-syntax-create-time-series
Final thoughts and further �research ideas