SIEVA
SIEM Visibility assessment
Nil Ortiz - Senior R&D Cybersecurity Engineer
Albert Calvo - Research engineer | PhD Candidate
TNC23 — SIG-CISS meeting
5/06/2023
Who we are
Research engineer | PhD Candidate
Trust-aware systems for cybersecurity and utilities domain
Msc. in Artificial Intelligence
albert.calvo@i2cat.net LinkedIn/in/albertcalvo/
Senior R&D Cybersecurity engineer
Incident response and threat intelligence analysis
Msc. in Cybersecurity
nil.ortiz@i2cat.net LinkedIn/in/nilortiz/
2
Nil Ortiz
Albert Calvo
i2CAT in �a nutshell
Never stop designing the digital future
3
i2CAT is a research center focused on mission driven projects to tackle the challenge of designing the digital society of the future based on research and innovation in advanced digital technologies.
Who we are
i2CAT in �a nutshell
Never stop designing the digital future
Research and Innovation areas
SMART NETWORKS AND SERVICES�6G, 5G, IoT
IMMERSIVE & �INTERACTIVE �TECHNOLOGIES
CYBERSECURITY �& BLOCKCHAIN
DISTRIBUTED ARTIFICIAL INTELLIGENCE
ARTIFICIAL INTELLIGENCE
DRIVEN SYSTEMS
SPACE COMMUNICATIONS
DIGITAL SOCIAL�TECHNOLOGIES
eHEALTH
CCAM
INDUSTRY 4.0
SMART CITIES
AGRICULTURE
CONNECTED, COOPERATIVE AND AUTOMATED MOBILITY
PUBLIC STRATEGIES �& POLICIES
i2CAT in �a nutshell
Never stop designing the digital future
Some of our initiatives
5
DetectUEBA (Threat-Centric ML for Detection capabilities)
openUEBA (User-Centric ML for prevention capabilities)
SIEVA
Limited disclosure, restricted to participants’ organizations.
i2CAT in �a nutshell
Never stop designing the digital future
SIEVA
Q&A
i2CAT in �a nutshell
Never stop designing the digital future
Context
7
i2CAT in �a nutshell
Never stop designing the digital future
Problem
8
i2CAT in �a nutshell
Never stop designing the digital future
Solution
9
i2CAT in �a nutshell
Never stop designing the digital future
MITRE ATT&CK Framework
10
The MITRE ATTACK Framework is a curated knowledge base that tracks cyber adversary tactics and techniques used by threat actors across the entire attack lifecycle.
i2CAT in �a nutshell
Never stop designing the digital future
Architecture
11
i2CAT in �a nutshell
Never stop designing the digital future
Architecture
12
i2CAT in �a nutshell
Never stop designing the digital future
AI Engine - How does it work?
The AI engine is a three-step procedure:
(*) A Dataset containing logs from different vendors is defined during step 1 and 2
13
Classification Step
Identification Step
Visibility Heuristic
Train Dataset
Validation Dataset
AI Engine
i2CAT in �a nutshell
Never stop designing the digital future
AI Engine - How does it work?
Classification Step
- A classification model is used to classify the logs according to the Data Sources defined in Mitre ATT&CK
- We manually build a dataset using logs samples from different vendors.
14
i2CAT in �a nutshell
Never stop designing the digital future
AI Engine - How does it work?
Identification Step
- Named-entity recognition (NER) techniques seeks to locate and classify the different entities in text into pre-defined categories.
- Text files → SIEM Raw Logs
- Categories → IP, Domain, host ..
Common NER pretrained model
Custom NER model
15
i2CAT in �a nutshell
Never stop designing the digital future
AI Engine - How does it work?
Identification Step (Custom NER training)
- Self-made dataset where we manually label the entities in the logs (using custom Regex for each vendor and category)
- A custom NER model is built using the spaCy library
16
Labeling
Train
i2CAT in �a nutshell
Never stop designing the digital future
AI Engine - How does it work?
Visibility Heuristic
(Partial visibility → !0% predicted accuracy per class)
(Visibility → !0% predicted accuracy and > n categories defined)
17
Visibility
URL and Domain Categories
i2CAT in �a nutshell
Never stop designing the digital future
How does it look?
18
i2CAT in �a nutshell
Never stop designing the digital future
Integration into SOCTools
19
i2CAT in �a nutshell
Never stop designing the digital future
Summarising
How can SIEVA help you?
Reduce efforts in data engineering
Identify gaps on your visibility
Improve your data management & threat
monitoring strategies
20
Open Source available
License AGPLv3
@ i2CAT’s Github SIEVA repository
https://github.com/Fundacio-i2CAT/SIEVA
i2CAT in �a nutshell
Never stop designing the digital future
Planned next steps
Next version :: Planned start Sept/23
SIG-CISS Requirements
-
-
-
-
-
-
-
21
i2CAT in �a nutshell
Never stop designing the digital future
Open Source available
@ i2CAT’s Github SIEVA repository
https://github.com/Fundacio-i2CAT/SIEVA
nil.ortiz@i2cat.net albert.calvo@i2cat.net
LinkedIn/in/nilortiz LinkedIn/in/albertcalvo
Q&A
i2CAT in �a nutshell
Never stop designing the digital future