Towards AI-driven User Support for CNAF Tier-1
M. Barbetti, A. Trashaj, E. Ronchieri, C. Pellegrino, and the contribution of many others
CNAF seminar | 21 November 2024
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
THE EVOLUTION OF
ARTIFICIAL INTELLIGENCE
A sensationalistic overview of the history of Artificial Intelligence
.1.0.
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Can you find the (AI)ntruder?
3
3
Credits: Jorg Karg
Credits: Boris Eldagsen
Credits: Aleksei Andreev
Credits: Evgeny Lutsko
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Can you find the (AI)ntruder?
4
Credits: Jorg Karg
Credits: Aleksei Andreev
Credits: Evgeny Lutsko
Credits: Boris Eldagsen
(by using DALL-E 2)
PSEUDOMNESIA: The Electrician, 2023
WINNER
of the creative open category at the
Sony World Photography Awards
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
History of Artificial Intelligence
5
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
History of Artificial Intelligence
6
First model of an
Artificial Neural Network
by W. McCulloch and W. Pitts
Alan Turing introduced
a test to answer the question
“Can machines think?”
1943
1950
The term Artificial Intelligence was coined for a thematic workshop hosted at Dartmouth
1956
Geoffrey Hinton coined the term
Deep Learning to refer to
deep ANN-powered algorithms
2006
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
History of Artificial Intelligence
7
The modern technique for training MLPs, known as backpropagation, was first published by S. Linnainmaa
P. Werbos and D.E. Rumelhart independently developed and applied backpropagation to MLPs in the way that has become standard
1970
1982
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
History of Artificial Intelligence
8
The birth of the
World Wide Web (WWW)
by Tim Berners-Lee
The launch of Google
by Larry Page and Sergey Brin
1989
1998
The launch of Facebook
by Mark Zuckerberg
The launch of
Amazon Web Service (AWS)
The launch of
Google Cloud Platform (GCP)
2004
2002
2008
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
History of Artificial Intelligence
9
The foundation of NVIDIA by Jensen Huang
The release of Torch, the first open-source machine learning library
The release of CUDA, software that enables parallel computing on NVIDIA GPUs
The publication of Minecraft by Mojang Studios
The release of several deep learning libraries, like Caffe (2013), Keras (2015), TensorFlow (2015), PyTorch (2016)
1993
2011
2013 - 2016
2002
2007
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
History of Artificial Intelligence
10
The defeat of Garry Kasparov by IBM’s Deep Blue, the first computer to win against a world chess champion
The defeat of Lee Sedol by DeepMind’s AlphaGo, the first computer to win against a 9-dan professional Go player
1997
2016
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
History of Artificial Intelligence
11
The first ImageNet Large Scale Visual Recognition Challenge (ILSVRC), an annual contest on image classification and detection via software programs
The victory of the ImageNet 2012 Challenge by a CNN-based model called AlexNet
The invention of Generative Adversarial Nets by Ian Goodfellow and colleagues
The victory of the ImageNet 2015 Challenge by a novel architecture introduced with ResNet
The launch of
DALL-E by OpenAI
2010
2012
2014
2015
2021
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
History of Artificial Intelligence
12
The release of ELIZA by Joseph Weizenbaum, the computer program to engage conversations with humans
The introduction of the Transformer architecture with the paper “Attention is all you need”
The introduction of BERT by Google and GPT-1 by OpenAI, models for NLP problems
The launch of several chatbot platforms, like ChatGPT by OpenAI (2022), Bard/Gemini by Google (2023), Claude by Anthropic (2023)
1966
2022 - 2023
2017
2018
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
The evolution of image generation
13
<PROMPT> A stylish woman walks down a Tokyo street filled with warm glowing neon and animated city signage. She wears a black leather jacket, a long red dress, and black boots, and carries a black purse. She wears sunglasses and red lipstick. She walks confidently and casually. The street is damp and reflective, creating a mirror effect of the colorful lights. Many pedestrians walk about. </PROMPT>
Credits:
DCGAN (arXiv:1511.06434), 2015
Sora (OpenAI text-to-video model), 2024
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Chatbot, chatbot everywhere!
14
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
EVOLVING THE CNAF USER SUPPORT
THROUGH AI-DRIVEN SOLUTIONS
How AI technologies can help in supporting Tier-1 users
.2.0.
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Scientific computing at CNAF
16
NON LHC
LHC
As INFN national center dedicated to R&D on Information and Communication Technologies, CNAF host the Italian Tier-1 data center since 2003, providing computing resources to a wide scientific community:
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Support for experiments at Tier-1
17
User
User Support
Infrastructure
Storage
Network
AAI
Security
Farming
Cloud
+
The Italian Tier-1 ensures support for experiments and users through the dedicated User Support (US) unit:
Three levels of support:
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Support for experiments at Tier-1
18
User
User Support
Infrastructure
Storage
Network
AAI
Security
Farming
Cloud
+
The Italian Tier-1 ensures support for experiments and users through the dedicated User Support (US) unit:
Three levels of support:
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Support for experiments at Tier-1
19
User Support
Infrastructure
Storage
Network
AAI
Security
Farming
Cloud
+
User
The Italian Tier-1 ensures support for experiments and users through the dedicated User Support (US) unit:
Three levels of support:
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Support for experiments at Tier-1
20
User Support
Infrastructure
Storage
Network
AAI
Security
Farming
Cloud
+
Software
developer
User
The Italian Tier-1 ensures support for experiments and users through the dedicated User Support (US) unit:
Three levels of support:
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Evolving the User Support using Artificial Intelligence
21
User Support
Infrastructure
Storage
Network
AAI
Security
Farming
Cloud
+
In the context of NRRP and thanks to the action of the ICSC and TeRABIT projects, the Italian Tier-1 will evolve significantly in the next few years:
Employing AI models to build a Digital User Supporter able to help in supporting a growing number of new users and in adopting the latest software technologies
IDEA
User
GenAI
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Prototyping a Digital User Supporter
22
User request
User Support
Support of 1st/2nd/3rd level?
Which units are involved?
ML multi-label classifier
The answer is available in the Tier-1 User Guide. Can we automate it?
GenAI-powered digital agent
Automated identification of involved units using the first-contact email
Automated answer to the user request relying on the acquired knowledge base
The process of developing an AI-powered digital agent for managing User Support duties has been organized into two parallel and complementary sub-projects:
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
2.1 Multi-label classification
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Steps for building an email classifier
24
2
1
Preparation of the dataset of emails for training
Mapping of the textual features into a numerical representation (embedding)
Classifier training using differente ML algorithms
Benchmark of the classification performance and results combination (ensemble)
3
4
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
A quick look at the data sample
25
~30k emails processed in 6 years
Extensive interaction between users, US, and specialized units
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Preparation of the training dataset
26
Data collection
Anonymization
Labelling
Data cleaning
The sample contains ~30k emails received/sent in the period 06/2017 – 05/2023
Each has been saved as an individual JSON file (~260 MB)
No attachments, and (almost) no HTML content
Each file includes:
All the email addresses have been anonymized by replacing them with UUIDs
Any references to real individuals (e.g., names or usernames) found in the email subject/body have been anonymized and replaced with the placeholder [NAME]
The anonymization process was carried out using automated scripts or by editing manually the email metadata
Each email is assigned a label based on the text contained in its subject/body
The email thread between a user and the various Tier-1 units involved is reconstructed using the information in the parent field
All labels associated to the same thread are propagated to the first contact email as a part of the labelling process
The email body may contain escape sequences (e.g., \n, \t) o HTML tags
The emails are multilingual (English/Italian) and may include special characters (e.g., accented letters, emojis)
The data cleaning process transforms the text to retain only words, placeholder, and punctuation
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
The prepared training set
27
Hello,
How can I use Jupyter Notebooks at CNAF?
Thank you,
[NAME]
[user-support]
[farming]
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
How to represent the textual features
28
Representation for “👩🏻”
(x1, y1)
(x2, y2)
(xR, yR)
Representation for “👑”
Representation for “👸🏻”
+
=
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
How emails look like in the representation space
29
paraphrase-multilingual-mpnet-base-v2
paraphrase-multilingual-MiniLM-L12-v2
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Performance of the trained classifiers
30
Different ML models were trained to perform multi-label classification
Various embedding models were used in combination with these classification models
Different preprocessing strategies → 4 (emb) x 4 (prep) x 4 (clf) = 64 models in total
By combining the outputs of the most promising models using an ensemble approach, the best performance was achieved: ~95% of ROC AUC score on the test set
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
2.2 Digital User Supporter
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
The problem of artificial hallucinations
32
One of the major challenges in working with AI chatbots and Computer Vision tools is the issue of artificial hallucinations
A potential solution to limit the “creativity” of Large Language Models (LLMs) is to integrate them into an architecture designed to retrieve relevant information and compose answers based on that data
The Retrieval-Augmented Generation (RAG) architecture provides a semantic-aware pipeline that retrieves information from a knowledge base and generates text grounded in the retrieved data
The first prototype of the AI-powered Digital User Supporter has been developed upon a RAG model, implemented with LangChain and provisioning the LLM relying on ollama
PROTOTYPE
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
How to build a Digital User Supporter
33
Vector database
Knowledge base
Embedding
User query
Similarity search
Embedding
Prompt assembly
Large
Language Models
AI-based answer
Retrieval-Augmented Generation (RAG)
Credits: Alberto Trashaj
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
How to build a Digital User Supporter
34
Vector database
Knowledge base
Embedding
Similarity search
Embedding
Prompt assembly
Large
Language Models
AI-based answer
Retrieval-Augmented Generation (RAG)
User query
Hello,
How can I use Jupyter Notebooks at CNAF?
Thank you,
[NAME]
User ask a question to User Support
of Tier-1
Credits: Alberto Trashaj
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
How to build a Digital User Supporter
35
Vector database
Embedding
User query
Similarity search
Embedding
Prompt assembly
Large
Language Models
AI-based answer
Retrieval-Augmented Generation (RAG)
Knowledge base
Credits: Alberto Trashaj
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
How to build a Digital User Supporter
36
Vector database
Knowledge base
User query
Similarity search
Embedding
Prompt assembly
Large
Language Models
AI-based answer
Retrieval-Augmented Generation (RAG)
Embedding
The knowledge base (Tier-1 User Guide) is mapped to a vector space through an embedding model, such as all-mpnet-base-v2
Vector space
Credits: Alberto Trashaj
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
How to build a Digital User Supporter
37
Knowledge base
Embedding
User query
Similarity search
Embedding
Prompt assembly
Large
Language Models
AI-based answer
Retrieval-Augmented Generation (RAG)
Vector database
To store the embedding results, a database optimized for handling large vectors and their metadata (e.g., source text) is required. It should also support efficient searches within the vector space (features like these are at the core of Chroma)
Credits: Alberto Trashaj
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
How to build a Digital User Supporter
38
Vector database
Knowledge base
Embedding
User query
Similarity search
Prompt assembly
Large
Language Models
AI-based answer
Retrieval-Augmented Generation (RAG)
Embedding
As well as the knowledge base, also the user request is mapped to a vector space relying on the same embedding model (all-mpnet-base-v2)
Vector space
Credits: Alberto Trashaj
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
How to build a Digital User Supporter
39
Vector database
Knowledge base
Embedding
User query
Embedding
Prompt assembly
Large
Language Models
AI-based answer
Retrieval-Augmented Generation (RAG)
Similarity search
To retrieve relevant information from the guide to respond to the user request, we can compare the embeddings from both cases based on similarity (e.g., dot product)
Once the guide embeddings that best match the request embedding are identified, the vector database can be queried to retrieve the corresponding metadata
✓
✗
Credits: Alberto Trashaj
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
How to build a Digital User Supporter
40
Vector database
Knowledge base
Embedding
User query
Similarity search
Embedding
Large
Language Models
AI-based answer
Retrieval-Augmented Generation (RAG)
Prompt assembly
The retrieved text segments are used to tailor the template for the textual prompt submitted to the Language Model
Answer the user’s questions based on the below context as you were answering to an email in a professional style.
<context>
{context}
</context>
Credits: Alberto Trashaj
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
How to build a Digital User Supporter
41
Vector database
Knowledge base
Embedding
User query
Similarity search
Embedding
Prompt assembly
AI-based answer
Retrieval-Augmented Generation (RAG)
Large
Language Models
The generated prompt can then be submitted to a Large Language Model (LLM), which, even if not an “expert” in a specific subject, is capable of extracting the “relevant” content from {context} and rephrasing it
Since fine-tuning is not required, LLMs are often described as general-purpose technologies and are thus also referred to as Foundation Models
Credits: Alberto Trashaj
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
How to build a Digital User Supporter
42
Vector database
Knowledge base
Embedding
User query
Similarity search
Embedding
Prompt assembly
Large
Language Models
Retrieval-Augmented Generation (RAG)
AI-based answer
Dear User,
To answer your question directly, yes, you can use Jupyter notebooks through the JupyterHub service at Tier-1. The service is reachable via browser at the following page: <https://jupyterhub-t1.cr.cnaf.infn.it/>. Please note that you will need to have an account and be logged in to access the notebooks.
If you have any further questions or concerns, please feel free to ask.
Best regards,
[Your Name]
Note: automated answer generated with Llama 2
Credits: Alberto Trashaj
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
MOVING TO THE CONCLUSIONS
Summary, conclusions, and final remarks
.3.0.
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Here are my conclusions!
44
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing
Thanks!
Any questions or comments?
Matteo Barbetti (INFN CNAF)
email: matteo.barbetti@cnaf.infn.it
Missione 4 • Istruzione e Ricerca
ICSC Italian Research Center on High-Performance Computing, Big Data and Quantum Computing