FOUNDATIONS OF ARTIFICIAL INTELLIGENCE
Towards a working understanding of AI and its relationship to evaluation
AN OIL PAINTING BY MATISSE OF A HUMANOID ROBOT PLAYING CHESS
Generated by AI using DALL-E
Machine Learning�Systems
Data
Pattern detection
Extensible Model
At the core of AI:
How Machine Learning Systems are Trained
Machine learning systems may be supervised or unsupervised
Models can only be as strong as the data on which they are trained
The patterns these models detect and use are not always accessible to humans
Random Forests: �an example
https://datahacker.rs/012-machine-learning-introduction-to-random-forest/
Example ML System Types
Natural language processing
Neural networks/deep learning
Generative AI
ANGLE VIEW OF CIRCUIT SHAPED LIKE A BRAIN
Recommended by AI based on text in this slide
AI AS BOTH SUBJECT AND OBJECT IN EVALUATION
Potential uses for AI in an evaluation space
vs
Evaluating work produced by AI
AN ABSTRACT PAINTING OF ARTIFICIAL INTELLIGENCE
Generated by AI using DALL-E
Evaluation using AI
Non-generative AI
Generative AI
COMPUTER SCRIPT ON A SCREEN
Recommended by AI based on text in this slide
Evaluation of AI
Qualitative
Mixed Methods
Quantitative
Hsiao, J.H.-W.; Ngai, H.H.T.; Qiu, L.; Yang, Y.; Cao, C.C. Roadmap of Designing Cognitive Metrics for Explainable Artificial Intelligence (XAI). arXiv 2021, arXiv:2108.01737
Lin, Y.-S.; Lee, W.-C.; Berkay Celik, Z. What Do You See? Evaluation of Explainable Artificial Intelligence (XAI) Interpretability through Neural Backdoors. arXiv 2020, arXiv:2009.10639
Rosenfeld, A. Better Metrics for Evaluating Explainable Artificial Intelligence. In Proceedings of the 20th International Conference on Autonomous Agents and MultiAgent Systems, Virtual Event, UK, 3–7 May 2021
Measures in this field are emergent and consistent metrics for evaluation are yet to crystallize
Limitations to consider
A Meta Conclusion:
What AI thinks evaluators need to know about AI