*What are the four key components of the proposed Gen AI Evaluation Framework?
*Which of the following are common benchmarks for open-source leaderboards?
*Which of the following are not key steps in chain-of-verification?
*For Large Language Models, what might constitute "Conceptual Soundness"?
*Which of the following are examples of guardrails?
*Which of the following are not types of attacks against an LLM?