1 of 117

Planning for Mistakes

1

Machine Learning in Production

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

2 of 117

What do you remember from the last lecture?

2

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

3 of 117

Recall: The importance of understanding goals and assumptions

3

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

4 of 117

More Requirements to Understand Risks

4

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

5 of 117

Learning goals:

  • Consider ML models as unreliable components
  • Use safety engineering techniques STPA and FTA to anticipate and analyze possible mistakes
  • Design strategies for mitigating the risks of failures due to ML mistakes

5

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

6 of 117

Readings

Required reading: Kocielnik, Rafal, Saleema Amershi, and Paul N. Bennett. "Will you accept an imperfect AI? Exploring designs for adjusting end-user expectations of AI systems." In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems, pp. 1-14. 2019.

6

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

7 of 117

ML Models = Unreliable Components

7

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

8 of 117

Models make mistakes

8

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

9 of 117

Common excuse: Software mistake -- nobody's fault

9

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

10 of 117

Common excuse: The problem is just data

10

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

11 of 117

Common excuse: Nobody could have foreseen this...

11

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

12 of 117

What responsibility do designers have to anticipate problems?

12

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

13 of 117

13

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

14 of 117

Designing for Mistakes

14

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

15 of 117

Planning for Mistakes

15

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

16 of 117

Living with ML mistakes

No model is ever "correct"

Some mistakes are unavoidable

Anticipate the eventual mistake

  • Make the system safe despite mistakes
  • Consider the rest of the system (software + environment)
  • Recall: Thermal fuse in smart toaster

ML model = unreliable component

16

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

17 of 117

Many different strategies

Based on fault-tolerant design, assuming that there will be software/ML mistakes or environment changes violating assumptions

We will cover today:

  • Human in the loop
  • Undoable actions
  • Guardrails
  • Mistake detection and recovery (monitoring, doer-checker, fail-over, redundancy)
  • Containment and isolation

17

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

18 of 117

Designing for Mistakes Strategy: Human in the Loop

18

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

19 of 117

Today's Running Example: Autonomous Train

  • REQ: The train shall not collide with obstacles
  • REQ: The train shall not depart until all doors are closed
  • REQ: The train shall not trap people between the doors
  • ...

19

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

20 of 117

Human-AI Interaction Design (Human in the Loop)

Recall:

  • Automate: Take an action on user's behalf
  • Prompt: Ask the user if an action should be taken
  • Organize, annotate, or augment: Add information to a display
  • Or hybrid of these

20

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

21 of 117

Human in the Loop

  • AI and humans are good at predictions in different settings
    • AI better at statistics at scale and many factors
    • Humans understand context and data generation process; often better with thin data
  • AI for prediction, human for judgment?
  • But be aware of:
    • Notification fatigue, complacency, just following predictions; see Tesla autopilot
    • Compliance/liability protection only?
  • Deciding when and how to interact
  • Lots of UI design and HCI problems
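A minimal sketch of the "AI for prediction, human for judgment" split, with hypothetical model and operator interfaces; the system acts autonomously only when confidence is high and otherwise prompts the human:

def handle_obstacle_warning(image, model, ask_operator):
    prediction = model.predict(image)                        # e.g., {"obstacle": True, "confidence": 0.71}
    if prediction["confidence"] < 0.9:
        return ask_operator(image, prediction)               # prompt: the human decides (stop or proceed)
    return "stop" if prediction["obstacle"] else "proceed"   # automate only when confident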

21

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

22 of 117

Human in the Loop - Examples

22

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

23 of 117

Human in the Loop - Examples

23

Fall detection /

crash detection

with smartwatch

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

24 of 117

From the reading…

24

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

25 of 117

Human in the Loop - Examples?

25

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

26 of 117

Designing for Mistakes Strategy: Undoable Actions

26

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

27 of 117

Undoable actions

  • Automating only actions that can be undone
  • Design system to make actions undoable
  • Designing a process to appeal decisions

Examples?
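One way to think about it, as a minimal sketch with hypothetical thermostat objects: record the previous state before the automated action so it can be reversed later.

undo_log = []

def apply_predicted_setting(thermostat, predicted_temp):
    undo_log.append((thermostat, thermostat.current_temp))   # remember the old value
    thermostat.set(predicted_temp)                           # automated action

def undo_last_action():
    thermostat, old_temp = undo_log.pop()
    thermostat.set(old_temp)                                 # restore the previous state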

27

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

28 of 117

Undoable actions - �Examples

  • Override thermostat setting
  • Powerpoint design suggestions
  • 1-Click shopping with free return shipment
  • Appeal process for banned "spammers" or "bots"
  • Easy to repair bumpers on autonomous vehicles?

28

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

29 of 117

Undoable actions - Examples?

29

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

30 of 117

Designing for Mistakes Strategy: Guardrails

30

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

31 of 117

Guardrails

  • Post-process ML predictions before taking actions
  • Limit/truncate predictions to safe thresholds
  • Manual overrides for certain values
  • Backup models for known problematic conditions
  • Hardware protections

Ensures safe operating parameters despite wrong model predictions, without having to detect mistakes (see the sketch below)

Traditionally symbolic guardrails; today often another model is added to increase reliability
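A minimal sketch of a symbolic guardrail, using the smart-toaster example with made-up limits: the ML prediction is clamped to a safe range no matter what the model outputs.

MAX_TOAST_SECONDS = 240   # hard limit, enforced independently of the model
MIN_TOAST_SECONDS = 30

def safe_toast_time(predicted_seconds):
    # post-process the prediction: truncate to safe operating parameters
    return max(MIN_TOAST_SECONDS, min(predicted_seconds, MAX_TOAST_SECONDS))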

31

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

32 of 117

Guardrails: Bollards

32

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

33 of 117

Guardrails: Bollards

33

https://twitter.com/WorldBollard/status/1542959589276192770

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

34 of 117

Guardrails: Bollards

34

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

35 of 117

Guardrails - Examples

Recall: Thermal fuse in smart toaster

  • maximum toasting time + extra heat sensor

35

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

36 of 117

Guardrails - Examples?

36

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

37 of 117

Guardrails - Examples

37

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

38 of 117

Designing for Mistakes Strategy: Detection and Recovery

38

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

39 of 117

Mistake detection and recovery

Design a recovery mechanism if mistakes are detectable, directly or indirectly

Requires (1) a detection mechanism (e.g., external monitor, redundancy) and (2) a response

39

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

40 of 117

Mistake detection

An independent mechanism to detect problems (in the real world)

Example: Gyrosensor to detect a train taking a turn too fast

40

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

41 of 117

Mistake detection -- many strategies

  • Detect sensor failures with diagnostics
  • Detect sensor failures with redundancies
  • Monitor software for crashes
  • Monitor for expected environmental conditions
    • e.g., proper lighting of security camera footage
  • Check the outcome of an action against expectation
    • e.g., Vehicle accelerating, human clicking on something

Examples in autonomous train scenario?
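A minimal sketch (hypothetical door-controller and sensor interfaces) of checking the outcome of an action against expectation:

class DoorFaultDetected(Exception):
    pass

def close_doors_with_check(door_controller, door_sensor):
    door_controller.close()                    # commanded action
    if door_sensor.read() != "closed":         # independent check of the actual outcome
        raise DoorFaultDetected("doors did not close; trigger recovery")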

41

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

42 of 117

Doer-Checker Example: AV

ML-based controller (doer): Generate commands to steer the vehicle

    • Complex DNN; makes performance-optimal control decisions

Safety controller (checker): Checks commands from ML controller; overrides it with a safe default command if the ML action is risky

    • Simpler, based on verifiable, transparent logic; conservative control
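A minimal sketch of the doer-checker pattern under assumed interfaces; safe_default_command stands in for whatever conservative fallback the safety controller uses.

MAX_SAFE_STEERING_ANGLE = 15.0   # degrees; simple, verifiable limit

def next_steering_command(ml_controller, state, safe_default_command):
    proposed = ml_controller.predict(state)             # doer: performance-optimal decision
    if abs(proposed.angle) > MAX_SAFE_STEERING_ANGLE:   # checker: transparent, conservative rule
        return safe_default_command(state)              # override the risky ML action
    return proposed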

42

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

43 of 117

Doer-Checker Example: AV

  • Yellow region: Slippery road, ignored by ML -> Causes loss of traction
  • Checker: Monitor detects lane departure; overrides ML with a safe steering command

43

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

44 of 117

Graceful Degradation (Fail-safe)

Goal: When a component failure is detected, achieve system safety by reducing functionality and performance

Switches operating mode when failure detected (e.g., slower, conservative)
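A minimal sketch of graceful degradation with hypothetical train interfaces: when a sensor self-test fails, switch to a conservative operating mode instead of stopping the whole system.

def update_operating_mode(train, obstacle_detector):
    if obstacle_detector.self_test_failed():
        train.set_mode("degraded")     # reduced functionality/performance
        train.set_speed_limit(25)      # conservative: slow enough to stop on sight
    else:
        train.set_mode("normal")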

44

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

45 of 117

Designing for Mistakes Strategy: Redundancy

45

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

46 of 117

Redundancy

Useful for problem detection and response

  • Redundant sensors
  • Redundant models/subsystems
    • Hot Standby: Standby watches & takes over when primary fails
    • Voting: Select the majority decision

Challenge: Software + models are rarely really independent
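A minimal sketch of majority voting over redundant models (assuming independently developed classifiers that return hashable labels):

from collections import Counter

def vote(models, input_data):
    predictions = [m.predict(input_data) for m in models]
    label, count = Counter(predictions).most_common(1)[0]
    return label if count > len(models) // 2 else None   # no majority: escalate or fail over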

46

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

47 of 117

Redundancy Example: Sensor Fusion

Combine data from a wide range of sensors

Provides partial information even when some sensor is faulty

A critical part of modern self-driving vehicles

47

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

48 of 117

Designing for Mistakes Strategy: Containment and Isolation

48

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

49 of 117

Containment: Decoupling & Isolation

Design principle: Faults in low-criticality (LC) components should not impact high-criticality (HC) components

Example: Do not connect fly-by-wire software with the plane's entertainment system

Example in autonomous train?

49

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

50 of 117

Poor Decoupling: USS Yorktown (1997)

Invalid data entered into DB; divide-by-zero crashes entire network

Required rebooting the whole system; ship dead in water for 3h

Lesson: Handle expected component faults; prevent propagation

50

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

51 of 117

Recall: A Secure (but Less Useful?) Version

def analyze_email(email):
    prompt = f"Rate sentiment: 1=positive, 0=neutral, -1=negative\n{email}"
    response = ai_model.generate(prompt)   # neural step: single, narrow LLM call
    # symbolic check: only accept one of the three expected labels
    return int(response.strip()) if response.strip() in ['-1', '0', '1'] else 0

def generate_report(email_batch):
    emails = split_emails(email_batch)
    scores = [analyze_email(email) for email in emails]
    positive = scores.count(1)
    negative = scores.count(-1)
    total = len(scores)
    return f"Sentiment Report: {positive}/{total} positive, {negative}/{total} negative"

Note the clear separation between symbolic and neural reasoning

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

52 of 117

Containment in AI Agents?

def react_agent(query):
    context = llm(f"Think: How to answer '{query}'?")
    while not is_final_answer(context):
        # the LLM's free-text output directly selects privileged actions such as send_email
        action = llm(f"Context: {context}\nAction:")
        if "get_email" in action:
            result = get_email(extract_date_range(action))
        elif "send_email" in action:
            result = send_email(extract_to_body(action))
        else:
            result = "unrecognized action"   # avoid using an undefined result below
        context = llm(f"Previous: {context}\nAction: {action}\nResult: {result}\nContext:")
    return llm(f"Final answer based on: {context}")

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

53 of 117

Poor Decoupling: Automotive Security

  • Main components connected through a common CAN bus
    • Broadcast; no access control (anyone can read/write)
  • Can control brake/engine by playing a malicious MP3

53

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

54 of 117

Containment: Decoupling & Isolation

Design principle: Faults in low-criticality (LC) components should not impact high-criticality (HC) components

Apply the principle of least privilege: LC components should have minimal necessary access

Limit interactions across criticality boundaries: Deploy LC & HC components on different networks; add monitors/checks at interfaces

Is an ML component in my system performing an LC or HC task? If HC, can we "demote" it to LC? Alternatively, if possible, replace or augment HC ML components with non-ML ones (see the sketch below).
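A minimal sketch of applying least privilege to the agent code shown earlier, reusing its llm, get_email, and send_email helpers: the low-criticality summarization path only gets read access, and the high-criticality send action stays behind an explicit approval check at the interface.

def summarize_inbox(query):
    emails = get_email(query)          # LC path: read-only access, no send capability
    return llm(f"Summarize these emails: {emails}")

def send_drafted_email(draft, approved_by_user):
    if not approved_by_user:           # HC action gated by a human or guardrail
        raise PermissionError("send_email requires explicit approval")
    return send_email(draft)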

54

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

55 of 117

Simplified Agent Implementation

def react_agent(query):
    context = llm(f"Think: How to answer '{query}'?")
    while not is_final_answer(context):
        action = llm(f"Context: {context}\nAction:")
        if "get_email" in action:
            result = get_email(extract_date_range(action))
        elif "send_email" in action:
            result = send_email(extract_to_body(action))
        else:
            result = "unrecognized action"   # avoid using an undefined result below
        context = llm(f"Previous: {context}\nAction: {action}\nResult: {result}\nContext:")
    return llm(f"Final answer based on: {context}")

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

56 of 117

Design Strategies Summary

Human in the loop

Undoable actions

Guardrails

Mistake detection and recovery (monitoring, doer-checker, fail-over, redundancy)

Containment and isolation

56

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

57 of 117

Breakout

What harms from ML mistakes are possible and what design strategies would you consider to mitigate them?

  • Automatic audio captioning (subtitles) for streaming services
  • Calendar agent that reschedules meetings when you are running late, e.g., due to traffic or weather

Consider: Human in the loop, Undoable actions, Guardrails, Mistake detection and recovery (monitoring, doer-checker, fail-over, redundancy), Containment and isolation

As a group, post #lecture and tag all group members.

57

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

58 of 117

Hazard Analysis: Anticipating and Analyzing Risks

58

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

59 of 117

What's the worst that could happen?

59

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

60 of 117

60

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

61 of 117

What's the worst that could happen?

61

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

62 of 117

What's the worst that could happen?

62

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

63 of 117

What's the worst that could happen?

63

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

64 of 117

What's the worst that could happen?

64

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

65 of 117

What to mitigate?

Recall:

We can reduce/eliminate many risks, but not for free

We can only mitigate risks we know

Wait for problems to occur or be proactive?

65

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

66 of 117

Hazard Analysis for Risk Identification

Proactively identifying potential problems before they occur

Traditional safety engineering techniques

Essentially “structured brainstorming”

Resulting risks are subsequently analyzed and possibly mitigated

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

67 of 117

STPA

67

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

68 of 117

STPA (system-theoretic process analysis)

Identify stakeholders, including indirect ones

Identify their stakes, values, goals

For each value explore corresponding loss (e.g., loss of life, injury, damage, loss of mission, loss of customer satisfaction, financial loss, environmental loss, information leakage)

For each loss, identify requirement to prevent it

Identify possible reasons for violating requirement

See also Leveson, Nancy G. Engineering a safer world: Systems thinking applied to safety. The MIT Press, 2016.

STPA Handbook https://psas.scripts.mit.edu/home/materials/

We only use the initial steps of STPA for hazard identification here.

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

69 of 117

STPA Example

  1. Stakeholders?
  2. Their values/goals?
  3. Losses of concern?
  4. Corresponding requirements?

69

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

70 of 117

STPA Advice

Explore stakeholders broadly, both direct (e.g., train passengers, operators) and indirect (e.g., people living near tracks, city government) – often many

Understand what they care about, see user goals (e.g., fast travel times, safety)

Explore possible losses broadly, small and large, including financial losses, injuries, and mental stress (e.g., late for work, injured in the train doors)

Translation to requirements often straightforward (e.g., leave on time, do not trap passengers in doors)

Be comprehensive but focus on more severe problems

70

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

71 of 117

Example: Hazard Analysis for Trail Recommendation

Stakeholders: end users, app developers, API providers, trail management organizations, local businesses.

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

72 of 117

Hazard Analysis as Structured Brainstorming

Lots of paperwork? Tedious?

Reliable?

LLM assistance?

72

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

73 of 117

Other Risk Identification Strategies

Brainstorm worst case scenarios and their causes (from the perspective of different stakeholders)

Read about common risks in domains (e.g., web security risks, accounting) and accidents/failures in competing projects

Expert opinions

Early warning indicators, incident analysis, near-miss reporting

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

74 of 117

Risk Analysis

For each risk judge severity and likelihood to prioritize

Focus on high severity or high frequency issues first

Involve more people in the conversation, plan next steps

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

75 of 117

Fault Tree Analysis

75

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

76 of 117

Analyzing Possible Causes of Loss

Risk identification with hazard analysis tells us what to avoid

Next: What can go wrong to lead to the loss? How can we prevent it?

76

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

77 of 117

Fault Tree Analysis (FTA)

Fault tree: A diagram that displays relationships between a system failure (i.e., requirement violation) and potential causes.

  • Identify event sequences that can result in failure
  • Prioritize contributors leading to a failure
  • Inform design decisions
  • Investigate an accident & identify the root cause

Often used for safety & reliability, but can also be used for other types of requirements (e.g., poor performance, security attacks...)

77

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

78 of 117

Fault Tree Analysis & ML

  • ML is increasingly used in safety-critical domains such as automotive, aeronautics, industrial control systems, etc.
  • ML models are just one part of the system
  • ML models will EVENTUALLY make mistakes
    • Output wrong predictions/values
    • Fail to adapt to the changing environment
    • Confuse users, etc.
  • How do mistakes made by ML contribute to system failures? How do we ensure their mistakes do not result in a catastrophic outcome?

78

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

79 of 117

Fault Trees: Basic Building Blocks

Event: An occurrence of a fault or an undesirable action

  • (Intermediate) Event: Explained in terms of other events
  • Basic Event: No further development or breakdown; a leaf (where to stop decomposing is a choice)

Gate: Logical relationship between an event & its immediate subevents

  • AND: All of the sub-events must take place
  • OR: Any one of the sub-events may result in the parent event
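A minimal sketch of these building blocks as a data structure, with hypothetical basic events from the train-door example; gates are tuples, basic events are strings, and occurs checks whether a given set of basic events triggers the parent event.

tree = ("OR",
        ("AND", "vision misses person in door", "pressure sensor fails"),
        "door actuator fails")

def occurs(node, active_basic_events):
    if isinstance(node, str):                      # basic event (leaf)
        return node in active_basic_events
    gate, *children = node
    results = [occurs(child, active_basic_events) for child in children]
    return all(results) if gate == "AND" else any(results)

print(occurs(tree, {"vision misses person in door"}))   # False: the AND gate needs both events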

79

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

80 of 117

Fault Tree Example

Every tree begins with a TOP event (typically a violation of a requirement)

Every branch of the tree must terminate with a basic event

80

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

81 of 117

Analysis: What can we do with fault trees?

  1. Qualitative analysis: Determine potential root causes of a failure through minimal cut set analysis
  2. Quantitative analysis: Compute the probability of a failure

81

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

82 of 117

Minimal Cut Set Analysis

Cut set: A set of basic events whose simultaneous occurrence is sufficient to guarantee that the TOP event occurs.

Minimal cut set: A cut set from which a smaller cut set can't be obtained by removing a basic event.

What are minimal cut sets here?

82

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

83 of 117

Failure Probability Analysis

To compute the probability of the top event:

  • Assign probabilities to basic events (based on domain knowledge)
  • Apply probability theory to compute probabilities of intermediate events through AND & OR gates
  • (Alternatively, as sum of prob. of minimal cut sets)
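A small illustration (not required in this class) with made-up probabilities for independent basic events, using the standard gate formulas:

p_vision_miss = 0.01
p_sensor_fault = 0.001
p_actuator_fault = 0.0001

p_and = p_vision_miss * p_sensor_fault                 # AND gate: both events must occur
p_top = 1 - (1 - p_and) * (1 - p_actuator_fault)       # OR gate: at least one event occurs
print(p_top)                                           # ~0.00011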

In this class, we won't ask you to do this.

  • Why is this especially challenging for software?

83

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

84 of 117

FTA Process

  1. Specify the system structure
    1. Environment entities & machine components
    2. Assumptions (ASM) & specifications (SPEC)
  2. Identify the top event as a requirement violation (REQ)
  3. Construct the fault tree
    • Derive intermediate events from a violation of ASM or SPEC
    • Decompose the intermediate events further down based on the knowledge of the domain or components
  4. Analyze the tree, identify all possible cause combinations that can lead to requirement violation (“minimal cut sets”)
  5. Consider design modifications
    • Eliminate causes (subtrees, cutsets), or
    • Increase the number of conditions needed (new events connected via an AND gate, increasing the size of minimal cut sets)
  6. Repeat

84

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

85 of 117

Example: Autonomous Train

85

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

86 of 117

Example: Autonomous Train

Modern ML-powered vision system to efficiently and safely close doors before departure

  • REQ: The train shall not trap people between the doors

Using a fault tree, identify possible problems that could lead to trapping a person in the door.

  • Hint: What assumptions and specifications might be violated?

86

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

87 of 117

  • Remove basic events with mitigations
  • Increase the size of cut sets with mitigations
  • Recall: Guardrails

87

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

88 of 117

  • Remove basic events with mitigations
  • Increase the size of cut sets with mitigations
  • Recall: Guardrails

88

Probably unavoidable, but can increase reliability

Necessary risk remaining?

Reduce with legal threats?

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

89 of 117

89

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

90 of 117

90

Mitigation so that vision failure alone does not cause violation (add AND event)

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

91 of 117

91

Terrible design; have a safe default instead

Single point of failure?

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

92 of 117

92

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

93 of 117

93

Eliminate software crash as possible cause

(remove event)

Add redundancy to increase reliability (add AND event)

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

94 of 117

One more example: FTA for Lane Assist

REQ: The vehicle must be prevented from veering off the lane.

SPEC: Lane detector accurately identifies lane markings in the input image; the controller generates correct steering commands

ASM: Sensors are providing accurate information about the lane; driver responds when given a warning; steering wheel is functional

Possible mitigations?

94

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

95 of 117

FTA: Caveats

In general, building a complete tree is impossible

  • There are probably some fault events that you missed
  • "Unknown unknowns", black swan events
  • Events can always be decomposed; detail level is a choice.

Domain knowledge is crucial for improving coverage

  • Talk to domain experts; augment your tree as you learn more

FTA is still very valuable for risk reduction!

  • Forces you to think about and document possible failure scenarios
  • A good starting basis for designing mitigations

95

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

96 of 117

Breakout: Fault Tree

REQ: The generated music featured on the front page should not contain lyrics denigrating minorities

As a group,

  • (a) draw a small fault tree involving at least one non-ML and one ML-related basic event
  • (b) introduce a mitigation and highlight it in the diagram

Use pen & paper or any software. As a group, post a photo or screenshot to #lecture, tagging all members.

96

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

97 of 117

Aside: STPA View

Most losses are caused by complex issues, not just single mistakes

Consider the entire system, control structure, including humans and their training/oversight

Systematically evaluate whether the controls are effective

97

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

98 of 117

Zooming Out: General Safety Engineering Strategies

Identify possible hazards from stakeholder goals (STPA, early steps)

Identify possible hazards from component failures (FMEA, HAZOP) -- forward: from cause to hazard

Analyze causes of anticipated/known hazards (FTA) -- backward: from hazard to cause

Analyze effectiveness of control mechanisms for anticipated/known hazards, including non-technical controls (STPA)

98

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

99 of 117

Bonus Slides: FMEA

99

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

100 of 117

Failure Mode and Effects Analysis (FMEA)

Forward search from possible root causes to hazards

Does not assume the hazards are known (as FTA requires)

Consider component failures (SPEC violations) and failed assumptions (ASM violations) as possible causes

Widely used in aeronautics, automotive, healthcare, food services, semiconductor processing, and (to some extent) software

100

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

101 of 117

Failure Mode and Effects Analysis (FMEA)

101

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

102 of 117

FMEA Process

(a) Identify system components

(b) Enumerate potential failure modes for each component

  • for an ML component: always suspect the prediction may be wrong

(c) For each failure mode, identify:

  • Potential hazardous effect on the system
  • Method for detecting the failure
  • Potential mitigation strategy

102

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

103 of 117

FMEA Example: Autonomous Train Doors

Failure modes? Failure effects? Detection? Mitigation?

103

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

104 of 117

FMEA Example Excerpt: Autonomous Car

104

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

105 of 117

"Wrong Prediction" as Failure Mode?

"Wrong prediction" is a coarse grained failure mode of every model

May not be possible to decompose further

However, may evaluate causes of wrong prediction for better understanding, as far as possible (FTA could be used for this)

105

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

106 of 117

FMEA Summary

Forward analysis: From components to possible failures

Focus on single component failures, no interactions

Identifying failure modes may require domain understanding

106

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

107 of 117

Bonus Slides: HAZOP

107

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

108 of 117

Hazard and Operability Study (HAZOP)

Identify hazards and component fault scenarios through guided inspection of requirements

108

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

109 of 117

Hazard and Operability Study (HAZOP)

A forward search method to identify potential hazards from component failures (and assumption violations)

For each component, use a set of guide words to generate possible deviations from expected behavior

Consider the impact of each generated deviation: Can it result in a system-level hazard?
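A minimal sketch of the mechanical part of HAZOP, with hypothetical component specifications: enumerate guide word and component combinations as candidate deviations, which engineers then judge for system-level hazards.

from itertools import product

GUIDE_WORDS = ["NO OR NOT", "MORE", "LESS", "LATE", "REVERSE", "WRONG"]
COMPONENT_SPECS = {
    "emergency braking": "apply maximum braking command",
    "door vision": "report obstacles between the doors",
}

for (component, spec), guide_word in product(COMPONENT_SPECS.items(), GUIDE_WORDS):
    # each combination is a candidate deviation to review
    print(f"{component}: {guide_word} -- {spec}")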

109

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

110 of 117

HAZOP Example: Emergency Braking (EB)

Specification: EB must apply a maximum braking command to the engine.

  • NO OR NOT: EB does not generate any braking command.
  • LESS: EB applies less than max. braking.
  • LATE: EB applies max. braking but after a delay of 2 seconds.
  • REVERSE: EB generates an acceleration command instead of braking.
  • BEFORE: EB applies max. braking before a possible crash is detected.

110

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

111 of 117

HAZOP & ML

In addition to traditional analysis: Analyze possible mistakes of all ML components

Original guidewords: NO OR NOT, MORE, LESS, AS WELL AS, PART OF, REVERSE, OTHER THAN / INSTEAD, EARLY, LATE, BEFORE, AFTER

Additional ML-specific guidewords: WRONG, INVALID, INCOMPLETE, PERTURBED, and INCAPABLE.

111

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

112 of 117

Breakout: Automated Train Doors

Analyze the vision component that detects obstacles in train doors

NO OR NOT, MORE, LESS, AS WELL AS, PART OF, REVERSE, OTHER THAN / INSTEAD, EARLY, LATE, BEFORE, AFTER, WRONG, INVALID, INCOMPLETE, PERTURBED, and INCAPABLE.

Using HAZOP: As a group answer in #lecture, tagging group members:

  • What is the specification of the perception component?
  • What are possible deviations from the specification?
  • What are potential hazards resulting from these deviations?
  • What possible mitigations would you consider? (e.g., human in the loop, undoable actions, guardrails, mistake detection and recovery, containment)

112

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

113 of 117

HAZOP: Benefits & Limitations

Easy to use; encourages systematic reasoning about component faults

Can be combined with STPA/FMEA to generate faults (i.e., basic events in FTA)

Potentially labor-intensive; relies on engineer's judgement

Does not guarantee to find all hazards (but also true for other techniques)

113

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

114 of 117

Remarks: Hazard Analysis

None of these methods guarantee completeness

  • You may still be missing important hazards, failure modes

Intended as structured approaches to thinking about failures

  • But cannot replace human expertise and experience

114

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

115 of 117

Summary

115

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

116 of 117

Summary

Accept that failures are inevitable

  • ML components will eventually make mistakes; the reasons barely matter
  • Environment may evolve over time, violating assumptions

Design strategies for mitigating mistakes

  • Human in the loop, undoable actions, guardrails, mistake detection and recovery (monitoring, doer-checker, fail-over), redundancy, containment and isolation

Use risk analysis to identify and mitigate potential problems

  • STPA, FTA, FMEA, HAZOP

116

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025

117 of 117

Further readings

🕮 Google PAIR. People + AI Guidebook. 2019, especially chapters “Errors + Graceful Failure” and “Mental Models.”

🗎 Martelaro, Nikolas, Carol J. Smith, and Tamara Zilovic. “Exploring Opportunities in Usable Hazard Analysis Processes for AI Engineering.” In AAAI Spring Symposium Series Workshop on AI Engineering: Creating Scalable, Human-Centered and Robust AI Systems (2022).

🗎 Qi, Yi, Philippa Ryan Conmy, Wei Huang, Xingyu Zhao, and Xiaowei Huang. “A Hierarchical HAZOP-Like Safety Analysis for Learning-Enabled Systems.” In AISafety2022 Workshop at IJCAI2022 (2022).

🗎 Beachum, David Robert. “Methods for assessing the safety of autonomous vehicles.” MSc thesis, 2019.

🗎 Amershi, Saleema, Dan Weld, Mihaela Vorvoreanu, Adam Fourney, Besmira Nushi, Penny Collisson, Jina Suh et al. “Guidelines for human-AI interaction.” In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems, 2019.

🗎 Shneiderman, Ben. “Bridging the gap between ethics and practice: Guidelines for reliable, safe, and trustworthy Human-Centered AI systems.” ACM Transactions on Interactive Intelligent Systems (TiiS) 10, no. 4 (2020): 1–31.

🗎 Rismani, Shalaleh, Renee Shelby, Andrew Smart, Edgar Jatho, Joshua Kroll, AJung Moon, and Negar Rostamzadeh. "From plane crashes to algorithmic harm: applicability of safety engineering frameworks for responsible ML." In Proceedings CHI, pp. 1-18. 2023.

🗎 Hong, Yining, Christopher S. Timperley, and Christian Kästner. "From hazard identification to controller design: Proactive and LLM-supported safety engineering for ML-powered systems." In Proc. CAIN, pp. 113-118. IEEE, 2025.

🕮 Leveson, Nancy G. Engineering a safer world: Systems thinking applied to safety. The MIT Press, 2016.

🗎 STPA Handbook https://psas.scripts.mit.edu/home/materials/

117

Machine Learning in Production • Christian Kaestner & Bogdan Vasilescu, Carnegie Mellon University • Fall 2025