1 of 56

Human-Computer Interaction

saadh.info/hci

Week 13 (Thursday): Designing Human-AI Interaction

2 of 56

Attendance and Agenda

Design for Human-AI Interaction

3 of 56

Announcements

Assignment 3 overdue. 7 missing…
Continue working on milestone 2! Due December 6

Assignment 4 (due December 2): conduct a heuristic evaluation of one screen flow
Discuss how to divide the work among team members

Test 2 on Nov 21

Same format as test 1
Read chapters 4-7 of Human-Computer Interaction: An Empirical Research Perspective by I. Scott MacKenzie, if you can

4 of 56

Test 2 Next Thursday

5 of 56

Cognitive Walkthroughs

Walkthroughs are methods where an expert (that would be you, the designer) defines tasks. Afterward, rather than testing those tasks with real people, you walk through each step of the task and verify that a user would:

know to do the step,
know how to do the step,
successfully do the step, and
understand the feedback the design provided.

If you go through every step and check these four things, you’ll find most problems with a design.

Polson, P. G., Lewis, C., Rieman, J., & Wharton, C. (1992). Cognitive walkthroughs: a method for theory-based evaluation of user interfaces. International Journal of Man-Machine Studies.

6 of 56

Performing Cognitive Walkthrough

Select a task to evaluate (probably a frequently performed important task that is central to the user interface’s value). Identify every individual action a user must perform to accomplish the task with the interface.
Obtain a prototype of all of the states necessary to perform the task, showing each change. This could be anything from a low-fidelity paper prototype showing each change along a series of actions to a fully functioning implementation.
Develop or obtain personas of representative users of the system. You’ll use these to help speculate about user knowledge and behavior.

7 of 56

Identifying Design Flaws using Walkthrough

Will the user try to achieve the right effect?

Would the user even know that this is the goal they should have?

Will the user notice that the correct action is available?

If they wouldn’t notice, you have a design flaw.

Will the user associate the correct action with the effect that the user is trying to achieve?

Even if they notice that the action is available, they may not know it has the effect they want.

If the correct action is performed, will the user see that progress is being made toward the solution of the task?

Is there feedback that confirms the desired effect has occurred?

8 of 56

GenderMag Walkthrough

Four customizable personas that cover five facets:
A user’s motivations for using the software.
A user’s information processing style (top-down, which is more comprehensive before acting, and bottom-up, which is more selective).
A user’s computer self-efficacy (their belief that they can succeed at computer tasks).
A user’s stance toward risk-taking in software use.
A user’s strategy for learning new technology.

9 of 56

Usability Evaluation Steps

Plan and prepare
Conduct the test
Collect data
Analyze data
Draw conclusions
Document results
Repeat step

11 of 56

1. Perception and Cognition

12 of 56

2. User Research Methods & Qualitative Analysis

Interviews

Contextual Inquiry

Think-Aloud

13 of 56

3. Experimental Research in HCI

Error bars show ±1 standard deviation

14 of 56

4. Analytical Evaluations

Last week

15 of 56

5. Modeling Interactions

Fitts’ Law

Hick-Hyman Law: RT = a + b log₂(n + 1)

Units: bits
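A minimal sketch of how the Hick-Hyman formula above can be evaluated. The intercept a and slope b are hypothetical values chosen only for illustration, not measured constants. The slide names Fitts' Law without giving a formula, so the index of difficulty below assumes the Shannon formulation, log₂(A/W + 1), where A is the movement amplitude and W is the target width.

```python
import math


def hick_hyman_rt(n, a=0.2, b=0.15):
    """Predicted reaction time in seconds for n equally likely choices.

    a (intercept, seconds) and b (slope, seconds per bit) are
    hypothetical values used only for illustration.
    """
    return a + b * math.log2(n + 1)


def fitts_index_of_difficulty(amplitude, width):
    """Index of difficulty in bits (Shannon formulation, assumed here)."""
    return math.log2(amplitude / width + 1)


# 7 choices -> log2(8) = 3 bits of information
print(hick_hyman_rt(7))
# Amplitude 300, width 100 -> log2(4) = 2 bits
print(fitts_index_of_difficulty(amplitude=300, width=100))
```

In practice a and b are fitted by regression on reaction-time data from an experiment, so the defaults here should not be read as typical values.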

16 of 56

6. Designing for Human-AI Interaction

17 of 56

What we know about design so far

Gestalt Principles

Visual Design

Norman’s Design Principles

Nielsen's Heuristics

18 of 56

Can we apply these to Human-AI Interactions?

AI-infused systems can violate established usability guidelines of traditional user interface design

Inherently inconsistent due to poorly understood underlying probabilistic systems and black-box implementations
Change over time, e.g., learn more, learn false information
React differently in different conditions
Behave differently from one user to the next (e.g., browsers due to personalization)

19 of 56

Shneiderman-Maes Debate

Ben Shneiderman and Pattie Maes. 1997. Direct manipulation vs. interface agents. interactions 4, 6 (Nov./Dec. 1997), 42–61. https://doi.org/10.1145/267505.267514 (569 Citations)

20 of 56

This week: Meredith Morris & Michael Bernstein vs. Andrés Monroy-Hernández & Jeff Bigham

21 of 56

Guidelines for Human-AI Interaction

Saleema Amershi, Dan Weld, Mihaela Vorvoreanu, Adam Fourney, Besmira Nushi, Penny Collisson, Jina Suh, Shamsi Iqbal, Paul N. Bennett, Kori Inkpen, Jaime Teevan, Ruth Kikin-Gil, and Eric Horvitz. 2019. Guidelines for Human-AI Interaction. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems (CHI '19). Association for Computing Machinery, New York, NY, USA, Paper 3, 1–13. https://doi.org/10.1145/3290605.3300233 (1604 Citations)

22 of 56

Guidelines for Human-AI Interaction

23 of 56

Guidelines for Human-AI Interaction

24 of 56

Guidelines for Human-AI Interaction

25 of 56

Implications throughout the design cycle…

26 of 56

Google People+AI Research Guidelines

https://pair.withgoogle.com/guidebook/

27 of 56

Human-AI Interaction

AI is better at some things than others. Make sure that it’s the right technology for the user problem you’re solving.
Four fundamental questions

How do I get started?
How do I onboard new users?
How do I explain AI performance?
How do I help build and calibrate trust in my product?

28 of 56

How to get started?

29 of 56

Determine if AI adds value

When AI is probably better

Recommending different content to different users, such as movie suggestions
Predicting future events, such as weather events or flight price changes
Natural language understanding
Image recognition

A heuristic-based solution is better

Maintaining predictability is important, e.g., scheduling
Users, customers or developers need complete transparency, e.g., credit card scoring?
People don’t want a task automated

30 of 56

Automation vs. Augmentation

Augment: When a machine, software, or function extends a person’s abilities or potential while maintaining their agency.

Automate: When a machine, software, or function performs a task without user involvement.

31 of 56

Determine if AI adds value

Don’t use AI just because you can. Heuristics or manual control can often create better experiences. Here, using music preferences to suggest workouts will likely lead to a worse experience than letting people manually choose workouts.

32 of 56

Setting the Right Expectations

Avoid suggesting that the technology works perfectly in high-stakes situations if the tech isn’t yet reliable.

33 of 56

Be accountable for errors

Providing access to a person can be one way to make sure users’ concerns and problems are directly addressed. Sometimes the user’s error can’t be directly remedied but actions can be taken to make sure other users don’t encounter the same problem.

34 of 56

Invest early in good data practices

The better your data planning and collection processes, the higher quality your end output.

Collect data in batches.
Embrace “noisy” data
Plan for data maintenance
Partner with domain experts

35 of 56

How to best do data collection?

Embrace Noisy Data

Design for data labelers (supervised learning)

Learn from disagreements

36 of 56

Make Precision and Recall Trade-offs Carefully

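Precision

When optimizing for precision, few or no false positives are included in the results, but some true positives are missed.

Recall

When optimizing for recall, all or nearly all true positives are included in the results, but some false positives are captured as well.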
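The trade-off above can be made concrete with the standard definitions: precision = TP / (TP + FP) and recall = TP / (TP + FN). The sketch below uses hypothetical counts for a strict and a lenient decision threshold; the numbers are made up for illustration.

```python
def precision_recall(tp, fp, fn):
    """Return (precision, recall) from true-positive, false-positive,
    and false-negative counts. Returns 0.0 when a ratio is undefined."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Hypothetical counts for the same 60 relevant items.
# Strict threshold: no false positives, but 20 relevant items are missed.
strict = precision_recall(tp=40, fp=0, fn=20)
# Lenient threshold: every relevant item is found, plus 30 irrelevant ones.
lenient = precision_recall(tp=60, fp=30, fn=0)

print("strict  (precision, recall):", strict)
print("lenient (precision, recall):", lenient)
```

The strict setting reaches perfect precision at the cost of recall, and the lenient setting does the reverse. Which one to favor depends on whether a missed result or a wrong result is more costly for the user.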

37 of 56

What-If Tool

https://pair-code.github.io/what-if-tool

38 of 56

Make Precision and Recall Trade-offs Carefully

Enable users to include results (true positives) that may have been excluded.

Enable users to exclude results (false positives) that may have been included.

39 of 56

How do I onboard new users?

40 of 56

Explain the benefit, not the technology

Emphasize how the app will benefit users. Avoid emphasizing the underlying technology.

41 of 56

Anchor on familiarity

Use familiar concepts from your product’s domain to help users set expectations and feel comfortable with the material. Avoid using clever and novel solutions just for the sake of it when a familiar solution will be more effective.

42 of 56

Automate in phases

As you design your product, think critically about the balance of automation and control that you need to offer your users for them to use your product successfully.

43 of 56

How to explain AI performance?

44 of 56

Determine how to show model confidence, if at all

Show confidence in a way that is easy to interpret and understand when making a decision. Provide recourse for when the system is less than fully confident. Don’t use raw numeric values.
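One way to follow this advice is to map the model's score onto a small set of categories and attach a recourse message to the lowest one. The sketch below is an illustration only: the function name, the thresholds, and the label wording are all hypothetical and would need tuning with real users.

```python
def confidence_label(score, high=0.85, low=0.5):
    """Map a model confidence score in [0, 1] to a user-facing category.

    The thresholds and label text are hypothetical; they are not taken
    from any guideline and should be validated with users.
    """
    if not 0.0 <= score <= 1.0:
        raise ValueError("score must be between 0 and 1")
    if score >= high:
        return "Best match"
    if score >= low:
        return "Good match"
    # Low confidence: offer recourse instead of a bare result.
    return "Unsure - please review"


for s in (0.93, 0.6, 0.2):
    print(s, "->", confidence_label(s))
```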

45 of 56

Explain for understanding, not completeness

Don't try to explain the entire system, especially when the rationale is complex or unknown.

46 of 56

Go beyond in-the-moment explanations

Help users better understand your product with deeper explanations outside immediate product flows.

47 of 56

How do I help users build and calibrate trust in my product?

48 of 56

Setting the Right Expectations

Avoid suggesting that the tech works perfectly in high-stakes situations if the tech isn’t yet reliable.

49 of 56

Be transparent about privacy and data settings

Communicate what data is being collected and shared, and give users the ability to control their preferences.

50 of 56

Add context from human sources

Third Party Experts

Social Proofs

51 of 56

Let users give feedback

Don’t just thank users—reveal how feedback will benefit them. They’ll be more likely to give feedback again. Let users know what adjustments would happen.

52 of 56

Let users supervise automation

Avoid automating without giving users a way to undo the action or to make the choice themselves in the first place.

53 of 56

Let users supervise automation

Be more proactive with automation when failure tolerance is higher.

Avoid automating without user control in high-stakes situations.
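The two rules above can be read as a simple decision policy. The sketch below is a hypothetical illustration, not a prescribed implementation: the function name, the action names, and the confidence threshold are assumptions made for the example.

```python
def choose_automation(confidence, high_stakes, threshold=0.9):
    """Pick how proactive the system should be for one action.

    confidence: model confidence in [0, 1].
    high_stakes: True when a mistake would be costly for the user.
    threshold: hypothetical cut-off for acting without asking.
    """
    if high_stakes:
        # Low failure tolerance: never act without user control.
        return "ask_user"
    if confidence >= threshold:
        # Higher failure tolerance: act, but keep an undo available.
        return "automate_with_undo"
    # Otherwise only suggest and let the user decide.
    return "suggest"


print(choose_automation(0.95, high_stakes=False))
print(choose_automation(0.95, high_stakes=True))
print(choose_automation(0.60, high_stakes=False))
```

Keeping an undo path even in the automated branch reflects the earlier point that users should be able to reverse what the system did.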

54 of 56

Give control back to users when automation fails

Help users to take over when automation fails.

55 of 56

56 of 56

Attendance & Next Time

Presentations for milestone 1 review
Test 2 Review
