1 of 38

IA, AI & Voice:

How can we make voice interactions easier?

a talk by

Lade Tawak

2 of 38

3 of 38

The cognitive considerations for voice interfaces

miller’s law = the span of immediate memory is limited to around 7 pieces of information

Avoid lengthy responses, chunk responses,

confirm attention and understanding

4 of 38

LADE TAWAK

Low digital literacy +

no/low visual context

= fear and anxiety

CONSIDERATION 1

5 of 38

I press something and something happens

I’m talking to someone?

No direct manipulation

6 of 38

LADE TAWAK

“The main bottleneck of using a voice assistant system is that the user cannot know all the commands.” — Park, Park, Song (2020)

What does this do?

7 of 38

What do I need to make a marinade?

You need oil, salt…

(still listing)

Okay

(affirming)

Conversational cues

8 of 38

How do we come in?

Set context, explain to the user what the agent can do

Low digital literacy

9 of 38

Proper data structures which allow AI properly parse user requests to map to attributes

Low digital literacy

How do we come in?

10 of 38

Create Dialog Flows and maps that include, clarification paths, error messages, and error recovery

Low digital literacy

How do we come in?

11 of 38

That’s card activity, is that correct?

How can I help you? You can say report a stolen card, card activity, insurance, or I need help with something else

Card activity

Yes

What card activity can I help with? Say: balance, recent transactions, dispute a charge or I need help with something else.

12 of 38

LADE TAWAK

Communication styles and language

CONSIDERATION 2

13 of 38

Identifying user intent

Book me an uber to the airport

I’d like a pizza

Order a medium triple decker bbq chicken pizza

I need to go pick up Tunta and then go to church

14 of 38

  • Language is more than grammar and sentence structure

  • Slangs, social norms, or cultural information add meaning

Context and content

15 of 38

Cultural cognition

When I say stove, what do you picture?

16 of 38

Cultural cognition

When I say cooker, what do you picture?

  • pressure cooker?
  • rice cooker?
  • slow cooker?
  • nothing?

17 of 38

Two chatbots to teach about consent in the context of sexual activities. One LC, one HC. (Fromet, 2021)

Low context, high context

Can the agent match my interaction style

HC Culture Participants

LC Culture Participants

Low context interaction

Neutral

Slight preference

High context interaction

Strong preference

Neutral

18 of 38

Call Onyinye

I didn’t get that

Call Onyinye

Try again

Accent recognition

19 of 38

How to make stock

I’m sorry I can only teach how to make a marinade or curry or stock

I want to make stock

Okay, we can resume whenever you’re ready

What would you like to learn

Accent recognition

20 of 38

LADE TAWAK

“Having to adapt our way of speaking to interact with speech recognition technologies is a familiar experience for people whose first language is not English or who do not have conventionally American-sounding names. I have even stopped using Siri because of it” Claudia Lopez Lloreda

21 of 38

Multilingual behaviour

Can the agent understand me if I speak multiple languages at the same time

22 of 38

Ensure data sets represent a wide range of accents, dialects, and cultural references.

Communication styles

How do we come in?

23 of 38

Ensure taxonomies, metadata, and data labelling are free from biases, but also consider cultural contexts

Communication styles

How do we come in?

24 of 38

Customise agent personality based on location and other signals

Communication styles

How do we come in?

25 of 38

Allow personalisation and customization

Communication styles

How do we come in?

26 of 38

In creating DialogFlows, include multiple sample utterances that cover diverse cultural approaches

Communication styles

How do we come in?

27 of 38

Communication styles

This is not new

28 of 38

Communication styles

Different food menu items based on location

This is not new

29 of 38

LADE TAWAK

Environmental context

CONSIDERATION 3

30 of 38

What do I need to make a marinade?

You need oil, soy sauce, garlic, pepper, and lemon juice

I don’t have lemon juice

…..

Should I place an order for you?

Unavailable in my country

31 of 38

Create flexible interaction structures such as branching choices and decision points that can respond based on predetermined attributes

Communication styles

How do we come in?

32 of 38

Customize what agent can do and not do based on what is available to the user in their context (e.g. based on location)

Communication styles

How do we come in?

33 of 38

Map user journeys and understand the surrounding context outside of your product

Communication styles

How do we come in?

34 of 38

Wrapping up

35 of 38

…understand user (intent)

…retrieve relevant information

…generate the right output

HELP THE AGENT

36 of 38

…customizing agent based on sociocultural context

…using inclusive taxonomies and data labelling

…designing multiple branching choices in the dialogflow

BY…

37 of 38

make voice interactions more inclusive, usable, and accessible

TO…

38 of 38

Lade Tawak

THANK YOU