Harshvardhan
Takawale
Nirupam
Roy
Yang
Bai
Irtaza
Shahid
Scribe: Simultaneous Voice and Handwriting Interface
Motivation
System Design
Evaluation
Application
Motivation
System Design
Evaluation
Application
Motivation
System Design
Evaluation
Application
Motivation
System Design
Two-Factor Authentication (2FA)
Evaluation
Application
Motivation
System Design
Two-Factor Authentication (2FA)
Something you
ARE
Voice
Biometric
Evaluation
Application
Motivation
System Design
Two-Factor Authentication (2FA)
Something you
ARE
Something you
KNOW
Voice
Biometric
Signatures
+
Evaluation
Application
Motivation
System Design
Two-Factor Authentication (2FA)
Something you
ARE
Something you
KNOW
Secure
Access
Voice
Biometric
Signatures
=
+
Evaluation
Application
Motivation
System Design
Evaluation
Application
[1] Z. Wang et al.: Hand Gesture Recognition Based on Active Ultrasonic Sensing of Smartphone: A Survey
Motivation
System Design
Evaluation
Application
Motivation
System Design
Evaluation
Application
How do people prefer to write? Especially in-Air!
Motivation
System Design
Evaluation
Application
Can voice interfaces detect hand signatures?
Can voice interfaces detect hand signatures?
Motivation
System Design
Evaluation
Application
Challenges
Motivation
System Design
Limited frequency band for tracking
High precision tracking
Multipath interference
Evaluation
Application
Challenges
Motivation
System Design
Limited frequency band for tracking
High precision tracking
Multipath interference
Evaluation
Application
Frequency band is limited resource
10k
Frequency
20k
30k
40k
Amplitude
Inaudible
Audible
Motivation
System Design
Evaluation
Application
Frequency band is limited resource
10k
Frequency
20k
30k
40k
Amplitude
Inaudible
Audible
Microphone
filter
Microphone’s recordable bandwidth
Motivation
System Design
Evaluation
Application
Frequency band is limited resource
Frequency band of human voice
Motivation
System Design
Applications
Conclusion
Frequency band is limited resource
Amazon Alexa and Amazon Echo’s recordable bandwidth
10k
Frequency
20k
30k
40k
Amplitude
Higher frequencies
Human voice
Microphone
filter
Human voice’s bandwidth
Motivation
System Design
Evaluation
Application
Frequency band is limited resource
Amazon Alexa and Amazon Echo’s recordable bandwidth
10k
Frequency
20k
30k
40k
Amplitude
Higher frequencies
Human voice
Microphone
filter
Human voice’s bandwidth
Frequency band of human voice
Frequency band for sensing
Motivation
System Design
Applications
Conclusion
Amazon Alexa and Amazon Echo’s recordable bandwidth
10k
Frequency
20k
30k
40k
Amplitude
Higher frequencies
Human voice
Human voice’s bandwidth
Frequency band of human voice
Frequency band for sensing
The solution: pure-tone based ranging
Occupy a single frequency, minimizing frequency resource needed for ranging!
Motivation
System Design
Applications
Conclusion
Amazon Alexa and Amazon Echo’s recordable bandwidth
10k
Frequency
20k
30k
40k
Amplitude
Higher frequencies
Human voice
Human voice’s bandwidth
Motivation
System Design
Evaluation
Application
Occupy a single frequency, minimizing frequency resource needed for ranging!
Frequency band for sensing
The solution: pure-tone based ranging
Amazon Alexa and Amazon Echo’s recordable bandwidth
10k
Frequency
20k
30k
40k
Amplitude
Higher frequencies
Human voice
Human voice’s bandwidth
Frequency band for sensing
The solution: pure-tone based ranging
Occupy a single frequency, minimizing frequency resource needed for ranging!
Easily interfered by multipath!
Audible to users!
Motivation
System Design
Evaluation
Application
Amazon Alexa and Amazon Echo’s recordable bandwidth
10k
Frequency
20k
30k
40k
Amplitude
Higher frequencies
Human voice
Human voice’s bandwidth
Frequency band of human voice
Frequency band for sensing
The solution: pure-tone based ranging
Occupy a single frequency, minimizing frequency resource needed for ranging!
Easily interfered by multipath!
Audible to users!
Motivation
System Design
Applications
Conclusion
Multipath: Ranging error by tracking phase
Ground-truth phase rotation
Error of phase
Multipath: Ranging error by tracking phase
Ground-truth phase rotation
Vector of NLOS path
Error of phase
Multipath: Ranging error by tracking phase
Ground-truth phase rotation
Vector of NLOS path
Captured phase rotation
Error of phase
Multipath: Ranging error by tracking phase
Ground-truth phase rotation
Vector of NLOS path
Captured phase rotation
Error of phase
(b) Low-frequency signal
t = t1
t = t2
Multipath: Ranging error by tracking phase
Ground-truth phase rotation
Vector of NLOS path
Captured phase rotation
Error of phase
(b) Low-frequency signal
t = t1
t = t2
High-frequency signal
t = t1
t = t2
Tracking using high-frequency signal induces less distance error under multipath
High-frequency signal leads to smaller distance offset
Easily interfered by multipath!
Audible to users!
Motivation
System Design
Distance offset is proportional to the inverse of frequency
Error of phase
Evaluation
Application
Challenges
Motivation
System Design
Limited frequency band for tracking
High precision tracking
Multipath interference
Evaluation
Application
Cross-Frequency Continuous Wave (CFCW) Design: frequency shifting
10k
Frequency
20k
30k
40k
50k
60k
70k
80k
90k
100k
Amplitude
Inaudible
Audible
Microphone
filter
F1
Primary signal
Motivation
System Design
Evaluation
Application
Cross-Frequency Continuous Wave (CFCW) Design: frequency shifting
10k
Frequency
20k
30k
40k
50k
60k
70k
80k
90k
100k
Amplitude
Inaudible
Audible
Microphone
filter
F1
Primary signal
Whole process is inaudible for human!
Motivation
System Design
Evaluation
Application
Cross-Frequency Continuous Wave (CFCW) Design: frequency shifting
F2
10k
Frequency
20k
30k
40k
50k
60k
70k
80k
90k
100k
Amplitude
Inaudible
Audible
Microphone
filter
F1
Primary signal
Secondary
signal
Motivation
System Design
Evaluation
Application
Cross-Frequency Continuous Wave (CFCW) Design: frequency shifting
F2
10k
Frequency
20k
30k
40k
50k
60k
70k
80k
90k
100k
Amplitude
Inaudible
Audible
Microphone
filter
F1
Primary signal
Secondary
signal
Whole process is inaudible for human!
Motivation
System Design
Evaluation
Application
Cross-Frequency Continuous Wave (CFCW) Design: frequency shifting
F2
10k
Frequency
20k
30k
40k
50k
60k
70k
80k
90k
100k
Amplitude
Inaudible
Audible
Microphone
filter
F1
Primary signal
Secondary
signal
Implicit frequency shifting
Motivation
System Design
Evaluation
Application
Cross-Frequency Continuous Wave (CFCW) Design: frequency shifting
F2
10k
Frequency
20k
30k
40k
50k
60k
70k
80k
90k
100k
Amplitude
Inaudible
Audible
Microphone
filter
F1
Primary signal
Secondary
signal
Implicit frequency shifting
Whole process is inaudible for human!
Motivation
System Design
Evaluation
Application
Cross-Frequency Continuous Wave (CFCW) Design: frequency shifting
F2
10k
Frequency
20k
30k
40k
50k
60k
70k
80k
90k
100k
Amplitude
Inaudible
Audible
Microphone
filter
F1
Primary signal
Secondary
signal
Implicit frequency shifting
Whole process is inaudible for human!
Motivation
System Design
Evaluation
Application
Cross-Frequency Continuous Wave (CFCW) Design: phase mapping
Motivation
System Design
Evaluation
Application
Cross-Frequency Continuous Wave (CFCW) Design: phase mapping
Motivation
System Design
Evaluation
Application
Challenges
Motivation
System Design
Limited frequency band for tracking
High precision tracking
Multipath interference
Evaluation
Application
Cross-Frequency Continuous Wave (CFCW) Design: multipath elimination
LOS
NLOS
Environmental reflections
Motivation
System Design
Evaluation
Application
Cross-Frequency Continuous Wave (CFCW) Design: multipath elimination
LOS
NLOS
Environmental reflections
Time
LOS
NLOS
WIN-LOS
WIN-NLOS
Received signal
Motivation
System Design
Evaluation
Application
Freq. hop
Multipath noise
Primary
signal
Frequency
Time
Freq. hop
Multipath noise
Primary
signal
Frequency
Time
Freq. hop
Freq. hop
Multipath noise
Secondary
signal
Primary
signal
Frequency
Time
Frequency hopping
is transparent to target signal
Freq. hop
Freq. hop
Multipath noise
Produces
difference
signal
Implicit
mixing
Secondary
signal
Primary
signal
Frequency
Time
X
Frequency hopping
is transparent to target signal
Freq. hop
Freq. hop
Multipath noise
Produces
difference
signal
Implicit
mixing
Secondary
signal
Primary
signal
Frequency
Time
X
Multipath produces
out-of-band noise
Application-specific design to convert 3D in-air handwritings into 2D “on-paper” handwritings.
Motivation
System Design
Applications
Conclusion
Application-specific design
Motivation
System Design
3D location optimization
Pen-lift removal
Flatten writing surface
Alexa
Alexa
0 degree
60
degree
90
degree
Evaluation
Application
Application-specific design
Motivation
System Design
3D location optimization
Pen-lift removal
Flatten writing surface
0 degree
60
degree
90
degree
Evaluation
Application
Application-specific design: 3D location optimization
Motivation
System Design
Applications
Conclusion
Alexa
Application-specific design: remove pen-lifts from virtual surface
Motivation
System Design
Applications
Conclusion
Application-specific design: remove pen-lifts from virtual surface
Motivation
System Design
Applications
Conclusion
Application-specific design: flatten writing surface
Alexa
0 degree
60
degree
90
degree
Motivation
System Design
Applications
Conclusion
Motivation
System Design
Quantitative Evaluation
Speaker
Authentication
Professional Signature Verification
Evaluation
Application
1D ranging error: 73 μm
3D tracking error: 1.4 mm
Increase in speaker authentication error: 2%
Signature verification accuracy: 90%
Application 1: �Capture physical signatures on Alexa
Motivation
System Design
Evaluation
Application
Trajectory of stylus
Recovered trajectory
Motivation
System Design
Evaluation
Application
Application 1 : Capture physical signatures on Alexa
Application 2:
Handwriting tracking co-exist with voice interface
Motivation
System Design
Evaluation
Application
Trajectory of stylus
Trajectory of stylus
Recovered trajectory
Speech-to-text:
“Hi, I am drawing a star”
Trajectory of stylus
Recovered trajectory
Motivation
System Design
Evaluation
Application
ASR
Application 2 : Handwriting tracking co-exist with voice interface
Application3:
Bring Alexa to speech disabilities
Motivation
System Design
Evaluation
Application
Trajectory of stylus
Recovered trajectory
Postprocessed handwriting
Motivation
System Design
Evaluation
Application
Application 3 : Bring Alexa to speech disabilities
Thank you!
Harshvardhan Takawale
Nirupam Roy
Yang Bai
Irtaza Shahid
Check out
our paper:
Recovered trajectory
Evaluation
Ranging accuracy
Ranging accuracy
Localization accuracy
1-D ranging error of 73 micro-meter and below 1.4 millimeters of median error in 3D trajectory tracking
Motivation
System Design
Applications
Conclusion
Performance under multipath