This document describes research into using pen and voice input for drawing system configurations. It presents an approach called "TalkingDraw" that allows users to draw diagrams and insert elements while speaking to describe configurations. Two experiments tested different techniques for switching between drawing freehand and having inputs be recognized as commands. The "pigtail" technique, where users draw a curl at the end of a stroke, performed best as it allowed natural drawing and talking without disrupting either. The document discusses opportunities to improve gesture recognition and address concurrent voice and pen inputs.
1. Drawing in Talking:
Using Pen and Voice for Drawing System Configuration
Figures in Talking
Research and Technology Department
Xingya Xu (xingya.xu@fujixerox.co.jp)
December 8, 2017
IDW/AD’ 17
December 6-8
Sendai, Japan
INP7/UXC6 - 2
Fuji Xerox Co., Ltd.
2. Drawing in talking vs Making in advance
(Figure: a hand-drawn configuration with Shop, Server, Database, and Cloud)
Drawing by hand
• Quick and easy
• Interact with listeners actively
Making in advance
• Neat and precise
• Well-designed icons and graphs
3. Purpose
How to draw system configuration figures easily and quickly?
• Support drawing quickly and easily
How to draw system configurations while talking in real time?
• Support drawing in talking
(Figure: the example configuration with Shop, Server, Database, and Cloud)
4. To draw quickly and easily
Approach 1: Multimodal input
Make use of different input modalities such as touch, pen, and speech in an integrated manner.
The strength of pen
• Talking or thinking during drawing
• Expresses the position and shape of objects
The strength of voice
• Expresses linguistic information
(Figure: examples. a) a circle plus "PC" inserts an icon; b) a line plus "smartphone" inserts text)
5. Previous Research
Put-That-There
A user sitting in a chair can move an object by pointing to it and saying "move that there".
Bolt, R.A. Put-that-there: Voice and gesture at the graphics interface. ACM Computer Graphics 14, 3 (1980), 262–270.
6. Previous Research
The problem of Put-That-There
In the talking-and-drawing case, voice has two roles:
• to convey messages to the listeners
• to issue commands to the system
This causes unintentional system behaviors when the speaker talks to the listeners.
(Diagram: the speaker's voice carries messages to the listeners and commands to the system)
7. To draw in talking smoothly and naturally
Approach 2: Free mode and command mode
• In the free mode, pen or speech input is not considered a command.
• In the command mode, inputs are considered part of a command.
Smooth mode switching: switch between the free mode and the command mode smoothly, without disturbing talking.
8. Approach 2: Mode switch techniques
• Button: a basic technique. Press a button before and after drawing (start: click; end: click).
• Tap: tap the panel before drawing (start: tap; end: automatic). No need to specify the end, but the hand-holding posture must change.
• Pen-holding: hold the pen still for a while before drawing (start: hold the pen; end: automatic). No need to change the hand-holding posture.
• Pigtail: draw a pigtail at the end of the stroke (start: automatic; end: pigtail).
(Figure: pigtail gesture examples)
9. System implementation
Design
TalkingDraw: a prototype system written in C# on a Surface Pro 3 with a Surface Pen.
Speech recognition
• Recognizes users' speech during the command mode
Pen stroke recognition
• Recognizes the shape of users' pen strokes when the command ends
• $P Point-Cloud Recognizer (R.D. Vatavu et al., 2012)
(Timeline figure: talking and drawing overlap; only voice during the command is recognized; a 0.5 s delay marks the end)
The command is automatically ended if no pen or voice input is detected within a 0.5 s break.
10. Elements of system configurations
Design
Shape of a stroke plus text of voice determines the behavior of TalkingDraw:
• Circle + "PC": input an icon whose name is "PC"
• Rectangle + "cloud": show "cloud" in a text box
• Line + "smartphone": show "smartphone" as simple text
• Line connecting two objects, no voice: make a link between the two objects
12. Experiment 1
Participants: 16 people (12 males and 4 females, average age 48.1)
Scenario: TalkingDraw used as a drawing tool while talking.
Task: participants must speak a given sentence and insert icons while speaking (a talking-in-drawing task).
Example task sentence (Practice 1): "Let's download the 'document' from the 'net'."
(Figure: the task sentence and the icons to be inserted)
13. Experiment 1: Results
Task completion time
• One-way ANOVA: the main effect of technique was significant (F(3,45)=6.39, p<.01).
• Tukey's method: Pigtail = Tap << Pen-holding = Button
Interview
• Pigtail was comfortable, even though the accuracy of its gesture recognition drew complaints.
• Pressing the button twice was a pain.
• It is hard to hold the pen still on the screen.
(Chart (a): task completion time in seconds for Button, Tap, Pen-holding, and Pigtail)
14. Experiment 2
Participants: 16 people (12 males and 4 females, average age 48.1)
Scenario: TalkingDraw used as a drawing tool for system configuration figures.
Task: participants must draw a given figure (a making-in-advance task).
(Example figure: a "mobile phone" uploads "photos" to a "database")
15. Experiment 2: Results
Task completion time
• One-way ANOVA: the main effect of technique was significant (F(3,45)=5.22, p<.004).
• Tukey's method: Tap = Pigtail = Pen-holding < Button
Interview
• There is no big difference between techniques.
• Button was more comfortable than in Experiment 1.
(Chart (b): task completion time in seconds for Button, Tap, Pen-holding, and Pigtail)
16. Discussion
Pigtail performed best in Experiment 1
• It specifies the command mode after actions.
• The accuracy of pen gesture recognition needs improvement.
No big difference in Experiment 2
• Participants don't need to think while drawing a figure.
• Techniques that specify the command mode before actions perform better than in Experiment 1.
17. Future work
The accuracy of Pigtail recognition
• More samples
• Normalization
The accuracy of speech recognition
• Google Cloud speech recognition
Context sensitivity
• The voice input and the drawing are not concurrent
• Timestamps
• Semantic analysis
(Timeline figure: voice and pen input over the command duration, with key content and noise)
The left one is a figure I drew by hand in five seconds to explain a fictional cloud service. We usually draw many such rough figures in discussions, brainstorming, and so on. The advantages of drawing by hand are…. The right one is a figure I made in PowerPoint. Compared to the left one, it is neat and precise. I can also use well-designed icons and graphs.
So how can we draw quickly and easily? Our approach is to use multimodal input, which means that…. The strength of pen is that we can talk or think during drawing, and pen is good at expressing the position and shape of objects. The strength of voice is expressing linguistic information; we can talk much faster than we can draw or write.
For example, I draw a circle and say something like "Here is a PC". Then the system can get the position and shape of the object from the circle, and get the type of the icon, which is a 'PC'. Actually, if PowerPoint were clever enough, it could input a PC icon here. Similarly, if I draw a line and say 'smartphone', a text is inserted here.
There is some previous research on multimodal input. Put-That-There is a voice-and-gesture system for inputting and modifying objects. As this picture shows, …
In the talking-and-drawing case…
The problem with Put-That-There is that, when users are just freely talking and drawing, it may cause unintentional behaviors such as inserting wrong objects or sending unintended commands.
For example, if I am saying something about a PC while drawing a circle, the system may insert a PC icon accidentally. So we introduced two modes. The first is the free mode; in the free mode…. The other is the command mode; in the command mode….
Then how to switch between…
We explored four mode switch techniques.
A basic technique is pressing a button to specify the start and the end of the command mode.
However, pressing a button twice may be a pain for users. Therefore, we introduced the Tap technique, where users tap a panel before drawing to start the command mode. Users do not have to specify the end of the command mode; the system judges the end automatically by recognizing a break in drawing and talking.
However, in the Tap technique, users must change the holding posture of their hand to draw with the pen after tapping with a finger. To lessen this problem, we introduced the Pen-holding technique, where users keep the pen still for a short period of time to start the command mode. In this technique, users do not have to change their hand posture.
These three techniques require users to specify the mode switch before entering the command mode. However, specifying the mode before acting might be difficult for users and might disturb natural talking, because users must judge which mode to choose before drawing or speaking.
Therefore, we prepared another technique, the Pigtail technique. Users do not have to specify anything at the start of their actions; instead, they specify the command mode at the end of drawing with a special gesture, a crossed curve called a pigtail. In this technique, users need not care about the mode while talking and drawing; they specify whether an action was a command after performing it.
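The pigtail is a crossed curve at the tail of a stroke, so it can be detected as a self-intersection near the stroke's end. The following is a minimal sketch of that idea, not the prototype's actual C# code; the function names, the tail window of 10 segments, and the point format are all assumptions.

```python
# Hypothetical sketch: detect a "pigtail" ending by checking whether one of
# the last few stroke segments crosses an earlier, non-adjacent segment.

def _segments_intersect(p1, p2, p3, p4):
    """Return True if segment p1-p2 properly crosses segment p3-p4."""
    def cross(o, a, b):
        return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])
    d1 = cross(p3, p4, p1)
    d2 = cross(p3, p4, p2)
    d3 = cross(p1, p2, p3)
    d4 = cross(p1, p2, p4)
    # The two endpoints of each segment lie on opposite sides of the other.
    return ((d1 > 0) != (d2 > 0)) and ((d3 > 0) != (d4 > 0))

def ends_with_pigtail(points, tail=10):
    """True if one of the last `tail` segments crosses an earlier segment."""
    n = len(points)
    for i in range(max(1, n - tail), n - 1):   # segments near the stroke end
        for j in range(0, i - 1):              # earlier, non-adjacent segments
            if _segments_intersect(points[i], points[i + 1],
                                   points[j], points[j + 1]):
                return True
    return False
```

A stroke that ends in a loop crossing its own path returns True; a straight stroke returns False. A production version would also want to limit the loop's size so large self-crossing drawings are not misread as pigtails.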
We built a prototype system using C# on a Surface Pro 3 with a Surface Pen. The speech recognition engine recognizes users' speech during the command mode, and we used an open-source pen gesture recognizer, the $P Point-Cloud Recognizer, to recognize the shape of users' pen strokes once the command is completed.
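The core of the $P recognizer is treating a stroke as an unordered point cloud, normalizing it, and greedily matching its points against template clouds. This is a simplified sketch of that matching step (it assumes stroke and templates already have the same number of points and skips $P's resampling and bidirectional matching), not the recognizer's reference implementation:

```python
# Hypothetical, simplified sketch of $P-style point-cloud matching
# (after Vatavu et al. 2012). Assumes equal-sized point clouds.
import math

def _normalize(points):
    """Scale the cloud to a unit bounding box and center it on its centroid."""
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    scale = max(max(xs) - min(xs), max(ys) - min(ys)) or 1.0
    cx = sum(xs) / len(xs)
    cy = sum(ys) / len(ys)
    return [((x - cx) / scale, (y - cy) / scale) for x, y in points]

def _cloud_distance(a, b):
    """Greedily match each point of a to the nearest unmatched point of b,
    weighting earlier matches more heavily."""
    n = len(a)
    matched = [False] * n
    total = 0.0
    for i, p in enumerate(a):
        best_j, best_d = -1, float("inf")
        for j, q in enumerate(b):
            if not matched[j]:
                d = math.dist(p, q)
                if d < best_d:
                    best_j, best_d = j, d
        matched[best_j] = True
        total += best_d * (1 - i / n)
    return total

def recognize(stroke, templates):
    """Return the name of the template cloud closest to the stroke."""
    s = _normalize(stroke)
    return min(templates,
               key=lambda name: _cloud_distance(s, _normalize(templates[name])))
```

With circle, rectangle, and line templates, `recognize` would pick the closest shape for a finished command stroke; the real $P also resamples strokes to a fixed point count and matches in both directions.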
This figure shows how it works. The command starts when the user starts to draw, and it ends when no pen or voice input is detected within a 0.5 s break.
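That end-of-command rule can be sketched as a small segmenter that collects pen and voice events and closes the command after a silent gap. The class name, event format, and `feed` API are assumptions for illustration; the paper only specifies the 0.5 s threshold.

```python
# Hypothetical sketch: segment pen/voice events into commands using a
# 0.5 s no-input break, as TalkingDraw does.
import time

class CommandSegmenter:
    """Groups pen and voice events into one command; the command ends when
    no input of either kind arrives for `timeout` seconds."""

    def __init__(self, timeout=0.5):
        self.timeout = timeout
        self.events = []        # events of the command in progress
        self.last_input = None  # timestamp of the most recent input

    def feed(self, kind, payload, t=None):
        """Register an input event; return the finished command's events
        if the gap since the last input exceeded the timeout, else None."""
        t = time.monotonic() if t is None else t
        finished = None
        if self.last_input is not None and t - self.last_input > self.timeout:
            finished = self.events  # the previous command ended at the break
            self.events = []
        self.events.append((kind, payload, t))
        self.last_input = t
        return finished
```

In a real system a timer would also fire during true silence (rather than waiting for the next event), but the gap logic is the same.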
In the current system, we recognize three kinds of stroke shapes. The first is a circle: when I draw a circle here and say "PC", a PC icon appears here. The second is …. The third is …. In particular, if a line connects two objects, it becomes a link.
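The shape-plus-voice rules from slide 10 amount to a small dispatch table. A minimal sketch, with function and parameter names that are assumptions rather than TalkingDraw's API:

```python
# Hypothetical sketch of TalkingDraw's stroke + voice dispatch, following
# the rules on slide 10 (circle -> icon, rectangle -> text box,
# line -> text, line between objects -> link).
def dispatch(shape, text=None, connects_two_objects=False):
    """Map a recognized stroke shape and the recognized speech to a behavior."""
    if shape == "circle" and text:
        return f"insert icon named '{text}'"
    if shape == "rectangle" and text:
        return f"show '{text}' in a text box"
    if shape == "line":
        if connects_two_objects:
            return "make a link between the two objects"
        if text:
            return f"show '{text}' as simple text"
    return "no action"
```

For example, a circle drawn while saying "PC" yields the icon action, while a line whose endpoints land on two existing objects yields a link even with no speech.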
This is a demonstration video showing how TalkingDraw can be used in an elementary school class. We used Pigtail in this video.
In the first experiment, we evaluated the four techniques in a talking-in-drawing task. Participants had to speak a given sentence and insert icons while speaking.
This graph shows the task completion time for each technique. We found that Pigtail and Tap are much faster than Pen-holding and Button. Participants also reported that Pigtail was comfortable, even though they complained about the accuracy of its gesture recognition. In contrast, they found that pressing the button twice was a pain, and that it is hard to hold the pen still on the screen.
In the second experiment, we evaluated the four techniques in a making-in-advance task. Participants had to draw a given figure like this one.
We found that Button is still the slowest, but there is no big difference among the other techniques. Participants also reported that Button was more comfortable than in Experiment 1.
We found that Pigtail performed best in Experiment 1. We think the reason is that it specifies the command mode after actions, so users do not need to think while drawing. We also need to improve the accuracy of pen gesture recognition for Pigtail, which affected its performance in the experiment.
No big difference was found in Experiment 2. Because participants do not need to think while drawing a figure, the techniques that specify the command mode before actions performed better than in Experiment 1.
Finally, the future work. The accuracy of Pigtail recognition and the accuracy of Japanese speech recognition can still be improved. Furthermore, we found that the voice input and the drawing are not concurrent. For example, if I want to insert a PC icon, I may draw the circle before I say "PC" in the sentence. This is a problem we need to address in the next experiment.