Robustness Analysis of Adaptive Chinese Input Methods @ WTIM2011

ANALYSIS
Mike Tian-Jian Jiang, Cheng-Wei Lee, Chad Liu, Yung-Chun Chang,
Wen-Lian Hsu
Institute of Information Science, Academia Sinica
OF
ADAPTIVE
ROBUSTNESS
入力中
文

入力, Input Method (IM)
Text Entry
11/13/11IIS, Sinica 2/30

11/13/11IIS, Sinica 3/30
http://www.gadgetvenue.com/samsung-omnia-2-swype-text-entry-system-
11243013/
http://www.inquisitr.com/97719/circboard-brings-easier-text-entry-to-xbox-360-just-needs-
investors/
http://myapplenewton.blogspot.com/2010/01/text-entry-speed-face-off.html
http://research.microsoft.com/en-
us/um/redmond/groups/cue/mobileinteraction/
http://en.wikipedia.org/wiki/File:Palm_Graffiti_gestures.png
http://wol.inf.phy.cam.ac.uk/TryJavaDasherNow.html

Radical vs. Phonetic
Homographs vs. Homophones
11/13/11IIS, Sinica 4/30

Disambiguation
To predict or not
11/13/11IIS, Sinica 5/30
http://thereality.nl/potd/1552/donderdag-31-december-2009.html http://www.ehow.com/how_6788906_use-multi_tap.html http://en.wikipedia.org/wiki/File:ITap_on_Motorola_C350.jpg

HCI, NLP (, SE)
• Unified error metrics (Soukoreff
and MacKenzie, 2001)
• Error correction (Arif and
Stuerzlinger, 2010)
• Reused vocabulary (Tanaka-
Ishii et al., 2003)
• Backward compatibility (Suzuki
and Gao, 2005)
11/13/11IIS, Sinica 6/30
http://orionwell.files.wordpress.com/2007/07/hal-9000.jpg

“Prediction and spell correction can be very annoying if
they are not smart enough. For many applications, user
input can be very noisy (imagine voice recognition or
typing on a small screen), so the input methods must be
robust against such noise. Finally, there is no standard
data set or evaluation metric, which is necessary for
quantitative analysis of user input experience.”
– WTIM 2011 statements of call for
papers
11/13/11IIS, Sinica 7/30

Prediction & Adaptation
Properties of Chinese Phonetic IM
11/13/11IIS, Sinica 8/30

Adaptation via
Online Implicit User Feedback
(Online × Offline × Implicit × Explicit) user feedback
Adaptation procedure extends Tanaka-Ishii et al. (2003)
User → ambiguous source keystroke string s
IM retrieve(s ∈ D) → candidate chunks c[]; D ≡ {db ∪ profile}
IM sort(c[])
IM compose(c[]) → target string t ∝ eval(t)
User modify(t) → t’
IM adapt(t’ ∖ t) → {feedback ∪ profile}
11/13/11IIS, Sinica 9/30
http://www.johnehrenfeld.com/2009/05/be-careful-with-adaptation.html

Dilemma
Type long and (either right or wrong things) prosper
11/13/11IIS, Sinica 10/30
http://www.personal.psu.edu/afr3/blogs/SIOW/2011/10/live-long-and-prosper.html

Amortized Cost
Trade-off between benefic and cost of error correction
11/13/11IIS, Sinica 11/30

…based on
Unified Error Metrics
Related to
minimum string distance (MSD) error and
key-stroke per character (KSPC)
11/13/11IIS, Sinica
With
Fitts’ law and Hick’s law
12/30

Notations
P: presented text
T: transcribed text
IS: input stream
C: number of correct characters in T
F: keystrokes for fixing in IS like editing, modifier, or navigation.
INF: number of incorrect yet not fixed errors in T
IF: number of incorrect but fixed errors (keystrokes in IS that
are not F and not in T)
11/13/11IIS, Sinica 13/30

 P: the quick brown fox
 T: the quixck brwn fox
 MSD(P, T) = 2 here
 Only for T without editing process
11/13/11 14/30IIS, Sinica
MSD(P,T)
SA
´100%

KSPC
|IS| / |T|
11/13/11IIS, Sinica 15/30

 T: the quixck brwn fox
 T’: the quixck brown fox
 Total Error Rate(T’) = (2/18)%
 MSD Error Rate(T’) = (1/17)%
 KSPC(T’) = 19/17
11/13/11 16/30IIS, Sinica
Total Error Rate =
INF + IF
C + INF + INF
´100%
MSDError Rate =
INF
C + INF
´100%
KSPC »
C + INF + IF + F
C + INF

 t: time
 a, b: empirical constants
 d: distance to target
 w: width of target
 n: number of equal possible choices
 Index of difficulty (ID): log2((d/w) + 1)
11/13/11 17/30IIS, Sinica
tF
= a+ blog2
(
d
w
+1)
tH
= blog2
(n+1)

Error Correction Conditions
None, Forced, or Recommended conditions
No relations between typing speed and correction attempt
Spectrum of Recommended condition
11/13/11IIS, Sinica
Situation Fixed
characters
INF IF F
S0 none INF0 0 0
Si some INFi IFi Fi
Sall all 0 IFall Fall
18/30

11/13/11 19/30IIS, Sinica
AC =
Wasted Bandwidth
Utilized Bandwidth
=
INF + IF + F
C + INF + IF + F
C
C + INF + IF + F
=
INF + IF + F
C
INF0
C
£ ACi
=
INFi
+ IFi
+ Fi
C
£
IFall
C
+
Fall
C
=
INF0
C
+
Fall
C
penalty =
tH
´ INF0
+tF
´max(D)
C + INF0
reward =
C
C + INF0
ACmodification
=
penalty
reward
=
tH
´ INF0
+tF
´max(D)
C
MAC =
INF0
C
+ ACmodification

Vocabulary Reuse
70 – 80 % vocabulary reused only after a small offset window in KB
(such that simulations of typing repeatedly are representative
enough)
11/13/11IIS, Sinica 20/30

Backward Compatibility
Error Ratio = |errors by adaptation| / |corrections by adaptation|
11/13/11IIS, Sinica 21/30

… however,
Error correction can be complicated.
11/13/11IIS, Sinica 22/30

Input
Correction
Process of input Process of Correction
[User corrects]
[User verify]
[User doesn’t verify]
[There are errors]
[User doesn’t correct]
[User corrects]
[There are no errors]
[User verify]
[User doesn’t correct]
[User doesn’t verify]
[There are no errors]
i
onverificati
[There are errors]
c
onverificati
hi
error
si
error
,,
 
i
correction
c
correction
hc
error
sc
error
,,
 
c
correction
c
onverificati
i
correction
i
onverificati
h
correction  
hc
error
sc
error
hi
error
si
error
h
error
s
error
,,,,
 

Simulation
3 IMs A, B, and C
Phonetic method
Bopomofo (Zhuyin)
Daqian keyboard layout
Data
Academia Sinica Balanced Corpus
4,000 sentences
39,469 words
Independent variables
Context length k
ρh
correction
11/13/11IIS, Sinica 24/30

Comparison of MAC
IM-A seems to be different to others?
11/13/11IIS, Sinica 25/30

GBC at Context Length 6
Again, IM-A is segregated……
11/13/11IIS, Sinica 26/30

…more aspects wanted
Than this V curve…
11/13/11IIS, Sinica 27/30

Error Tolerance
Level
• Futile Effort (Ef): |never adapted chunks|
• Beneficial Effort (Eb):|adapted chunks|
• Utility (U):|before forgotten|
• Eb(IM-A) vs. Eb(IM-B)
• “his” or “her” or “its”?
• 他 or 她 or 它
• /ta1/
11/13/11IIS, Sinica
Correlation Coefficient to CAR
Ef
avg Eb
avg Uavg
IM-A 0.49 0.92 0.66
IM-B -0.78 -0.62 -0.51
28/30

How about a shared task?
Just my humble suggestion :p
11/13/11IIS, Sinica 29/30
http://xkcd.com/114/

Thank YOU…?
Or do we have time for…
11/13/11IIS, Sinica 30/30

Many MORE Things…
11/13/11IIS, Sinica 31/30

zhi1-shi4
tou2-shi4
dao4-shi4
fang1-shi4
chang2-shi4
yi4-shi4
zhi4-shi4
shi4-wei4
shi4-li4
shi4-shang4
shi4-gu4
shi4-yi2
shi4-ji1
shi4-zhong1
11/13/11IIS, Sinica 32/30

Reduced n-gram
British Rail Enquiries
P(Enquiries|British, Rail)
P(Enquiries|Rail)
11/13/11IIS, Sinica 33/30

OpenVanilla
11/13/11IIS, Sinica 34/30

SearchTyping
http://www.youtube.com/watch?v=jgl23E6wzVA
http://www.youtube.com/watch?v=xJsXaPe_f8c
11/13/11IIS, Sinica 35/30

Thank YOU!
Any question or comment?
11/13/11IIS, Sinica 36/30

Robustness Analysis of Adaptive Chinese Input Methods @ WTIM2011

Recommended

Recommended

More Related Content

Recently uploaded

Recently uploaded (20)

Featured

Featured (20)

Robustness Analysis of Adaptive Chinese Input Methods @ WTIM2011