A Study on Handwritten Chinese Character Synthesis
using Generative Networks
Graduate School of System and Engineering
Advisor: Hidemoto Nakada
Author: Liangyu Liu
Overview
• Chinese character synthesis is challenging: a huge number of characters
and fonts, and complex character structure.
• Zi2Zi is a powerful model for printed-type Chinese
character synthesis, but underwhelming at
generating handwriting.
Overview
Badly formed
handwritings
Overview
• We aim to improve zi2zi on Chinese handwriting
character synthesis using three new training
methods.
• Results show that our methods yield better image
quality, and that learning from easy to hard tasks
with curriculum learning improves the learning
outcome.
Overview
Badly formed
handwritings
Contents
01 Background
02 Method
03 Experiment and Result
04 Conclusion
1 Background
Background
• Numerous characters.
• Complex structure.
• Only a few handwriting samples are available for a given font.
• A sample shows the difference between Chinese
characters and alphabets.
Challenges of Chinese
character synthesis
Background
• Re-purpose a well-trained
model for another related
task.
• Train faster and more effectively.
• Improve performance on another
related task by fine-tuning.
Transfer Learning
Taken from https://ruder.io/transfer-learning/index.html
Background
• Based on a conditional GAN for image-to-image translation.
• Main loss functions: Adversarial loss and L1 loss.
• Discriminator to distinguish
whether the images are
real or fake.
• Generator to synthesize
more realistic images.
Pix2Pix
Taken from Image-to-Image Translation with Conditional Adversarial Networks
https://arxiv.org/abs/1611.07004
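The two pix2pix losses above can be sketched in a few lines of plain Python. This is only an illustration: the helper names are made up here, and the λ weight follows the pix2pix paper's default of 100 rather than anything stated on the slides.

```python
import math

def l1_loss(generated, target):
    """Mean absolute pixel difference between two flattened images."""
    n = len(generated)
    return sum(abs(g, ) if False else abs(g - t) for g, t in zip(generated, target)) / n

def generator_loss(d_score_on_fake, generated, target, lam=100.0):
    """Pix2pix generator objective: fool the discriminator + stay close in L1.

    d_score_on_fake: discriminator's probability that the generated image is real.
    lam: L1 weight (the pix2pix paper uses lambda = 100).
    """
    adversarial = -math.log(d_score_on_fake + 1e-12)  # non-saturating GAN loss
    return adversarial + lam * l1_loss(generated, target)
```

Both terms pull in the same direction: the adversarial term rewards realism, while the L1 term keeps the output pixel-wise close to the target, which is what sharpens character strokes.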
Background
• An encoder-decoder model
based on a fully convolutional
network.
• Skip connections between the
encoding layers and the
corresponding decoding layers.
U-Net
Taken from U-Net: Convolutional Networks for Biomedical Image Segmentation
https://arxiv.org/pdf/1505.04597
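The skip-connection idea can be illustrated with a toy forward pass. This is a sketch only: a real U-Net concatenates feature maps along the channel axis and learns convolutional encode/decode blocks, whereas here those are stand-in callables.

```python
def unet_forward(x, encode, decode, bottleneck, depth=4):
    """Toy U-Net data flow: save each encoder output and merge it back
    into the decoder stage at the same depth (the skip connection)."""
    skips = []
    for _ in range(depth):
        x = encode(x)
        skips.append(x)           # remember this resolution level
    x = bottleneck(x)
    for skip in reversed(skips):  # deepest skip feeds the first decoder step
        x = decode(x + skip)      # '+' stands in for channel concatenation
    return x
```

Even with identity-like stand-ins, every decoder step receives the matching encoder output; in the real network this is what lets spatial detail lost during down-sampling be recovered.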
Background
• Based on the pix2pix model, targeting Chinese character synthesis.
• Learns multiple fonts at the same time.
• A non-trainable Gaussian noise embedding maps each character to its
corresponding style.
• A multi-class category loss predicts the style of each character, to
keep multiple styles from being confused.
• A constant loss makes the generated character resemble the
source.
• Good results on printed-font synthesis, but underwhelming at
generating handwriting.
Zi2Zi
Badly formed
result
Background
Architecture of Zi2Zi
• Main loss functions:
• G loss, composed of L1 loss,
category loss and constant
loss.
• D loss, composed of real/fake
(adversarial) loss and category loss.
• L1 loss measures the
difference between
generated and real images.
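The loss composition on this slide can be written out directly as a sketch. The individual terms are assumed to be computed elsewhere, and the weights below are illustrative defaults, not the values used in this study.

```python
def g_loss(l1, category, constant, adversarial=0.0,
           l1_w=100.0, const_w=15.0):
    """Generator loss per the slide: adversarial 'cheat' term plus
    weighted L1, category and constant losses. Weights are illustrative."""
    return adversarial + l1_w * l1 + category + const_w * constant

def d_loss(real_fake, category):
    """Discriminator loss per the slide: real/fake loss + category loss."""
    return real_fake + category
```

Keeping the category loss in both objectives is what forces each font's style embedding to stay distinguishable while many fonts are trained together.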
Background
• Training process:
• Prepare many fonts, sample
characters from each font, and render
each sampled character in the source
font to form a pair.
• Train on all the paired images
together at the same time.
• Test process:
• Select the test font by designating
its embedding id.
Process of Zi2Zi
Paired images for
different fonts
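The pairing step above can be sketched as follows. `render` and `sample` are hypothetical helpers (a glyph rasterizer and a character sampler) standing in for the real data pipeline; SIMSUN is the source font used in this study.

```python
def make_pairs(fonts, sample, render):
    """Build (embedding_id, source_image, target_image) training triples:
    each sampled character is drawn in both the source font and the
    target font, tagged with the target font's embedding id.

    render(font, ch) is a hypothetical rasterizer returning an image.
    sample(font) is a hypothetical sampler returning a character list.
    """
    pairs = []
    for emb_id, font in enumerate(fonts):
        for ch in sample(font):
            pairs.append((emb_id, render("SIMSUN", ch), render(font, ch)))
    return pairs
```

At test time, the same embedding id selects which learned style the generator should reproduce.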
Background
• Evaluate the quality and similarity of images.
• SSIM (Structural Similarity Index):
• Perceived similarity between two given images.
• Ranges from 0 to 1; the larger, the more similar.
• PSNR (Peak Signal-to-Noise Ratio):
• Quality of the generated image; the larger, the clearer.
Image quality assessment
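Both metrics can be computed directly. Below is a minimal plain-Python sketch over flattened pixel lists; note that standard SSIM implementations average this same formula over local sliding windows rather than taking one global window.

```python
import math

def psnr(x, y, max_val=255.0):
    """Peak signal-to-noise ratio in dB; larger means less distortion."""
    mse = sum((a - b) ** 2 for a, b in zip(x, y)) / len(x)
    if mse == 0:
        return float("inf")  # identical images
    return 10.0 * math.log10(max_val ** 2 / mse)

def ssim_global(x, y, max_val=255.0):
    """Single-window (global) SSIM over two flattened images."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    vx = sum((a - mx) ** 2 for a in x) / n
    vy = sum((b - my) ** 2 for b in y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y)) / n
    c1, c2 = (0.01 * max_val) ** 2, (0.03 * max_val) ** 2  # stabilizers
    return ((2 * mx * my + c1) * (2 * cov + c2)) / \
           ((mx ** 2 + my ** 2 + c1) * (vx + vy + c2))
```

Identical images give SSIM = 1 and an infinite PSNR; heavily blurred generations lower both, which is why the two metrics are reported side by side in the experiments.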
2 Method
Method
• Different stroke weights among printed type,
hard-pen and brush calligraphy.
• The same component (radical) appears many times
within one font.
• Highly personal, stylized fonts.
Hypothesized causes of the badly
formed handwriting from Zi2Zi
Badly formed
samples
Method
1. Using all hard-pen handwritings
for training.
2. Reducing characters that share a
common radical within the same font.
3. Training with less stylized fonts,
then fine-tuning with more stylized
fonts.
Training methods
(Samples: hard-pen, printed type, brush calligraphy)
Method
• Different stroke weights between
fonts have a critical influence on training.
• It is better to concentrate on learning one
similar type of handwriting.
• Hard-pen handwritings tend to have a
clear structure because of their light
stroke weight.
1. Using all hard-pen handwritings
(Samples: hard-pen, printed type, brush calligraphy)
Method
2. Reducing characters that share
a common radical
• The same radical looks similar across
characters within one hard-pen
handwriting font, but differs between
hard-pen handwriting fonts.
• Learning an excessively repeated
radical within the same font is less
effective than learning other, entirely
different characters.
(Samples: the same radical across two fonts, Font I and Font II)
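Method 2 amounts to a simple per-font filter over the character list. In the sketch below, `radical_of` is a hypothetical lookup table from character to radical; in practice it would come from a character-decomposition table, which the slides do not specify.

```python
def cap_radical(chars, radical_of, radical, cap):
    """Keep at most `cap` characters containing `radical` for one font;
    all other characters pass through unchanged.

    radical_of: hypothetical mapping char -> radical.
    """
    kept, seen = [], 0
    for ch in chars:
        if radical_of.get(ch) == radical:
            if seen >= cap:
                continue  # drop the over-represented radical
            seen += 1
        kept.append(ch)
    return kept
```

Applied per font, this trades repeated exposure to one radical for coverage of more distinct characters, which is the effect Experiment II tests.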
Method
• Handwritings differ greatly because of
personal writing habits.
• Curriculum learning: learn from easy
to hard tasks.
• Train on the more printed-like
handwritings first, then fine-tune with the
more personally stylized handwritings.
3. Training from less stylized fonts
to more stylized fonts
(Samples ordered from easy to normal to hard)
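Method 3 is a two-stage training schedule. A minimal sketch, with stage names and epoch counts chosen for illustration only:

```python
def curriculum_schedule(easy_fonts, hard_fonts,
                        pretrain_epochs, finetune_epochs):
    """Yield (stage, epoch, fonts) steps: first train only on the easy
    (printed-like) fonts, then fine-tune on the hard (stylized) ones."""
    for e in range(pretrain_epochs):
        yield ("train", e, easy_fonts)
    for e in range(finetune_epochs):
        yield ("fine-tune", e, hard_fonts)
```

Because fine-tuning starts from weights already fitted to the easy fonts, the model only has to learn the stylistic deviations of the hard fonts, which is the curriculum effect evaluated in Experiment III.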
3 Experiment and Result
Experiment and Result
• Three types of handwriting fonts are prepared: printed,
hard-pen and brush calligraphy.
• Font format: TrueType.
• Source font: SIMSUN.
• A sample of SIMSUN in a TrueType file is shown.
Dataset
Experiment and Result
• Experiment I: design
• Mixed: use a mixed dataset of printed, hard-pen and
brush calligraphy fonts for training.
• Hard-pen: use only hard-pen handwriting fonts for training.
• Test: 5 hard-pen fonts, 3 characters per font.
Experiment and Result
• Experiment II: design
• Original: 30 fonts, 500 characters per font, including
25 characters with the common radical 亻 (475 other characters each).
• Reduced: 50 fonts, 300 characters per font, including
15 characters with the common radical 亻 (285 other characters each).
• Test: 5 hard-pen fonts, 5 characters per font.
Experiment and Result
• Experiment III: design
• Normal: mix 25 printed-like handwriting fonts and 5 more
stylized fonts together for training.
• Easy to hard (e2h): train with the 25 printed-like handwriting
fonts, then fine-tune with the 5 more stylized fonts.
• Test: 5 hard-pen fonts, 5 characters per font.
Experiment and Result
Result of experiment I

Method     SSIM    PSNR
Mixed      0.401   8.791
Hard-pen   0.387   8.861

(Figures: d_loss, g_loss and L1_loss curves; generated samples
compared with ground truth for the mixed and hard-pen settings.)
Experiment and Result
Result of experiment II

Method     SSIM    PSNR
Original   0.434   9.643
Reduced    0.430   9.912

(Figures: d_loss, g_loss and L1_loss curves; generated samples
compared with ground truth for the original and reduced settings.)
Experiment and Result
Result of experiment III

Method     SSIM    PSNR
Normal     0.541   10.489
E2h        0.553   10.638

(Figures: d_loss, g_loss and L1_loss curves; generated samples
compared with ground truth for the normal and e2h settings.)
Experiment and Result
• Our methods achieve higher image quality; the generated samples are less blurred.
• Training from easy to hard tasks with curriculum learning is effective for
Chinese handwriting character synthesis.
• Similarity (SSIM) is not improved by the first two methods; possible reasons:
• Strokes differ more significantly among handwriting fonts than among
mixed-type fonts.
• The selected radical has fewer strokes and a simpler structure than other
components.
Discussion
4 Conclusion
Conclusion
• The initial results show that our training methods produce less blurred
generated images and improve image quality.
• Learning from easy to hard tasks with curriculum learning is effective
in improving zi2zi's learning outcome on Chinese handwriting synthesis
tasks.
• As future work, we want to use the model to create TrueType fonts:
given only a few samples, generate all the remaining characters.
Conclusion and future work
Editor's Notes

  1. Hello everyone. Today I want to introduce our study on Handwritten Chinese Character Synthesis using Generative Networks 
  2. Here is a brief introduction of our work We focus on a Chinese handwriting characters synthesis task. This task is thought to be challenging because of a large amount of characters and fonts in Chinese, and also the complex structure for most Chinese characters. In our previous research, we found a generative model called zi2zi get a good result in generating printed type Chinese characters but underwhelming in handwriting synthesis.
  3. We aim at improving this model on handwriting synthesis tasks, by proposing three new training methods. The initial result shows that we can improve the image quality using our methods. And learning from easy to hard tasks using curriculum learning can also improve learning effect.
  4. Here are the contents of this presentation.
  5. Let’s begin with the first part, the background
  6. We summarize the challenges of Chinese character synthesis as show. Because of the numerous characters and complex structure of Chinese characters, it is impossible for a font designer to design all the characters. Generally, they only design a few characters, for a poster or their artwork. But sometimes, we are interested in what other characters would be like. Dealing with such issue, we assumes that we can use a generative model to infer all the other characters by given a few samples of a certain font. Meanwhile, there are a large amount of fonts for Chinese, so we also expect our model can learn multiple fonts at the same time.
  7. Next, I want to talk about transfer learning. The main method of transfer learning is as the figure shows, we transfer the knowledge accumulated from past training and apply it to a different problem. Basically, we re-purpose a well-trained model onto another related tasks. In this way we can effectively reduce the training time and source cost. Moreover, we can improve the performance of the original model by fine-tuning
  8. Pix2pix is a conditional-GAN-based model: the generation of the output image is conditioned on the source image. There are two main loss functions in pix2pix, an adversarial loss and an L1 loss. The adversarial loss comes from the discriminator's judgment, while the L1 loss measures the pixel-wise difference between the generated image and the expected output image. Together, both losses encourage the generator to produce more realistic images. During training, both generated and real images are shown to the discriminator, which is trained to distinguish whether the images are real or fake, while the generator is trained to synthesize more realistic images.
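The pix2pix generator objective described above can be sketched in a few lines. This is a minimal illustrative NumPy version, not the actual implementation: the function name and the default `l1_weight` of 100 (the lambda used in the pix2pix paper) are assumptions for the sketch.

```python
import numpy as np

def generator_loss(d_fake_logits, generated, target, l1_weight=100.0):
    """Illustrative pix2pix generator objective: adversarial term + weighted L1.

    d_fake_logits     : discriminator logits on the generated images
    generated, target : image arrays with pixel values in [0, 1]
    l1_weight         : lambda weighting the L1 term (assumed, as in pix2pix)
    """
    # Non-saturating adversarial term: softplus(-x) = -log(sigmoid(x)),
    # which pushes the discriminator's output on fakes toward "real".
    adv = np.mean(np.log1p(np.exp(-d_fake_logits)))
    # L1 term: pixel-wise distance to the expected output image.
    l1 = np.mean(np.abs(generated - target))
    return adv + l1_weight * l1
```

When the generated image exactly matches the target, only the adversarial term remains, so the generator is still rewarded for fooling the discriminator.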
  9. The most distinctive part of pix2pix is the generator, an encoder-decoder model with a novel architecture named U-Net. In a U-Net, skip-connections are added between each encoding layer and its corresponding decoding layer. U-Net takes a source image, encodes it by down-sampling to a bottleneck layer, and then decodes the bottleneck representation by up-sampling to generate the target image. The encoder path and decoder path resemble the left and right sides of a capital U.
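The U-shaped data flow with skip-connections can be illustrated with a toy NumPy sketch. This is not the real U-Net (convolutions and learned weights are omitted); it only shows how an encoder feature map is concatenated onto the matching decoder feature map, which is the skip-connection idea.

```python
import numpy as np

def downsample(x):
    """2x average pooling over an (H, W, C) feature map (stand-in for a conv + stride)."""
    h, w, c = x.shape
    return x.reshape(h // 2, 2, w // 2, 2, c).mean(axis=(1, 3))

def upsample(x):
    """2x nearest-neighbour upsampling of an (H, W, C) feature map."""
    return x.repeat(2, axis=0).repeat(2, axis=1)

def unet_like(x):
    """Toy U-shaped pass: encode to a bottleneck, decode back, and
    concatenate each encoder feature with the matching decoder feature."""
    e1 = downsample(x)                       # encoder level 1
    bottleneck = downsample(e1)              # bottom of the "U"
    d1 = upsample(bottleneck)                # decoder level 1
    d1 = np.concatenate([d1, e1], axis=-1)   # skip connection: channels double
    return upsample(d1)
```

The skip-connection lets fine spatial detail from the encoder bypass the bottleneck, which is why U-Net outputs stay sharp.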
  10. zi2zi is based on the pix2pix model for Chinese character synthesis. Its most important capability is learning multiple fonts at the same time. To achieve this, it applies a non-trainable Gaussian noise vector as a style embedding to map characters to their corresponding styles; the embedding also shrinks the network through dimensionality reduction, since the character bitmaps are sparse and would otherwise require more layers. To keep the model from confusing and mixing styles, a multi-class category loss supervises the discriminator to predict the style of each character and penalize such mixing. Considering that the generated character should resemble the source, a constant loss is also added: the two must appear close to each other in the embedded space, which narrows down the possible search space. zi2zi gets good results on printed-font synthesis; however, many samples are badly formed when it generates handwritings.
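The style-embedding mechanism can be sketched as a lookup table of Gaussian vectors, one row per font, concatenated onto the encoder bottleneck. This is a simplified illustration; the table size, embedding dimension, and function names are assumptions, not zi2zi's actual configuration.

```python
import numpy as np

rng = np.random.default_rng(0)
NUM_FONTS, EMB_DIM = 40, 128   # illustrative sizes, not zi2zi's actual values

# Non-trainable Gaussian style embedding: one fixed random vector per font.
style_table = rng.normal(size=(NUM_FONTS, EMB_DIM))

def condition_bottleneck(bottleneck, font_id):
    """Concatenate the font's style vector onto the encoder bottleneck,
    so the decoder knows which font's style to synthesize."""
    return np.concatenate([bottleneck, style_table[font_id]])
```

Because the table is fixed, the rest of the network must learn to interpret each random vector as a font identity, which is what the category loss supervises.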
  11. This is the architecture of zi2zi; we can see the embedding and several loss functions. The losses fall into three kinds: the G loss for the generator, the D loss for the discriminator, and the L1 loss, which measures the difference between generated and real images.
  12. This is the training and test process of zi2zi. We prepare many fonts, designate the number of characters for each font, and render each character paired with its counterpart in the source font. We then train on all these paired images together. For testing, we select the target font by designating its embedding id.
  13. For image quality assessment (IQA), we use SSIM and PSNR to evaluate the similarity and quality of the generated images. The Structural Similarity Index (SSIM) measures the perceived similarity between two given images; it ranges from 0 to 1, where a larger value means the images are more similar. PSNR measures the quality of the generated image: if it contains a lot of noise or blur, the value will be small.
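PSNR follows directly from the mean squared error between the two images. A minimal NumPy sketch of the standard formula (assuming pixel values normalized to [0, 1]):

```python
import numpy as np

def psnr(generated, reference, max_val=1.0):
    """Peak signal-to-noise ratio in dB; higher means less noise/blur.

    max_val is the maximum possible pixel value (1.0 for normalized images).
    """
    mse = np.mean((generated - reference) ** 2)
    if mse == 0:
        return float("inf")   # identical images
    return 20 * np.log10(max_val) - 10 * np.log10(mse)
```

SSIM is more involved (it compares local luminance, contrast, and structure statistics), so in practice a library implementation such as the one in scikit-image is typically used rather than hand-rolling it.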
  14. Next, I want to introduce our methods.
  15. The figure shows badly formed samples produced by zi2zi's original training method. After analyzing the dataset and the generated samples, we hypothesize the following causes. First, we assume stroke weight has a critical effect on training: if stroke weights differ a lot, the generated characters become blurred (like the 2nd character). Second, the same component (radical) may appear too many times within one font, which may reduce the learning effect on that font. Finally, many handwritten characters carry a strong personal style and are hard to recognize.
  16. To test these hypotheses, we propose three new training methods.
  17. The first method is to use only hard-pen handwritings. We found that the same stroke with different weights shows a big difference in structure. Characters written with a brush tend to have heavier strokes than those written with a hard pen; if we train them together, the model may learn wrongly formed structures. So we assume it is better to concentrate on one similar type of handwriting at a time. We also think that lighter strokes lead to a better learning effect: characters written in light strokes tend to have a clear structure, which makes it easier for the model to tell which radical is shown and which part a stroke belongs to, even when a character has many strokes. Conversely, a radical written in heavy strokes may appear to belong to another part, causing the model to learn a wrongly formed structure; this kind of misleading might be the main cause of blur in some generated characters.
  18. The 2nd method is reducing the number of characters that share a common radical. We found that the same radical looks similar across different characters within one hard-pen handwriting font, but differs across fonts. We assume that learning too many characters with the same radical in one font is less effective than learning entirely different characters instead. To make this concrete, consider an extreme case: in case 1, all the chosen characters in font A share the same radical, while in case 2, each font has only one character with that radical. We expect case 2 to have a better learning effect than case 1.
  19. The 3rd method is training from less stylized fonts to more stylized fonts. As the figure shows, handwritings differ greatly because of personal writing habits: the characters at the top have a clearer structure and are easy to identify, more like a printed font, while those at the bottom are more stylized and harder to recognize. Inspired by the common-sense idea that learning from easy to hard material improves the learning effect, we propose to begin training with the more printed-like handwriting and then fine-tune with the more stylized handwriting. This is also known as curriculum learning. We assume this method will also improve the learning effect.
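The easy-to-hard schedule described above amounts to two training phases that reuse the same model weights. A minimal sketch, assuming placeholder names (`train_step`, the epoch counts) that are illustrative rather than the actual experimental settings:

```python
def curriculum_train(model, easy_fonts, hard_fonts, train_step,
                     easy_epochs=30, hard_epochs=10):
    """Easy-to-hard curriculum sketch: train on clearly structured fonts
    first, then fine-tune on the more stylized handwriting with the same
    weights. Epoch counts here are illustrative, not the paper's settings."""
    for _ in range(easy_epochs):          # phase 1: easier, printed-like fonts
        for font in easy_fonts:
            train_step(model, font)
    for _ in range(hard_epochs):          # phase 2: fine-tune on stylized fonts
        for font in hard_fonts:
            train_step(model, font)
```

The key design point is that phase 2 starts from phase-1 weights instead of a fresh initialization, which is exactly the transfer-learning setup from the background section.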
  20. Then let's talk about the experiments and results.
  21. For the dataset, we mainly use three types of handwriting fonts: printed, hard-pen, and brush calligraphy, with 40 fonts prepared for each type. The original format is TrueType (.ttf), an outline font standard that defines a consistent style for every character it contains, so we can choose any characters and render them as images. In our experiment, we choose SIMSUN as the source font; its glyph is shown in the figure.
  22. To test our hypotheses, we carry out three comparison experiments. For the first, we prepare two training sets: one includes only hard-pen handwriting fonts, while the other is a mixed dataset including printed, hard-pen, and brush calligraphy fonts. We refer to the first setting as "hard-pen" and the second as "mixed".
  23. For the 2nd experiment, we similarly prepare two training sets, as the figure shows. The first uses 50 fonts with 300 characters chosen from each; 15 characters per font contain the common radical, while the rest do not. The second uses 30 fonts with 500 characters from each, of which 25 contain the common radical. We then compare these two training methods, referring to the first as "reduced" and the second as "original".
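The per-font character selection above can be sketched as a small sampling helper. This is an illustrative sketch only: the function name is hypothetical and the character lists stand in for the real glyph sets.

```python
import random

def sample_characters(radical_chars, other_chars, per_font, radical_quota, seed=0):
    """Pick `per_font` characters for one font, capping how many share the
    common radical (e.g. 15 of 300 in the "reduced" setting, 25 of 500 in
    "original"). Both character lists are placeholders for real glyph sets."""
    rng = random.Random(seed)          # fixed seed keeps the split reproducible
    picked = rng.sample(radical_chars, radical_quota)
    picked += rng.sample(other_chars, per_font - radical_quota)
    return picked
```

Fixing the quota per font, rather than sampling freely, is what guarantees the two training sets differ only in how concentrated the common radical is.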
  24. For the 3rd experiment, we choose 25 hard-pen handwriting fonts that are easy to identify as the easier task, and another 5 hard-pen handwriting fonts that are difficult to recognize as the harder task. We first train on the easier task and then fine-tune on the harder one; we call this method "e2h". For comparison, we combine all 30 fonts and train another model for the same number of epochs; we call this second method "normal".
  25. For each experiment, we display the generated samples and the evaluation metrics. For the 1st experiment, the metrics show that hard-pen achieved better image quality, but not higher similarity.
  26. For the 2nd experiment, the metrics show that the reduced method also achieved better image quality, but did not improve similarity.
  27. For the 3rd experiment, the metrics show that e2h achieved both better image quality and higher similarity.
  28. We give a brief discussion of these results. All three methods achieve higher image quality by PSNR, and our generated samples are less blurred; training from easy to hard tasks via curriculum learning makes sense for Chinese handwriting character synthesis. However, the first two methods did not improve similarity. For the first method, we assume the reason is that strokes differ more significantly among handwriting fonts than within mixed-type fonts; for the second, the selected radical has fewer strokes and a simpler structure than other components.
  29. According to the experimental results and discussion, our main conclusions are listed as follows: