The Development of Thai-OCR by Using Dynamic Time Warping technique with Font Files
Mr. Prayook Jatesiktat M . 6/3 No. 15
Mr. Teerapon Thewmorakot M . 6/1 No. 17
Mr. Pongsate Tangseng M . 6/1 No. 19
Adviser Aj. Ptoomsiri Songsiri
Waste time resource
Waste personal resource
If you use scanning, waste memory and difficult to edit
Easier and faster
Easy to edit
Require a little memory
Introduction Working diagram of Artificial Neural Network Feature extraction Learning until get satisfying value Target data Input data Adjust weight of feature value Get pattern for analyze real input data (need skill and experience of researchers) (need the large number of data) (waste much time) (limited fonts are allowed)
To develop Thai-OCR algorithms using font files to solve the problems in OCR research.
Concept Font files Printed matters Same pattern Easy to recognize
Range of research
Characteristic of input data
Thai language only
Grayscale or monochrome bitmap images
No pictures or tables
Create test data by printing on A4 paper and scanning with resolution at 300 dpi
Characteristic of font files
Users must know font’s name and have font file
Study in pre-processing and processing only
Comparison efficiency between Hausdorff Distance and Dynamic Time Warping technique
Range of research
Working structure Post Processing Preprocessing Processing Hausdorff Distance Dynamic Time Warping Efficiencies comparison
Result Table 1 : Time usage 26.45 29.89 Average 27.67 33.33 Cordia New 30.00 30.67 PS Pimpdeed 21.67 25.67 Angsana New Hausdorff Distance Dynamic Time Warping Time usage (second/1 page) Fonts
Result Table 2 : Accuracy 76.19 76.50 Average 75.71 77.07 Cordia New 82.19 71.80 PS Pimpdeed 70.68 80.64 Angsana New Hausdorff Distance Dynamic Time Warping Accuracy (%) Fonts
In time usage comparison, Hausdroff Distance use less time than Dynamic Time Warping.
In accuracy comparison, accuracy efficiencies are up to type of font
It will be better if we can read data form font files.
It will be useful if we can recognize type of font without input from user.