2007 05 Ocropus Million Books Projects - Presentation Transcript
Issues / Applications of OCR
Thomas M. Breuel
background
► CUDA
● reliable OCR-free conversion of scanned documents for
handheld readers through layout analysis
► Image-Based Personal Computing Project
● key idea: the human-readable document image, not its
structural markup, carries the meaning
► OCRopus
● open source OCR project, sponsored by Google, for
digital library/book scanning applications
0 comments
Post a comment