IMPACT Final Conference - Research Parallel Sessions - 01 impact conference_research_session_aa

Published in: Education, Technology


  1. IMPACT Research: Image Enhancement, Segmentation, Experimental OCR
     Apostolos Antonacopoulos, PRImA Lab, The University of Salford, United Kingdom
     www.primaresearch.org
  2. Outline
     • Overview: digitisation workflow
     • Image enhancement
         • Border removal
         • Page curl removal
         • Correction of arbitrary warping
     • Segmentation
         • Recognition-based
         • Standalone
     • Typewritten document OCR
     • Wordspotting
  3. Overview: Digitisation Workflow
     • Main steps:
         • Scanning
         • Image enhancement
             • Page splitting
             • Border removal
             • Page curl removal
             • Dewarping
         • Layout analysis
             • Segmentation of regions, lines, words and characters
             • Region classification
             • Logical layout analysis
         • OCR (incl. specialist or wordspotting)
         • Post-processing
  4. Correction of Arbitrary Warping (22 March 2011 – EC review)
     • Fully-automated tool for large-scale digitisation
     • Interface for interactive fine correction (e.g. for boutique digitisation projects)
     • Correction of arbitrary geometric artefacts
     • Multi-column documents
     • Fully-parameterised process (reversible)
     • No adverse effects on non-warped documents
  5. Fully-Automated Dewarping
  6. Global Grid Construction
     [Figure panels: Original Image; Region Segmentation; Global Grid]
  7. Sub-Grid Correction
     [Figure panels: Sub-grid text lines; Sub-grid aligned to baselines; Corrected sub-grid]
  8. Multi-Column Document Correction
     [Figure panels: Original image; Baseline-aligned sub-grids; Corrected image]
  9. Preliminary Results
     • Evaluation calculates deviation from straight lines (shaded area)
     • Method compared with IMPACT page-curl removal method and with original image
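The slide does not say how the shaded area is measured. As a rough illustration only, the sketch below assumes each text-line baseline is given as a list of (x, y) points, fits a least-squares straight line to it, and integrates the absolute deviation with the trapezoid rule. The names `fit_line` and `deviation_area` are hypothetical and are not taken from the IMPACT tools.

```python
def fit_line(points):
    """Least-squares fit y = slope * x + intercept (needs two distinct x values)."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def deviation_area(points):
    """Approximate area between a baseline polyline and its best-fit straight line.

    Assumption: the 'shaded area' is the integral of the absolute vertical
    deviation. The trapezoid rule on absolute residuals is only approximate
    where the baseline crosses the fitted line between two samples.
    """
    pts = sorted(points)
    slope, intercept = fit_line(pts)
    residuals = [abs(y - (slope * x + intercept)) for x, y in pts]
    area = 0.0
    for i in range(len(pts) - 1):
        dx = pts[i + 1][0] - pts[i][0]
        area += 0.5 * (residuals[i] + residuals[i + 1]) * dx
    return area
```

Under this assumption a perfectly straight baseline scores zero, and a lower score after dewarping would indicate straighter text lines.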
  10. Textline and Word Segmentation
     • Standalone methods that can be integrated into systems without the need to integrate the FR engine
     • Not based on recognition of characters/words: suitable for documents with non-dictionary words or documents that are not practical to OCR (word spotting)
     • Used in other IMPACT methods:
         • Typewritten OCR
         • Correction of arbitrary warping
         • Word spotting
  11. Hybrid Text Line Segmenter
     • Hybrid approach based on connected component clustering and projection profiles
     • Connected component extraction (incl. noise filtering)
     • Group components into line candidates using an efficient data structure
     • Find and split under-segmented lines using local projection profiles
     • Merge small peripheral lines to appropriate neighbour (e.g. for i-dots etc.)
     [Diagram labels: Bitonal image; Text regions (PAGE XML); Regions with text lines (PAGE XML); Parameters]
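The first steps listed on this slide (component extraction, noise filtering, grouping into line candidates) can be sketched as follows. This is a minimal illustration, not the IMPACT implementation: it assumes a bitonal image given as a list of rows of 0/1 values, uses a simple sort where the slide mentions an unspecified "efficient data structure", and leaves out the splitting and merging steps. The names `connected_components`, `group_into_lines` and `min_area` are hypothetical.

```python
from collections import deque


def connected_components(image):
    """Return inclusive bounding boxes (top, left, bottom, right) of 8-connected blobs."""
    rows = len(image)
    cols = len(image[0]) if rows else 0
    seen = [[False] * cols for _ in range(rows)]
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if image[r][c] != 1 or seen[r][c]:
                continue
            # Breadth-first flood fill from this unvisited foreground pixel.
            seen[r][c] = True
            queue = deque([(r, c)])
            top, left, bottom, right = r, c, r, c
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < rows and 0 <= nx < cols
                                and image[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


def group_into_lines(boxes, min_area=2):
    """Greedily group boxes into line candidates by vertical overlap.

    Boxes whose bounding-box area is below min_area are dropped as noise
    (an assumed, very crude noise filter).
    """
    kept = [b for b in boxes if (b[2] - b[0] + 1) * (b[3] - b[1] + 1) >= min_area]
    lines = []
    for box in sorted(kept):  # sorted by top edge, then left edge
        if lines and box[0] <= lines[-1]["bottom"]:
            lines[-1]["boxes"].append(box)
            lines[-1]["bottom"] = max(lines[-1]["bottom"], box[2])
        else:
            lines.append({"top": box[0], "bottom": box[2], "boxes": [box]})
    return lines
```

A greedy overlap test like this merges touching or overlapping lines into one candidate, which is the under-segmentation case the slide addresses with local projection profiles.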
  12. Density Word Segmenter
     • Adaptive projection-profile based approach using foreground pixel density
     • For each text line:
         • Generate vertical projection profile
         • Find delimiting white spaces using an adaptive threshold based on the density of foreground pixels in the line
         • Group connected components into words
     [Diagram labels: Bitonal image; Text regions and lines (PAGE XML); Regions, text lines and words (PAGE XML); Parameters]
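The per-line steps above can be sketched as below. The slide does not give the actual threshold formula, so the rule used here (minimum gap width shrinks as foreground density grows) is an assumption for illustration, as are the names `vertical_profile`, `segment_words` and `base_gap`. The sketch returns column ranges instead of grouping connected components, and takes a text line as a list of rows of 0/1 values.

```python
def vertical_profile(line):
    """Count foreground (1) pixels in each column of a bitonal text-line image."""
    return [sum(column) for column in zip(*line)]


def segment_words(line, base_gap=4.0):
    """Return (start, end) column ranges of words, with end exclusive."""
    profile = vertical_profile(line)
    width, height = len(profile), len(line)
    if width == 0 or height == 0:
        return []
    density = sum(profile) / (width * height)
    # Hypothetical adaptive rule: denser lines get a narrower word-gap threshold.
    min_gap = max(1, round(base_gap * (1.0 - density)))
    words = []
    start = None  # first column of the word currently being built
    gap = 0       # length of the current run of empty columns inside a word
    for x, count in enumerate(profile):
        if count > 0:
            if start is None:
                start = x
            gap = 0
        elif start is not None:
            gap += 1
            if gap >= min_gap:
                words.append((start, x - gap + 1))
                start = None
                gap = 0
    if start is not None:
        # Close the last word, excluding any trailing empty columns.
        words.append((start, width - gap))
    return words
```

Gaps narrower than `min_gap` are treated as spacing between characters, so they stay inside a word; only wider gaps delimit words.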
  13. Evaluation
     • Text line ground truth: 25 historical documents (more than 2700 text lines)
     • Results (using USAL layout evaluation tool):
     • Word ground truth: 15 historical documents (more than 14500 words)
     • Results (using USAL layout evaluation tool):
  14. Further Information
     • PRImA
         • http://www.primaresearch.org
     • IMPACT
         • http://www.impact-project.eu
