View stunning SlideShares in full-screen with the new iOS app!Introducing SlideShare for AndroidExplore all your favorite topics in the SlideShare appGet the SlideShare app to Save for Later — even offline
View stunning SlideShares in full-screen with the new Android app!View stunning SlideShares in full-screen with the new iOS app!
Census which is conducted using ‘canvasser’ method is in two phases:
Census Organization has experimented with new IT innovations since the beginning
Technology is required particularly for data capture/processing – mainly due to large volume and for speedier tabulation & release of Census results
MODE FOR DATA CAPTURE & PROCESSING SINCE 1961 Census 1961 1971 1981 1991 2001 Population 43.9 Million 54.8 Million 68.3 Million 84.6 Million 102.8 Million Collection% 100 100 100 100 100 Capture % 5 15 25 45 100 Mode Hand Punch Key Punch Data Entry Data Entry Scanning/ICR Time taken 8-9Years 8-9Years 8-9 Years 7-8 Years 3-5 Years
Services of System Integrator hired to guide and assist in the implementation of ICR technology.
An unique model for Outsourcing
SI to work in our premises for better
communication and control
maintain data security, safety and confidentiality
Capacity building (Training and guiding to IT staff)
Production Linked payment to SI
DATA CAPTURE & PROCESSING IN 2001 CENSUS Work Flow of ORGI (TIS Eflow characteristic) Design data capture workflow Presents a graphical view of the system Monitors the processing and workflow in real time Enables to customize applications and add custom features
DATA CAPTURE & PROCESSING IN 2001 CENSUS Work flow Modules Scan Portal, File Portal, Controller FormID, Manual FormID RC Processing [OCR/ICR] Tile, Completion, CAC & Exception Export
DATA CAPTURE & PROCESSING IN 2001 CENSUS ORGI Workflow Stages ASCII FILE Prepare Batch Scanning Recognition Tiling Completion Exception Export / Archival Server
Server Controller station Tiling & Completion stations Export station Scanning station Recognition stations Exception stations DATA CAPTURE & PROCESSING IN 2001 CENSUS L AN SETUP - ORGI DATA CENTERs Forms are fed thru SCANNER(S) batch by batch Field by field character images are automatically RECOGNISED Tile/Correction station - Un-recognised Characters are corrected by OPERATORS Supervisors Handle Exceptional cases referred by Operators Supervisor Export completed batches as ASCII file for further processing Supervisor Monitor the workflow & Balance the load at different stages of operation Form IMAGES stored in Network DISK
Special training to enumerators for filling the forms
For CAC, use knowledge Based dictionaries to increase throughput
Use of concurrent quality check procedures on the line of USA and UK
DATA CAPTURE & PROCESSING Technology for 2011 Census
Continuation of ICR Technology
International and national experience shows as on date no better substitute for scanning & ICR technology
Expertise and competence gained in using ICR technology available in the organization
DATA CAPTURE & PROCESSING Technology for 2011 Census (contd..)
Use more efficient scanners having facility for image enhancement, noise removal, color drop-out, better throughput and on-spot detection and correction (through in-built software) of bad images to be used.
Use of improved version of ICR software with better recognition and built-in enhanced workflow management capability.
Use new features in Auto/Computer Assisted Coding in ICR software
Thank you. Visit Our Website at www.censusindia.gov.in
Intelligent Character Recognition (ICR) Technology is used to extract the handwritten/machine printed (typeset) character(s) from the scanned images to generate the computer processable data file. In brief, following steps are involved in using ICR technology.
Sc anning :- Paper based forms are scanned to create bit map image file
Fi le Portal ::- It is an Image File Registration module in eflow as an input to next activity.
Fo rm Identification :- Automatically identifies the Images of various schedules based on the Empty Form Image (EFI) template created during the designing stage.
Mother Tongue & Other languages Name of SC/ST Education Religion
NCO HOUSEHOLD SCHEDULE- SIDE B NCO NIC Place of Birth & Last residence
DATA CAPTURE & PROCESSING Selection of technology OMR/OCR / ICR in 2001
Recognition of hand written descriptive entries in different languages is beyond the capabilities of the known ICR SW and hence a conscious decision was taken to go in for the recognition of Only Numeric Characters, leaving the rest to be handled thru Image enabled computer assisted coding (CAC) . Following key features were introduced in the data capture solution.
Parameters for selecting the ICR Software
Highest recognition rate and lowest percentage of false positive with customization and assured support & Training
Facility of organized workflow in LAN environment with centralized controls with Computer Assisted Coding facility.
In built quality enhancement tools to trap the wrongly recognized characters so as to facilitate corrective action.
U se of multiple engines with voting algorithm. Ability to incorporate validation rules to trap inconsistent entries/wrong recognition. Learning capabilities of engines.