This document describes SuperVISOR, an adaptive system for paper data form recognition and extraction. It analyzes problems with existing data requirements and status of current software. SuperVISOR uses an OCR engine like Tesseract for recognition, has a flexible XML-based supervisor-builder for creating extraction rules, and provides high accuracy and flexibility though computational speed remains slow on large files. A demo is available.