1) The document surveys approaches for scene text recognition in images over the past few years. It discusses techniques for text detection and localization, recognition, and end-to-end systems.
2) Traditional techniques included connected component analysis and texture-based methods using filters. Recent deep learning methods apply convolutional neural networks for region proposal, segmentation, and recognition.
3) State-of-the-art approaches use CNN-RNN models to recognize text as character sequences. Future areas of interest include handling more complex scenes and multi-language text in real-time systems.