Abstract:
This paper considers the formation of training data for machine learning methods in the context of the text recognition problem. The initial data is a set of images of documents with a Machine Readable Zone (MRZ), shown to a camera in some orientation. The relative positions of the camera and the document, the lighting, and the camera model may vary widely.
Keywords: mono-font text recognition, synthesis of training data, neural networks.