RUS  ENG
Full version
JOURNALS // Computer Optics // Archive

Computer Optics, 2017 Volume 41, Issue 3, Pages 441–452 (Mi co404)

This article is cited in 8 papers

IMAGE PROCESSING, PATTERN RECOGNITION

A review of algorithms for text detection in images and videos

Yu. A. Bolotova, V. G. Spitsyn, P. M. Osina

Tomsk Polytechnic University, Tomsk, Russia

Abstract: This article reviews the history and state-of-the-art optical character recognition systems, such as ABBYY FineReader, Tesseract, CuneiForm, with particular attention given to their inner algorithms, including page layout analysis; page segmentation and document skew angle estimation. The overview includes the description and comparison of different methods proposed for the last 30 years in terms of speed and versatility. Critical analysis and discussions about the status of the field and open problems are reported.

Keywords: OCR, page layout analysis, text segmentation, skew detection.

Received: 21.02.2017
Accepted: 19.04.2017

DOI: 10.18287/2412-6179-2017-41-3-441-452



© Steklov Math. Inst. of RAS, 2024