Список литературыR. Sánchez-Rivero, P. V. Bezmaternykh, A. V. Gayer, A. Morales-González, F. J. Silva-Mata, K. B. Bulatov“A joint study of deep learning-based methods for identity document image binarization and its influence on attribute recognition”, Компьютерная оптика, 2023, том 47, выпуск 4,страницы 627

A joint study of deep learning-based methods for identity document image binarization and its influence on attribute recognition
R. Sánchez-Rivero, P. V. Bezmaternykh, A. V. Gayer, A. Morales-González, F. J. Silva-Mata, K. B. Bulatov



Список литературы

1.	Doermann D, Tombre K, Handbook of document image processing and recognition, Springer Publishing Company Inc, 2014
2.	Arlazarov VV, Andreeva EI, Bulatov KB, Nikolaev DP, Petrova OO, Savelev BI, Slavin OA, “Document image analysis and recognition: a survey”, Computer Optics, 46:4 (2022), 567–589
3.	Bulatov KB, Bezmaternykh PV, Nikolaev DP, Arlazarov VV, “Towards a unified framework for identity documents analysis and recognition”, Computer Optics, 46:3 (2022), 436–454
4.	Arlazarov VL, Arlazarov VV, Bulatov KB, Chernov TS, Nikolaev DP, Polevoy DV, Sheshkus AV, Skoryukina NS, Slavin OA, Usilin SA, “Mobile ID document recognition-coarse-to-fine approach”, Pattern Recognit Image Anal, 32:1 (2022), 89–108
5.	Arlazarov VV, Bulatov K, Chernov T, Arlazarov VL, “MIDV-500: a dataset for identity document analysis and recognition on mobile devices in video stream”, Computer Optics, 43:5 (2019), 818–824
6.	Bulatov K, Emelianova E, Tropin D, et al., MIDV-2020: A comprehensive benchmark dataset for identity document analysis, 2021, arXiv: 2107.00396
7.	Sánchez-Rivero R, Bezmaternykh P, Morales-González A, Silva-Mata FJ, Bulatov K, “Assessing the relationship between binarization and OCR in the context of deep learning-based ID document analysis”, Progress in artificial intelligence and pattern recognition, eds. Heredia YH, Núñez VM, Shulcloper JR, Springer International Publishing, Cham, 2021, 134–144
8.	Lins RD, Almeida MMD, Bernardino RB, Jesus D, Oliveira JM, “Assessing binarization techniques for document images”, DocEng 2017: Proc 2017 ACM Symposium on Document Engineering, 2017, 183–192
9.	Mustafa WA, Kader MMMA, “Binarization of document images: A comprehensive review”, J Phys: Conf Ser, 1019 (2018), 012023
10.	Tensmeyer C, Martinez T, “Historical document image binarization: A review”, SN Comput Sci, 1:3 (2020), 173
11.	Pratikakis I, Zagoris K, Barlas G, Gatos B, “ICFHR2016 handwritten document image binarization contest (H-DIBCO 2016)”, 2016 15th Int Conf on Frontiers in Handwriting Recognition (ICFHR), 2016, 619–623
12.	Pratikakis I, Zagoris K, Karagiannis X, Tsochatzidis L, Mondal T, Marthot-Santaniello I, “Document image binarization (DIBCO 2019)”, 2019 Int Conf on Document Analysis and Recognition (ICDAR), 2019, 1547–1556
13.	Smith EHB, “An analysis of binarization ground truthing”, Proc 8th IAPR Int Workshop on Document Analysis Systems (DAS ’10), 2010, 27–34
14.	Ntirogiannis K, Gatos B, Pratikakis I, “Performance evaluation methodology for historical document image binarization”, IEEE Trans Image Process, 22:2 (2013), 595–609
15.	Rani U, Kaur A, Josan G, “A new binarization method for degraded document images”, Int J Inf Technol, 15:1 (2019), 1035–1053
16.	Milyaev S, Barinova O, Novikova T, Kohli P, Lempitsky V, “Image binarization for end-to-end text understanding in natural images”, 2013 12th Int Conf on Document Analysis and Recognition, 2013, 128–132
17.	Chou C-H, Lin W-H, Chang F, “A binarization method with learning-built rules for document images produced by cameras”, Pattern Recogn, 43:4 (2010), 1518–1530
18.	Wen J, Li S, Sun J, “A new binarization method for non-uniform illuminated document images”, Pattern Recogn, 46:6 (2013), 1670–1690
19.	Tafti AP, Baghaie A, Assefi M, Arabnia HR, Yu Z, Peissig P, “OCR as a service: An experimental evaluation of google docs OCR, tesseract, ABBYY FineReader, and transym”, Advances in visual computing, eds. Bebis G, Boyle R, Parvin B, Koracin D, Porikli F, Skaff S, Entezari A, Min J, Iwai D, Sadagic A, Scheidegger C, Isenberg T, Springer International Publishing AG, Cham, Switzerland, 2016, 735–746
20.	Li Z, Yang C, Shen Q, Wen S, “A document image dataset for quality assessment”, J Phys: Conf Ser, 1828:1 (2021), 012033
21.	Ye P, Doermann D, “Document image quality assessment: A brief survey”, 2013 12th Int Conf on Document Analysis and Recognition, 2013, 723–727
22.	Polevoy DV, Bulatov KB, Skoryukina NS, Chernov TS, Arlazarov VV, Sheshkus AV, “Key aspects of document recognition using small digital cameras”, RFBR J, 4:92 (2016), 97–108
23.	Chernov T, Ilyuhin S, Arlazarov VV, “Application of dynamic saliency maps to the video stream recognition systems with image quality assessment”, Proc SPIE, 11041 (2019), 110410T
24.	Shemiakina J, Limonova E, Skoryukina N, Arlazarov VV, Nikolaev DP, “A method of image quality assessment for text recognition on camera-captured and projectively distorted documents”, Mathematics, 9:17 (2021), 2155
25.	Bezmaternykh PV, Ilin DA, Nikolaev DP, “U-Net-bin: hacking the document image binarization contest”, Computer Optics, 43:5 (2019), 825–832
26.	Calvo-Zaragoza J, Gallego AJ, “A selectional auto-encoder approach for document image binarization”, Pattern Recogn, 86 (2019), 37–47
27.	Masyagin M, Robust document image binarization tool, 2021 https://github.com/masyagin1998/robin
28.	Otsu N, “A threshold selection method from gray-level histograms”, IEEE Trans Syst Man Cybern Syst, 9:1 (1979), 62–66
29.	Lins RD, Simske SJ, Bernardino RB, “DocEng'2020 time-quality competition on binarizing photographed documents”, Proc ACM Symposium on Document Engineering, 2020, 2
30.	Yu D, Li X, Zhang C, Liu T, Han J, Liu J, Ding E, “Towards accurate scene text recognition with semantic reasoning networks”, Computer Vision and Pattern Recognition (CVPR), 2020, 12113–12122
31.	Du Y, Li C, Guo R, Cui C, Liu W, Zhou J, Lu B, Yang Y, Liu Q, PP-OCRv2: Bag of tricks for ultra lightweight OCR system, 2021, arXiv: 2109.03144
32.	Lee J, Park S, Baek J, Oh SJ, Kim S, Lee H, “On recognizing texts of arbitrary shapes with 2D self-attention”, Proc IEEE/CVF Conf on Computer Vision and Pattern Recognition Workshops, 2020, 546–547
33.	Baek J, Kim G, Lee J, Park S, Han D, Yun S, Oh SJ, Lee H, “What is wrong with scene text recognition model comparisons? dataset and model analysis”, 2019 IEEE/CVF Int Conf on Computer Vision (ICCV), 2019, 4714–4722
34.	Cai H, Sun J, Xiong Y, Revisiting classification perspective on scene text recognition, 2021, arXiv: 2102.10884
35.	Smith R, “An overview of the tesseract OCR engine”, IEEE Int conf on Document Analysis and Recognition (ICDAR’07), 2 (2007), 629–633
36.	Michalak H, Okarma K, “Robust combined binarization method of non-uniformly illuminated document images for alphanumerical character recognition”, Sensors, 20:10 (2020), 2914
37.	Yujian L, Bo L, “A normalized Levenshtein distance metric”, IEEE Trans Pattern Anal Mach Intell, 29:6 (2007), 1091–1095
38.	Schulz D, Maureira J, Tapia J, Busch C, “Identity documents image quality assessment”, 2022 30th European Signal Processing Conf (EUSIPCO), 2022, 1017–1021