RUS  ENG
Full version
JOURNALS // Informatsionnye Tekhnologii i Vychslitel'nye Sistemy // Archive

Informatsionnye Tekhnologii i Vychslitel'nye Sistemy, 2019 Issue 2, Pages 32–43 (Mi itvs343)

DATA PROCESSING AND ANALYSIS

Making a Web service from OCR SDK

D. P. Matalovab, E. L. Pliskinc

a LLC "Smart Engines Service", Moscow, Russia
b National Research University MIPT (Moscow Institute of Physics and Technology), 141700, Dolgorpudny, Russia
c Federal Research Center «Computer Science and Control» RAS, 119333, Moscow, Russia

Abstract: This article summarizes authors’ experience of developing a web service (WS) based on a document optical character recognition (OCR) software development kit (SDK). We consider issues of WS stability and performance, including: ability not to lose data under high load and after restart; ability to timely detect errors and limit their spread and duration; as well as deterministic WS behavior under conditions of parallel processing of multiple requests. High WS performance implies moderate overhead costs associated with receiving web requests and sending web responses to clients, besides of OCR engine costs itself. The described solution can be used to create a web service from any SDK which enables developer to process input documents and obtain output files from them, not necessarily in connection with optical recognition technologies.

Keywords: SDK, SOAP, REST, Java, Web service, optical character recognition, OCR, multithreading.

DOI: 10.14357/20718632190204



Bibliographic databases:


© Steklov Math. Inst. of RAS, 2024