Kh. S. Khechoyan, “Synthetic document generation for the task of visual document understanding”, Proceedings of the YSU, Physical and Mathematical Sciences, 2024, Volume 58, Issue 3,Pages <nobr>79

Informatics

Synthetic document generation for the task of visual document understanding

Kh. S. Khechoyan

Yerevan State University, Faculty of Informatics and Applied Mathematics

Abstract: Solving the problem of document analysis using machine learning methods requires a large amount of labeled data. Such data is not always available, and if available, it only covers certain types of documents.
In this paper, we present a method for creating synthetic data that allows creating documents of any type by pre-defining the document components. By changing the arrangement of document components, text content, and visual elements using configurations, we create diverse and realistic datasets that mimic real documents. This method addresses the problem of the lack of labeled datasets and offers a flexible solution to improve the results of a machine learning model.

Keywords: machine learning, data generation, document understanding

MSC: 68T20

Received: 22.05.2024
Revised: 03.01.2025
Accepted: 17.01.2025

Language: English

DOI: 10.46991/PYSUA.2024.58.3.079