RUS  ENG
Full version
JOURNALS // Informatika i Ee Primeneniya [Informatics and its Applications] // Archive

Inform. Primen., 2023 Volume 17, Issue 4, Pages 42–47 (Mi ia872)

An extensible approach to data fusion in distributed computing environments

V. V. Sazontev, S. A. Stupnikov, V. N. Zakharov

Federal Research Center “Computer Science and Control” of the Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119333, Russian Federation

Abstract: The paper belongs to the area of development of methods and tools for data integration. One of the most important stages of data integration is data fusion, i. e., the combination of records relating to the same real-world entity into a single record with conflict resolution for each of the attributes. The paper considers the formal statement of the data fusion problem, provides a brief review of major groups of data fusion methods. An approach for implementation of the data fusion stage within an extensible heterogeneous data integration system in a distributed computing environment is proposed. Software architecture and basic implementation ideas of the approach are considered.

Keywords: data fusion, distributed computing environment.

Received: 29.09.2023

DOI: 10.14357/19922264230406



© Steklov Math. Inst. of RAS, 2024