Computer Research and Modeling, 2015 Volume 7, Issue 3, Pages 517–520 (Mi crm212)

This article is cited in 2 papers

SECTION REPORTS

Efficient processing and classification of wave energy spectrum data with a distributed pipeline

I. G. Gankevich, A. B. Degtyarev

Saint Petersburg State University, University ave. 35, Peterhof, St. Petersburg, 198504, Russia

Abstract: Processing large amounts of data often consists of several steps, e.g. pre- and post-processing stages, which are executed sequentially, with data written to disk after each step. However, when the pre-processing stage differs for each task, a more efficient approach is to construct a pipeline that streams data from one stage to the next. In the more general case, some processing stages can be factored into several parallel subordinate stages, forming a distributed pipeline in which each stage can have multiple inputs and multiple outputs. This processing pattern emerges in the problem of classifying wave energy spectra using analytic approximations, which can extract the different wave systems and their parameters (e.g. wave system type, mean wave direction) from a spectrum. The distributed pipeline approach achieves good performance compared to conventional “sequential-stage” processing.
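
The streaming idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: threads and in-memory queues within one process stand in for distributed nodes, and the names `run_stage`, `run_pipeline`, `normalize` and `peak_index` are hypothetical, chosen only for illustration. Each stage passes items to the next stage as soon as they are ready instead of writing them to disk, and a stage may be served by several parallel workers.

```python
import queue
import threading

_DONE = object()  # sentinel marking the end of the stream


def run_stage(func, inbox, outbox, workers=1):
    """Start `workers` threads that apply `func` to items read from `inbox`
    and put the results on `outbox`. Returns the started threads."""
    remaining = [workers]
    lock = threading.Lock()

    def worker():
        while True:
            item = inbox.get()
            if item is _DONE:
                inbox.put(_DONE)  # let sibling workers see the sentinel too
                break
            outbox.put(func(item))
        with lock:
            remaining[0] -= 1
            if remaining[0] == 0:
                outbox.put(_DONE)  # the last worker closes the next queue

    threads = [threading.Thread(target=worker) for _ in range(workers)]
    for t in threads:
        t.start()
    return threads


def run_pipeline(items, stages):
    """Stream `items` through `stages`, a list of (func, workers) pairs.
    Results are returned in completion order, not input order."""
    first = queue.Queue()
    current, threads = first, []
    for func, workers in stages:
        nxt = queue.Queue()
        threads += run_stage(func, current, nxt, workers)
        current = nxt
    for item in items:
        first.put(item)
    first.put(_DONE)
    results = []
    while True:
        result = current.get()
        if result is _DONE:
            break
        results.append(result)
    for t in threads:
        t.join()
    return results


# Hypothetical stages: a toy "pre-processing" and a toy "classification".
def normalize(spectrum):
    total = sum(spectrum)
    return [v / total for v in spectrum]


def peak_index(spectrum):
    return max(range(len(spectrum)), key=spectrum.__getitem__)


spectra = [[1.0, 3.0, 2.0], [4.0, 1.0, 1.0], [0.5, 0.5, 2.0]]
peaks = run_pipeline(spectra, [(normalize, 2), (peak_index, 2)])
print(sorted(peaks))
```

Because each stage runs several workers, results arrive in completion order; a real pipeline would tag each item with an identifier if the original order matters.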

Keywords: distributed system, big data, data processing, parallel computing.

UDC: 004.04

Received: 01.10.2014

Language: English

DOI: 10.20537/2076-7633-2015-7-3-517-520
