RUS  ENG
Full version
JOURNALS // Avtomatika i Telemekhanika // Archive

Avtomat. i Telemekh., 2022 Issue 10, Pages 80–93 (Mi at16053)

This article is cited in 1 paper

Topical issue

Cloning and conversion of an arbitrary voice using generative flows

D. S. Obukhov

Novosibirsk State Technical University, Novosibirsk, 630073 Russia

Abstract: To improve the quality of generated speech signals, this paper proposes a method for taking into account time-varying information about the speaker. Using this technique, the system synthesizes more natural speech with a voice similar to the given target voice in both the voice cloning and voice conversion problems.

Keywords: voice cloning, voice conversion, speech synthesis, streaming generative model, speaker embedding, pitch frequency.

Presented by the member of Editorial Board: A. A. Lazarev

Received: 22.01.2022
Revised: 25.04.2022
Accepted: 29.06.2022

DOI: 10.31857/S0005231022100087


 English version:
Automation and Remote Control, 2022, 83:10, 1555–1566


© Steklov Math. Inst. of RAS, 2024