Abstract: Emotional voice conversion (EVC) transforms the emotional state of speech while preserving linguistic content and speaker identity. Although sequence-to-sequence models have achieved ...
Abstract: Generative adversarial network (GAN)-based vocoders have been intensively studied because they can synthesize high-fidelity audio waveforms faster than real-time. However, it has been ...