Local partial least square regression for spectral mapping in voice conversion

Xiaohai Tian, Zhizheng Wu, Engsiong Chng

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

The joint density Gaussian mixture model (JD-GMM) based method has been widely used in voice conversion because of its flexible implementation. However, the statistical averaging that occurs when estimating the model parameters over-smooths the target spectral trajectories. Motivated by the local linear transformation method, which uses neighboring data rather than all the training data to estimate the transformation function for each feature vector, we propose a local partial least square method that avoids both the over-smoothing problem of JD-GMM and the over-fitting problem of local linear transformation when training data are limited. We conducted experiments on the VOICES database and measured both the spectral distortion and the correlation coefficient of the spectral parameter trajectories. The experimental results show that the proposed method performs better than the baseline methods.
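
The idea of fitting a separate regression for each feature vector from its neighbors can be illustrated with a short sketch. This is not the authors' implementation. It assumes that source and target training frames are already time-aligned row by row, that neighbors are chosen by Euclidean distance in the source feature space, and that a standard deflation-based PLS regression is used. The function names `fit_pls` and `local_pls_convert` and the parameters `k` and `n_components` are hypothetical.

```python
import numpy as np


def fit_pls(X, Y, n_components):
    """Fit a PLS regression so that Y ~ (X - x_mean) @ B + y_mean."""
    x_mean, y_mean = X.mean(axis=0), Y.mean(axis=0)
    Xk, Yk = X - x_mean, Y - y_mean
    W, P, Q = [], [], []
    for _ in range(n_components):
        # Weight vector: dominant left singular vector of the cross-covariance.
        w = np.linalg.svd(Xk.T @ Yk, full_matrices=False)[0][:, 0]
        t = Xk @ w                      # latent score for this component
        tt = t @ t
        if tt < 1e-12:                  # nothing left to explain in X
            break
        p = Xk.T @ t / tt               # X loading
        q = Yk.T @ t / tt               # Y loading
        Xk = Xk - np.outer(t, p)        # deflate both blocks
        Yk = Yk - np.outer(t, q)
        W.append(w)
        P.append(p)
        Q.append(q)
    if not W:
        # Degenerate neighborhood: fall back to predicting the local mean.
        return x_mean, y_mean, np.zeros((X.shape[1], Y.shape[1]))
    W, P, Q = np.column_stack(W), np.column_stack(P), np.column_stack(Q)
    B = W @ np.linalg.solve(P.T @ W, Q.T)
    return x_mean, y_mean, B


def local_pls_convert(src_train, tgt_train, src_test, k=20, n_components=2):
    """Convert each test frame with a PLS model fitted on its k nearest
    training frames (assumed neighbor rule: Euclidean distance)."""
    k = min(k, src_train.shape[0])
    converted = np.empty((src_test.shape[0], tgt_train.shape[1]))
    for i, x in enumerate(src_test):
        dist = np.sum((src_train - x) ** 2, axis=1)
        idx = np.argpartition(dist, k - 1)[:k]   # indices of the k nearest frames
        x_mean, y_mean, B = fit_pls(src_train[idx], tgt_train[idx], n_components)
        converted[i] = (x - x_mean) @ B + y_mean
    return converted
```

In this sketch, a small `n_components` restricts each local model to a few latent directions, which is one way to limit over-fitting when the neighborhood holds few frames. Setting it to the full feature dimension makes each local fit behave like an ordinary local linear regression.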
Original language: English
Title of host publication: Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2013 Asia-Pacific
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Pages: 1-6
Number of pages: 6
Publication status: Published - 2013
