On the Quality Improvement of Voice Conversion Systems Based on GMM Model

Eslami, Mahdi; sayadian, abolghasem

All

Webpages

Books

Journals

Tarbiat Modares University Press

The Modares Journal of Electrical Engineering

Volume 5 - MJEE 2005, 5 - : 23-36 | Back to browse issues page

Mendeley

Zotero

RefWorks

Eslami M, sayadian A. On the Quality Improvement of Voice Conversion Systems Based on GMM Model. MJEE 2005; 5 :23-36
URL: http://mjee.modares.ac.ir/article-17-1280-en.html

On the Quality Improvement of Voice Conversion Systems Based on GMM Model

Mahdi Eslami¹

, Abolghasem Sayadian¹

1- Amirkabir Univ. of Tech.

Abstract: (3348 Views)

In a voice conversion system speech signal of A speaker (i.e. source speaker) is modified so that it sounds as if it had been pronounced by B speaker (i.e. target speaker). This process, sometimes, is called speaker conversion (changing speaker identity). Achieved signal from speaker conversion system is desired to have high quality and very natural. To satisfy this, three major methods are proposed as follows: VQ_based, LMR_based and GMM_based voice conversion methods. DTW is the most popular way to warp corresponded words in two sentences. In this paper, DTW is used to design corresponding transfer function. To decrease the distance between two speakers, DTW warps the couple phonemes of two speakers, instead of two words or couple sentences while a linear temporal transform which depends on phonemes is used for error decreasing. By using other appropriate corrections that are used in learning and designing of the linear transforms, a high quality voice conversion system is achieved.

Keywords: Voice Conversion, Speaker Transformation, Spectral Mapping, Gaussian Mixture Model

Full-Text [PDF 2549 kb] (2574 Downloads)

Received: 2004/03/7 | Accepted: 2005/03/7 | Published: 2006/03/8

Rights and permissions
	This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.