Volume 5 -                   MJEE 2005, 5 - : 23-36 | Back to browse issues page

XML Persian Abstract Print


Download citation:
BibTeX | RIS | EndNote | Medlars | ProCite | Reference Manager | RefWorks
Send citation to:

Eslami M, sayadian A. On the Quality Improvement of Voice Conversion Systems Based on GMM Model. MJEE 2005; 5 :23-36
URL: http://mjee.modares.ac.ir/article-17-1280-en.html
1- Amirkabir Univ. of Tech.
Abstract:   (3348 Views)
In a voice conversion system speech signal of A speaker (i.e. source speaker) is modified so that it sounds as if it had been pronounced by B speaker (i.e. target speaker). This process, sometimes, is called speaker conversion (changing speaker identity). Achieved signal from speaker conversion system is desired to have high quality and very natural. To satisfy this, three major methods are proposed as follows: VQ_based, LMR_based and GMM_based voice conversion methods. DTW is the most popular way to warp corresponded words in two sentences. In this paper, DTW is used to design corresponding transfer function. To decrease the distance between two speakers, DTW warps the couple phonemes of two speakers, instead of two words or couple sentences while a linear temporal transform which depends on phonemes is used for error decreasing. By using other appropriate corrections that are used in learning and designing of the linear transforms, a high quality voice conversion system is achieved.
Full-Text [PDF 2549 kb]   (2574 Downloads)    

Received: 2004/03/7 | Accepted: 2005/03/7 | Published: 2006/03/8

Add your comments about this article : Your username or Email:
CAPTCHA

Rights and permissions
Creative Commons License This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.