Volume 4, Issue 1 (2004)                   MJEE 2004, 4(1): 49-65 | Back to browse issues page

XML Persian Abstract Print


Download citation:
BibTeX | RIS | EndNote | Medlars | ProCite | Reference Manager | RefWorks
Send citation to:

RAZAZI F, SAYADIYAN A. A SOFT SEGMENT MODELING APPROACH FOR DURATION MODELING IN PHONEME RECOGNITION SYSTEMS. MJEE 2004; 4 (1) :49-65
URL: http://mjee.modares.ac.ir/article-17-10928-en.html
1- Amirkabir university of technology
Abstract:   (3962 Views)
The geometric distribution of states duration is one of the main performance limiting assumptions of hidden Markov modeling of speech signals. Stochastic segment models, generally, and segmental HMM, specifically, overcome this deficiency partly at the cost of more complexity in both training and recognition phases. In this paper, a new duration modeling approach is presented. The main idea of the model is to consider the effect of adjacent segments on the probability density function estimation and evaluation of each acoustic segment. This idea not only makes the model robust against segmentation errors, but also it models gradual change from one segment to the next one with a minimum set of parameters. The proposed idea is analytically formulated and tested on a TIMIT based context independent phoneme classification system. During the test procedure, the phoneme classification of different phoneme classes was performed by applying various proposed recognition algorithms. The system was optimized and the results have been compared with a continuous density hidden Markov model (CDHMM) with similar computational complexity. The results show slight improvement in phoneme recognition rate in comparison with standard continuous density hidden Markov model. This indicates improved compatibility of the proposed model with the speech nature.
Full-Text [PDF 2263 kb]   (2716 Downloads)    

Received: 2003/01/31 | Accepted: 2004/01/31 | Published: 2004/09/1

Add your comments about this article : Your username or Email:
CAPTCHA

Rights and permissions
Creative Commons License This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.