The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
The paper presents a support vector machine based Part-Of-Speech tagging on Chinese database which is part of our speech synthesis system. The model can be classified as SVM model and uses many sequential features to predict the POS tag. The text database was download from the internet with 1,280,000 words and 33 parts of Speech. The total accuracy of our experiments is 99.31%.
Token-based approaches have proven quite effective for spoken language identification (LID). Traditionally, Speech utterances are first decoded into token sequences, and then LID tasks are performed on these token sequences by either n-gram language models or support vector machines. In this paper, we propose a hierarchical system design, which utilizes a group of bayesian logistic regression models...
In this study, some research activities on expressive speech recognition and conversion will be introduced. A database consisting of five kinds of speech emotions (i.e. happiness, sadness, surprise, anger and neutral) is used. Not only those traditional features such as mfcc, plp, and pitch are studied, but also a new feature extraction method based on fisher's F-Ratio is proposed and reported. In...
Disordered voice database is very useful for analyzing, evaluating patients' disease conditions. According to pathological conditions of patients and characteristics of Mandarin speech, our database is composed of sustained vowel, words, sentences and poetry. The sustained vowel part contains the most common used six vowels in Mandarin speech. The words part contains ten simple words. The sentences...
Noise environment and natural spoken speech, is still a challenging issue for speech recognition. In this paper, study on this field is explored on Mandarin speech, from aspects of signal processing, acoustic model, language model, decoding algorithm, and post processing. The two-phase mel-warped wiener filter algorithm is improved for obtaining noise-robust feature. Segmentation algorithm and gender...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.