The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Features derived using sparse representation (SR)-based approaches have been shown to yield promising results for speech recognition tasks. In most of the approaches, the SR corresponding to speech signal is estimated using a dictionary, which could be either exemplar based or learned. However, a single-level decomposition may not be suitable for the speech signal, as it contains complex hierarchical...
This paper proposes an approach based on compressed sensing to reduce the footprint of speech corpus in unit selection based speech synthesis (USS) systems. It exploits the observation that speech signal can have a sparse representation (in suitable choice of basis functions) and can be estimated effectively using the sparse coding framework. Thus, only few significant coefficients of the sparse vector...
Speech is an informative signal, which conveys many information's like status of the speaker, environmental conditions of the speaker: the other necessary parameters which are classified as prosodic features and general features of speech. As speech is a signal which can be analysed by subjecting and can be inspected to various criteria with the implication of several available techniques. In this...
In this paper, the non-uniform duration modification is exploited along with other prosody features for neutral speech to anger speech conversion. The non-uniform duration modification method modifies the durations of vowel and pause segments by different modification factors. Vowel segments are modified by factors based on their identities, and pause segments by uniform factors. Consonant and transition...
In this paper, a three stage improved speech signal recognition model is presented. The presented approach improved the recognition process by reducing the process time and to provide robust speech recognition. In first layer of presented model, the feature extraction from speech is done using Statistical Analysis based DWT approach. The extracted feature based recognition reduced the signal size...
Robust syllabification of continuous speech is a vital aspect of language and speech processing systems. Syllabification of speech can be done by detecting the syllable nuclei. Syllable is the basic production unit of human speech and syllable nuclei can be attributed to high energy sonarants or resonant sounds which are relatively loud and carry a clear pitch. In this work, high spectral energy at...
Speech coding is one of the major degradation involved in building the speech systems in mobile environment. In this paper, we are exploring the effect of low bit rate speech coding on the accuracy of detection of epochs. Epoch is referred as the instant of significant excitation of the vocal-tract system during production of speech. Many speech applications depend on the the accurate estimation of...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.