Usable speech criteria are proposed to extract minimally corrupted speech for speaker identification in co-channel speech. This paper deals with an empirical mode decomposition for detecting usable speech segments in co-channel speech. The detected usable speech can then be processed by a speaker identification system. Experiments and simulations of this method are performed on the TIMIT database.
Many applications of speech communication and speaker identification suffer from the problem of co-channel speech. This paper deals with a multi-resolution analysis by empirical mode decomposition for detecting usable speech segments in co-channel speech. The detected usable speech can then be processed by a speaker identification system. Experiments and an evaluation of this method are performed on the TIMIT database.
Based on the characteristics of Chinese speech, we propose in this paper a method for retrieving Chinese speech using the wavelet transform. First, we preprocess the data in the warehouse to provide high-quality speech data. We then divide the speech samples into subsections and extract their features. Finally, based on the Euclidean distance of scale parts, we retrieve the...
In this paper we present a novel distance measure, the minimum landscape distance (MLD). MLD provides a non-linear mapping from the elements of one sequence to those of another. Each element in one sequence is mapped to the element in the other sequence that has the highest neighbourhood structural similarity (landscape) within a search window. Experimental results obtained on sequences representing binary...
A novel semi-blind defocused image deconvolution technique is proposed, based on an RBF neural network and iterative Wiener filtering. In this technique, an RBF neural network is first trained in the wavelet domain to estimate the defocus parameter. Once the point spread function (PSF) parameter has been obtained, an iterative Wiener filter is used to complete the restoration. We experimentally illustrate...
The speech segmentation problem can be formulated as estimating the locations and durations of the speech and non-speech components of the measured speech data. In this paper, a new segmentation method based on a time-scale transform is presented, together with one of its important applications in speech processing. The proposed scheme is tested on a number of speech recordings. The preliminary results are shown...