The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Single channel speech separation (SCSS) is widely used in many real-time applications such as preprocessing stage for speech recognition to control humanoid robots and in hearing aid. The performance of the separation is crucial for these applications. In this paper, we propose a new approach for unsupervised SCSS. The separation relies on an optimization of the subspace separation by decomposing...
In real-life environment, the speech of interest is often correlated with different kinds of perturbation. Perturbation can be caused by speaking or non-speaking noise, or even by reverberation. This could make the speech signal auditable but not intelligible. In this case, speech cannot be exploited by other automated applications such as voice-command or speech/speaker identification and identification...
In this paper we present a comparative analysis of different time-frequency (T-F) masking techniques used for single channel speech separation (SCSS). We survey T-F masking concept and compare different types of masks in different criteria. The comparison is conduct theoretically by mathematical study and numerically by objective and subjective assessment. Also, we study the effect of the masking...
In this paper we study the masking effect on Computational Auditory Scene Analysis (CASA) based systems for single channel speech separation (SCSS). In this study, we focus on the benchmark masks of the literature that are namely: the ideal binary mask (IBM), the binary mask (BM) and soft mask. Each system is evaluated objectively and subjectively in order to highlight the effect of each mask on the...
We present an approach of single channel speech separation based on sinusoidal modeling and multi-scale product analysis. To construct the sinusoidal model of speech, we need to determine three parameters which are namely frequency, amplitude and phase. For fundamental frequency determination, we apply an effective method based on multi-scale product analysis. For amplitude estimation, we adopt an...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.