The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
The major activity during speech production is glottal activity and is earlier detected using strength of excitation (SoE). This work uses the normalized autocorrelation peak strength (NAPS) and higher order statistics (HOS) as additional features for detecting glottal activity. The three features, namely, SoE, NAPS, and HOS, are, respectively indicators of different attributes of glottal activity,...
This work proposes two different methods for polarity detection in speech and Electroglottograph (EGG) signals using Hilbert Envelope (HE). HE is defined as the magnitude of complex time function and hence an unipolar signal. The zero frequency filtering (ZFF) obtained from HE of LP residual is of same phase for both polarity. Alternatively, the ZFF of speech and EGG, integrated linear prediction...
The analysis of various components of the Electroglottograph (EGG) signal, obtained after Ensemble Empirical Mode Decomposition (EEMD) is the primary objective of this paper. The ability of EEMD to detect intermittent high frequency data embedded in the data of lower frequency is exploited to segregate the Epoch locations and the Periodic nature of EGG signal. The dyadic filterbank property of EEMD...
In this paper a simple method is proposed using zero frequency filtering (ZFF) of a close approximate glottal flow derivative (GFD) to extract glottal closure (GCI's) and opening instants (GOI's) from speech. The GFD is obtained from iterative adaptive inverse filtering (IAIF) which contains such instants. It is observed that GCI's can be located by positive zero crossings of zero frequency filtered...
This work treats vowels and semivowels as vowellike regions. An analysis of the spurious vowel-like regions (VLRs) detected by a signal processing based method using excitation source information is demonstrated. Limitation of excitation information in detecting some of the nasals and voiced consonants as non-VLRs is discussed. An attempt to reduce spurious VLRs compared to the existing signal processing...
This work proposes a modified zero frequency filtering (ZFF) method for epoch extraction from emotional speech. Epochs refers the instants of maximum excitation of the vocal tract. In the conventional ZFF method, the epochs are estimated by trend removing the output of the zero frequency resonator (ZFR) using the window length equal to the average pitch period of the utterance. Use of this fixed window...
In this paper, we present our initial study with the recently collected speech database for developing robust speaker recognition systems in Indian context. The database contains the speech data collected across different sensors, languages, speaking styles, and environments, from 200 speakers. The speech data is collected across five different sensors in parallel, in English and multiple Indian languages,...
This study analyzes the effect of stress in human and automatic stressed speech processing tasks for speech collected from non-professional speakers. The database of 33 keywords is collected under five stress conditions, namely, neutral, angry, happy, sad and Lombard from fifteen speakers. The first study is to understand the ability to identify stress by human and automatic speech processing. The...
In this work, we present a bimodal biometric system using speech and face features and tested its performance under degraded condition. Speaker verification (SV) system is built using Mel-Frequency Cepstral Coefficients (MFCC) followed by delta and delta-delta for feature extraction and Gaussian Mixture Model (GMM) for modeling. A face verification (FV) system is built using the combination of Principal...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.