The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
This paper reviews few important and contemporary research contributions in the field of sign language biometrics, especially Indian sign language (ISL) biometrics. The current research study suggests that the work in this area is very limited in Indian context and therefore we have reviewed the existing work on ISL or similar biometrics for different languages. The scope of potential research in...
This work explores the nature of warping path and the shape of the gross spectrum for speaker information in text-dependent speaker verification under degraded condition. The nature of warping path is observed to follow a similar trend for given speaker across different sessions, due to the style of spoken delivery. The deviation of the warping path from the diagonal is taken as feature for discrimination...
Recently we have explored the use of a Gaussian mixture model (GMM) based global transformer for artificial bandwidth extension (ABWE) for improving the automatic recognition of children's speech in mismatched condition. As the spectral characteristic of the speech varies significantly from one sound class to another so the global transformation would be sub-optimal for that purpose. Motivated by...
The total variability i-vector based speaker verification system is one of the most successful systems in the recent NIST evaluations. It achieves significant improvement in performance over the conventional GMM-UBM based systems by using the projections of the GMM mean shifted supervectors to a low dimensional space for representation. This low dimensional projections are commonly referred to as...
In this work, we explore the use of sparse representation of GMM mean shifted supervectors over a learned dictionary for the speaker verification (SV) task. In this method the dictionaries are learned using the KSVD algorithm unlike the recently proposed SV methods employing the sparse representation classification (SRC) over exemplar dictionaries. The proposed approach with learned dictionary results...
In this paper, we present our initial study with the recently collected speech database for developing robust speaker recognition systems in Indian context. The database contains the speech data collected across different sensors, languages, speaking styles, and environments, from 200 speakers. The speech data is collected across five different sensors in parallel, in English and multiple Indian languages,...
The degradation in the automatic speech recognition performance of the adult speech trained models for children speech data is a well known problem. In this work, motivated by the voice conversion approaches for addressing the acoustic mis-match between the adult and children speech, we investigated the effect of pitch transformation on children speech on telephone-based connected digit recognition...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.