The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
This work explores the nature of warping path and the shape of the gross spectrum for speaker information in text-dependent speaker verification under degraded condition. The nature of warping path is observed to follow a similar trend for given speaker across different sessions, due to the style of spoken delivery. The deviation of the warping path from the diagonal is taken as feature for discrimination...
In this work, we explore the use of sparse representation of GMM mean shifted supervectors over a learned dictionary for the speaker verification (SV) task. In this method the dictionaries are learned using the KSVD algorithm unlike the recently proposed SV methods employing the sparse representation classification (SRC) over exemplar dictionaries. The proposed approach with learned dictionary results...
In this paper we describe the collection and organization of the speaker recognition database in Indian scenario named as IITG Multivariability Speaker Recognition Database. The database contains speech from 451 speakers speaking English and other Indian languages both in conversational and read speech styles recorded using various sensors in parallel under different environmental conditions. The...
In this paper, we present our initial study with the recently collected speech database for developing robust speaker recognition systems in Indian context. The database contains the speech data collected across different sensors, languages, speaking styles, and environments, from 200 speakers. The speech data is collected across five different sensors in parallel, in English and multiple Indian languages,...
In this work, we have studied the effectiveness of recently proposed warp factor estimation method based on correlation between the pitch frequency and the warp factor for speaker normalization. Our study shows that estimation of warp factors using maximum a posteriori criterion by exploiting the statistics from the pitch based warp factor estimation approach results in 12.5% relative improvement...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.