The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Gaussian mixture model (GMM) supervector is one of the effective techniques in text independent speaker recognition. In our previous work, we introduce the GMM-UBM mean interval (GUMI) concept based on the Bhattacharyya distance. Subsequently GUMI kernel was successfully used in conjunction with support vector machine (SVM) for speaker recognition. Besides the first order statistics, it is generally...
In this paper, we propose two cluster criterion functions which aim to maximize the separation between intra-cluster distances and inter-cluster distances. These criteria can automatically deduce the desired number of clusters based on their extremized values. We then propose an algorithm to apply our criterion functions in conjunction with spectral clustering. By exploiting the characteristic of...
Gaussian mixture model (GMM) and support vector machine (SVM) have become popular classifiers in text-independent speaker recognition. A GMM-supervector characterizes a speaker's voice with the parameters of GMM, which include mean vectors, covariance matrices, and mixture weights. GMM-supervector SVM benefits from both GMM and SVM frameworks to achieve the state-of-the-art performance. Conventional...
We usually build a prosody model to predict the prosodic parameters, which will be used as part of the criteria for unit selection. Spectral appropriateness of units is usually ensured by using identities of context units, which are linguistic symbols. With looking into the spectral properties of the actual signal, the spectral mismatches are often perceived in the synthetic speech. In this paper,...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.