A new back-end classifier for GMM-LM based language identification systems is proposed in this paper. The proposed system consists of a mapping matrix and a back-end classifier of SVMs as its main parts, located in series after the GMM-LM system. While the mapping matrix maps the language model's output vectors to a new space in which the languages are more separable than before, each SVM in the SVM...
In this paper, we present a new back-end classifier for GMM-LM based language identification systems. The proposed system consists of two main parts, a mapping matrix and a bank of SVMs, placed in series after the GMM-LM system. The mapping matrix maps the language models' output vectors to a new space in which the languages are more separable than before. Then each SVM in the SVM...
The Adaptive Multi-Rate (AMR) codec was standardized for GSM in 1999. AMR offers a substantial improvement in error robustness over previous GSM speech codecs by adapting speech and channel coding to channel conditions. The AMR speech codec has been adopted as a standard for IMT-2000 by ETSI and 3GPP and consists of eight source codecs with bit rates from 4.75 to 12.2 kbit/s. In this...
Mel-frequency cepstral coefficients (MFCC) are the most widely used features for speech recognition. However, MFCC-based speech recognition performance degrades in the presence of additive noise. In this paper, we propose a set of noise-robust features based on the conventional MFCC feature extraction method. Our proposed method consists of two steps. In the first step, mel sub-band Wiener filtering is carried...
The task of speaker diarization consists of answering the question “Who spoke when?”. In this paper, we present an approach based on a PSO (particle swarm optimization) algorithm, which encodes possible segmentations of an audio record by measuring mutual information between these segments and the audio data. This measure is used as the fitness function for the PSO. This algorithm has...
This paper presents research on speaker diarization for multi-speaker environments. It examines the algorithms and implementation of an offline speaker segmentation and indexing system for recorded speech data in which more than one speaker is usually present. Speaker diarization is a well-studied topic in the domain of broadcast news recordings. Most of the proposed systems...