Search results for: S.M. Mirrezaie

Items from 1 to 6 out of 6 results

chapter

A new approach for spoken language identification based on sequence kernel SVMs

A. Ziaei, S.M. Ahadi, H. Yeganeh, S.M. Mirrezaie

2009 16th International Conference on Digital Signal Processing > 1 - 4

2009 16th International Conference on Digital Signal Processing (DSP)

A new back-end classifier for GMM-LM based language identification systems is proposed in this paper. The proposed system consists of a mapping matrix and a back-end classifier of SVMs as its main parts, located in series after the GMM-LM system. While the mapping matrix maps the language model's output vectors to a new space in which the languages are more separable than before, each SVM in the SVM...

chapter

Spoken Language Identification Using a New Sequence Kernel-based SVM Back-end Classifier

A. Ziaei, S.M. Ahadi, S.M. Mirrezaie, H. Yeganeh

2008 IEEE International Symposium on Signal Processing and Information Technology > 324 - 329

2008 8th IEEE International Symposium on Signal Processing and Information Technology. ISSPIT 2008

In this paper, we present a new back-end classifier for GMM-LM based language identification systems. Our new proposed system consists of two main parts, mapping matrix and bank of SVMs. These two parts are located in series after GMM-LM system. The mapping matrix, maps the language models' output vectors to a new space in which the languages are more separable than before. Then each SVM in the SVM...

chapter

A Particle Swarm Optimization-Based Approach to Speaker Segmentation Based on Independent Component Analysis on GSM Digital Speech

S.M. Mirrezaie, K. Faez, A. Asnaashari, A. Ziaei

2008 IEEE International Symposium on Signal Processing and Information Technology > 502 - 507

2008 8th IEEE International Symposium on Signal Processing and Information Technology. ISSPIT 2008

Adaptive Multi-Rate (AMR) codec was standardized for GSM in 1999. AMR offers substantial improvement over previous GSM speech codecs in error robustness by adapting speech and channel coding depending on channel conditions. The Adaptive Multi-Rate speech codec is adopted as a standard for IMT-2000 by ETSI and 3GPP and consists of eight source codecs with bit rates from 4.75 to 12.2 kbit/s. In this...

chapter

Weighting of Mel Sub-bands Based on SNR/Entropy for Robust ASR

H. Yeganeh, S.M. Ahadi, S.M. Mirrezaie, A. Ziaei

2008 IEEE International Symposium on Signal Processing and Information Technology > 292 - 296

2008 8th IEEE International Symposium on Signal Processing and Information Technology. ISSPIT 2008

Mel-frequency cepstral coefficients (MFCC) are the most widely used features for speech recognition. However, MFCC-based speech recognition performance degrades in presence of additive noise. In this paper, we propose a set of noise-robust features based on conventional MFCC feature extraction method. Our proposed method consists of two steps. In the first step, mel sub-band Wiener filtering is carried...

chapter

Speaker diarization in a multi-speaker environment using particle swarm optimization and mutual information

S.M. Mirrezaie, S.M. Ahadi

2008 IEEE International Conference on Multimedia and Expo > 1533 - 1536

2008 IEEE International Conference on Multimedia and Expo (ICME)

The duty of speaker diarization comprises of answering the question ldquoWho spoke when?rdquo. In this paper, we present an approach comprising of PSO (particle swarm optimization) algorithm, which encodes possible segmentations of an audio record by measuring mutual information between these segments and the audio data.. This measure is used as the fitness function for the PSO. This algorithm has...

chapter

Robust Speaker Diarization in a Multi-Speaker Environment Using Autocorrelation-based Noise Subtraction

S.M. Mirrezaie, S.M. Ahadi, A. Kashi

2007 IEEE International Symposium on Signal Processing and Information Technology > 291 - 296

2007 IEEE International Symposium on Signal Processing and Information Technology

This paper shows research performed into the topic of speaker diarization for multi-speaker environment. It looks into the algorithms and the implementation of an offline speaker segmentation and indexing system for recorded speech data where usually more than one speaker is present. Speaker diarization is a well studied topic in the domain of broadcast news recordings. Most of the proposed systems...

Filter options

Publication date

Set your own date range

Keywords

ENTROPY (3)
FEATURE EXTRACTION (3)
GENETIC ALGORITHM (3)
SPEECH (3)
SPEECH RECOGNITION (3)
DATABASES (2)
GENETIC ALGORITHMS (2)
KERNEL (2)
LANGUAGE IDENTIFICATION (2)
LINEAR DISCRIMINANT MATRIX (2)
MAPPING MATRIX (2)
MULTISPEAKER ENVIRONMENT (2)
MUTUAL INFORMATION (2)
OGI-TS MULTILANGUAGE TASK (2)
PARTICLE SWARM OPTIMIZATION (2)
SEQUENCE KERNEL SVM (2)
SPEAKER DIARIZATION (2)
SPEAKER RECOGNITION (2)
SPEECH PROCESSING (2)
SPOKEN LANGUAGE IDENTIFICATION (2)
SUPPORT VECTOR MACHINE CLASSIFICATION (2)
SUPPORT VECTOR MACHINES (2)
TRAINING (2)
ACCURACY (1)
ADAPTIVE CODES (1)
ADAPTIVE MULTIRATE (AMR) (1)
ADAPTIVE MULTIRATE SPEECH CODEC (1)
ADDITIVE NOISE (1)
AUDIO CODING (1)
AUDIO RECORD SEGMENTATION (1)
AUDIO RECORD SEGMENTATIONS (1)
AUDIO SIGNAL PROCESSING (1)
AUTOCORRELATION-BASED NOISE SUBTRACTION (1)
BACK-END CLASSIFIER (1)
BIT RATE (1)
BROADCAST NEWS RECORDING (1)
CELLULAR RADIO (1)
CEPSTRAL ANALYSIS (1)
CEPSTRUM PARAMETER FORMATION (1)
CHANNEL CODING (1)
CODECS (1)
CORRELATION METHODS (1)
COVARIANCE MATRIX (1)
DATA CLUSTERING (1)
DATA MINING (1)
DISTBIC ALGORITHM (1)
FEATURE EXTRACTION METHOD (1)
GALLIUM (1)
GAUSSIAN MIXTURE (1)
GAUSSIAN MIXTURE MODEL (1)
GAUSSIAN MIXTURE MODELS (1)
GAUSSIAN PROCESSES (1)
GMM-LM (1)
GSM (1)
GSM DIGITAL SPEECH CODEC (1)
INDEPENDENT COMPONENT ANALYSIS (1)
INDEXING (1)
INDEXING SYSTEM (1)
LANGUAGE IDENTIFICATION SYSTEMS (1)
MATRIX ALGEBRA (1)
MEETINGS INDEXING (1)
MEL FREQUENCY CEPSTRAL COEFFICIENT (1)
MEL FREQUENCY CEPSTRAL COEFFICIENTS (1)
MEL SUBBAND WEIGHTING (1)
MEL-FREQUENCY CEPSTRAL COEFFICIENTS (1)
MUTUAL INFORMATION MEASUREMENT (1)
NATURAL LANGUAGE PROCESSING (1)
NOISE (1)
NOISY SPEECH (1)
OFFLINE SPEAKER SEGMENTATION (1)
PARTICLE SWARM OPTIMISATION (1)
PARTICLE SWARM OPTIMIZATION (PSO) (1)
PATTERN CLASSIFICATION (1)
PATTERN CLUSTERING (1)
PROBABILISTIC LOGIC (1)
PSO CONVERGENCE (1)
ROBUST ASR (1)
ROBUST SPEAKER DIARIZATION (1)
ROBUST SPEECH RECOGNITION (1)
ROBUSTNESS (1)
SEQUENCE KERNEL-BASED SVM BACK-END CLASSIFIER (1)
SIGNAL TO NOISE RATIO (1)
SNR (1)
SPEAKER SEGMENTATION (1)
SPEAKER SEGMENTATION AND CLUSTERING (1)
SPEAKER SEGMENTATION AND INDEXING (1)
SPEECH CODECS (1)
SPEECH CODING (1)
SPEECH SIGNAL CHARACTERISTICS (1)
SUB-BAND WIENER FILTERING (1)
WIENER FILTERS (1)
more

INFONA - science communication portal

Search results for: S.M. Mirrezaie

A new approach for spoken language identification based on sequence kernel SVMs

Spoken Language Identification Using a New Sequence Kernel-based SVM Back-end Classifier

A Particle Swarm Optimization-Based Approach to Speaker Segmentation Based on Independent Component Analysis on GSM Digital Speech

Weighting of Mel Sub-bands Based on SNR/Entropy for Robust ASR

Speaker diarization in a multi-speaker environment using particle swarm optimization and mutual information

Robust Speaker Diarization in a Multi-Speaker Environment Using Autocorrelation-based Noise Subtraction

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options