Wyniki wyszukiwania dla: G. Saha

Pozycje od 1 do 9 spośród 9 wyników

rozdział

Relevant subspace selection in Kernel feature space for speech recognition

J De, G Saha

2010 Annual IEEE India Conference (INDICON) > 1 - 5

2010 Annual IEEE India Conference (INDICON 2010)

This paper describes an approach to select the most relevant subspace in Kernel PCA feature space applied on MFCC coefficients for speech recognition. It has been seen that the relevant information about a supervised classification problem is contained in a finite number of leading Kernel PCA components if the Kernel matches the underlying classification problem. In this paper our contribution is...

rozdział

On the use of perceptual Line Spectral Pairs Frequencies for speaker identification

M. Sahidullah, G. Saha

2010 National Conference On Communications (NCC) > 1 - 5

2010 National Conference on Communications (NCC 2010)

Line Spectral Pairs Frequencies (LSFs) provide an alternative representation of the linear prediction coefficients. In this paper an investigation is carried out for extracting feature for speaker identification task which is based on perceptual analysis of speech signal and LSF. A modified version of the standard perceptual analysis is applied to obtain better performance. We have extracted the conventional...

rozdział

Analysis of Distance Measures for Pre-Quantization before Feature Extraction in Automatic Speaker Recognition

G. Sarkar, G. Saha

2009 Annual IEEE India Conference > 1 - 4

2009 Annual IEEE India Conference (INDICON 2009)

The total recognition time as well as the memory requirement in speaker recognition is mainly governed by the number of speakers, the number of frame vectors in the test sequence and the feature dimensionality. The adjacent frame vectors can show similarity in the feature space because of the slow movements of the articulators. Hence efficient frame selection techniques to select non-redundant frames...

rozdział

On the Use of Distributed DCT in Speaker Identification

M. Sahidullah, G. Saha

2009 Annual IEEE India Conference > 1 - 4

2009 Annual IEEE India Conference (INDICON 2009)

Feature extraction is one of the most significant stage in development of a speaker identification (SI) system. Most of the SI systems use mel-frequency cepstral coefficient (MFCC) as a parameter for representing the speech signal into compact form. MFCC are extracted through spectral weighting by a bank of overlapping triangular filters followed by a de-correlation process. Conventionally, discrete...

rozdział

Efficient pre-quantization techniques based on probability density for speaker recognition system

G. Sarkar, G. Saha

TENCON 2009 - 2009 IEEE Region 10 Conference > 1 - 6

TENCON 2009. 2009 IEEE Region 10 Conference

The amount of speaker specific information in speech signal varies from frame to frame depending on spoken text and environmental conditions. A frame selection at the preprocessing stage can be an added advantage in this context. In pre-quantization (PQ) we select a new sequence of frames Y from the original frames X such that length of Y is less than X. In this paper, we first analyze a number of...

rozdział

Improving speaker identification via Singular Value Decomposition based Feature Transformer

B.P. Mishra, S. Chakroborty, G. Saha

TENCON 2008 - 2008 IEEE Region 10 Conference > 1 - 6

TENCON 2008 - 2008 IEEE Region 10 Conference

State-of-the-art Speaker Identification (SI) systems use Gaussian Mixture Models (GMM) for modeling speakerspsila data. Using GMM, a speaker can be identified accurately even from a large number of speakers, when model complexity is large. However, lower ordered speaker model using GMM show poor accuracy as lesser number of Gaussian are involved. In SI context, not much attention have been paid towards...

rozdział

Capturing Complementary Information via Reversed Filter Bank and Parallel Implementation with MFCC for Improved Text-Independent Speaker Identification

S. Chakroborty, A. Roy, S. Majumdar, G. Saha

2007 International Conference on Computing: Theory and Applications (ICCTA'7) > 463 - 467

2007 International Conference on Computing Theory and Applications

A state of the art speaker identification (SI) system requires a robust feature extraction unit followed by a speaker modeling scheme for generalized representation of these features. Over the years, mel-frequency cepstral coefficients (MFCC) modeled on the human auditory system have been used as a standard acoustic feature set for SI applications. However, due to the structure of its filter bank,...

rozdział

Fusion of a Complementary Feature Set with MFCC for Improved Closed Set Text-Independent Speaker Identification

S. Chakroborty, A. Roy, G. Saha

2006 IEEE International Conference on Industrial Technology > 387 - 390

2006 IEEE International Conference on Industrial Technology

A state of the art speaker identification (SI) system requires a robust feature extraction unit followed by a speaker modeling scheme for generalized representation of these features. Over the years, Mel-frequency cepstral coefficients (MFCC) modeled on the human auditory system have been used as a standard acoustic feature set for SI applications. However, due to the structure of its filter bank,...

rozdział

Log Gabor Wavelet and Maximum a Posteriori Estimator in Speaker Identification

S. Senapati, S. Chakroborty, G. Saha

2006 Annual IEEE India Conference > 1 - 6

2006 Annual IEEE India Conference

Speaker identification (SI) system needs an efficient feature extraction process and an appropriate speaker model developed from these features. The work introduces the fusion of log Gabor wavelet (LGW) and maximum a posteriori (MAP) estimator for robust text-independent SI system. The focus of this paper is on the robustness to degradations produced by transmission over a telephone channel. Complete...

Opcje filtrowania

Słowa kluczowe:
FEATURE EXTRACTION

Data publikacji

Ustaw własny zakres dat

Dostępność treści

Dostępna (8)
Brak (1)

Słowa kluczowe

SPEAKER RECOGNITION (8)
SPEECH (6)
DATABASES (5)
MEL FREQUENCY CEPSTRAL COEFFICIENT (4)
SPEAKER IDENTIFICATION (4)
ACCURACY (3)
GAUSSIAN MIXTURE MODEL (3)
GAUSSIAN PROCESSES (3)
SILICON (3)
CHANNEL BANK FILTERS (2)
CORRELATION (2)
GAUSSIAN MIXTURE MODELS (2)
HUMAN AUDITORY SYSTEM (2)
MEL-FREQUENCY CEPSTRAL COEFFICIENTS (2)
MFCC (2)
MICROPHONE SPEECH (2)
PRE-QUANTIZATION (2)
SPEECH DATABASE (2)
SPEECH PROCESSING (2)
SPEECH SIGNAL (2)
STANDARD ACOUSTIC FEATURE SET (2)
TELEPHONE SPEECH (2)
AUDIO DATABASES (1)
AUDIO SIGNAL PROCESSING (1)
AUTOMATIC SPEAKER RECOGNITION (1)
CLASSIFIER PARADIGMS (1)
CLOSED SET TEXT-INDEPENDENT SPEAKER IDENTIFICATION (1)
COMPLEMENTARY FEATURE SET (1)
COMPLEMENTARY INFORMATION (1)
CONVERSATIONAL TELEPHONE KING-92 (1)
DATA MINING (1)
DATABASE MANAGEMENT SYSTEMS (1)
DE-CORRELATION PROCESS (1)
DISCRETE COSINE TRANSFORMS (1)
DISTANCE MEASURE (1)
DISTANCE MEASURE TECHNIQUE (1)
DISTANCE MEASURES (1)
DISTRIBUTED DISCRETE COSINE TRANSFORM (1)
EQUATIONS (1)
EUCLIDEAN DISTANCE (1)
FEATURE DIMENSIONALITY (1)
FEATURE EXTRACTION PROCESS (1)
FEATURE TRANSFORMER (1)
FILTER BANK (1)
FILTERING THEORY (1)
FRAME SELECTION (1)
GAUSSIAN MIXTURE MODEL (GMM) (1)
GMM (1)
GMM BASED CLASSIFIER (1)
HIERARCHICAL SPEAKER PRUNING (1)
KERNEL (1)
KERNEL FEATURE SPACE (1)
KERNEL PCA FEATURE SPACE (1)
KPCA (1)
LGW (1)
LINE SPECTRAL PAIRS FREQUENCIES (1)
LINEAR PREDICTION COEFFICIENT (1)
LOG GABOR WAVELET (1)
MAP (1)
MATHEMATICAL MODEL (1)
MAXIMUM A POSTERIORI ESTIMATOR (1)
MAXIMUM LIKELIHOOD ESTIMATION (1)
MFCC COEFFICIENTS (1)
MOMENTS (1)
NARROW BAND SPEECH UTTERANCE (1)
OVERLAPPING TRIANGULAR FILTERS (1)
PARALLEL IMPLEMENTATION (1)
PATTERN CLASSIFICATION (1)
PERCEPTUAL LINE SPECTRAL PAIR FREQUENCY (1)
PERCEPTUAL LINEAR PREDICTION (1)
POLYCOST (1)
POLYCOST DATABASE (1)
POLYNOMIAL CLASSIFIER (1)
PREQUANTIZATION TECHNIQUE (1)
PRINCIPAL COMPONENT ANALYSIS (1)
PROBABILITY (1)
PROBABILITY DENSITY FUNCTION (1)
PUBLIC DATABASES (1)
PUBLIC INFORMATION SYSTEMS (1)
QUANTISATION (SIGNAL) (1)
RELEVANT DIMENSION ESTIMATION (1)
RELEVANT SUBSPACE SELECTION (1)
REVERSED FILTER BANK (1)
ROBUST FEATURE EXTRACTION (1)
ROBUST FEATURE EXTRACTION UNIT (1)
SET THEORY (1)
SI (1)
SI SYSTEMS (1)
SINGULAR VALUE DECOMPOSITION (1)
SPEAKER IDENTIFICATION SYSTEM (1)
SPEAKER MODELING SCHEME (1)
SPEECH RECOGNITION (1)
SPEECH SIGNAL ANALYSIS (1)
STANDARD PERCEPTUAL ANALYSIS (1)
SUPERVISED CLASSIFICATION (1)
TELECOMMUNICATION CHANNELS (1)
TELEPHONE CHANNEL (1)
TEST SEQUENCE (1)
TESTING (1)
więcej

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania dla: G. Saha

Dodaj adresata

Anulowanie wysłania wiadomości

Czy na pewno chcesz anulować wysłanie wiadomości?

Wyślij wiadomość

Opcje filtrowania

Data publikacji

Ustawianie zakresu dat

Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.

Dostępność treści

Słowa kluczowe

Zgłaszanie błędu / nadużycia

Nieudane wysłanie zgłoszenia

Ułatwienia dostępu