In machine recognition, the benefit of utilizing multiple sources of evidence lies in the combination schemes employed. In speaker verification (SV) tasks, the score-level combination scheme is widely used. Score-level combination provides notable improvements in overall performance, but only when the evidence from different features is complementary in nature. It is conjectured that collectively contributed...
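The score-level scheme described above can be sketched as a weighted sum of normalized scores from two systems. The scores, the min-max normalization, and the weight `alpha` below are illustrative assumptions, not details from the paper; in practice the weight would be tuned on a development set.

```python
def minmax_norm(scores):
    """Map a list of raw scores to [0, 1] so systems are comparable."""
    lo, hi = min(scores), max(scores)
    return [(s - lo) / (hi - lo) for s in scores]

def fuse(scores_a, scores_b, alpha=0.6):
    """Weighted-sum fusion of two normalized score lists."""
    a = minmax_norm(scores_a)
    b = minmax_norm(scores_b)
    return [alpha * x + (1 - alpha) * y for x, y in zip(a, b)]

# Example: scores for four verification trials from two feature streams.
fused = fuse([1.2, -0.3, 0.8, 2.0], [0.1, 0.4, 0.9, 0.7])
```

Fusion only helps when the two streams make different errors, which is exactly the complementarity condition the abstract points to.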
In this paper, a text-independent speaker recognition system based on Gaussian mixture models (GMMs) was developed, with a specific focus on the use of a voice activity detection (VAD) algorithm in training and testing. At the training level, a modified expectation-maximization (EM) algorithm is used. It is less prone to becoming trapped in a local maximum and therefore has a better chance of converging...
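The paper's specific EM modification is not described in this snippet; as a baseline for comparison, a standard two-component 1-D GMM fit by EM can be sketched as below, with means initialized at the data extremes as one simple way to reduce the risk of collapsing onto a single mode. The data are synthetic.

```python
import math
import random

def em_gmm_1d(xs, n_iter=50):
    """Fit a two-component 1-D GMM with standard EM."""
    mu = [min(xs), max(xs)]      # init means at the extremes
    var = [1.0, 1.0]
    w = [0.5, 0.5]
    for _ in range(n_iter):
        # E-step: per-sample component responsibilities
        resp = []
        for x in xs:
            p = [w[k] / math.sqrt(2 * math.pi * var[k])
                 * math.exp(-(x - mu[k]) ** 2 / (2 * var[k]))
                 for k in range(2)]
            s = sum(p)
            resp.append([pk / s for pk in p])
        # M-step: re-estimate weights, means and variances
        for k in range(2):
            nk = sum(r[k] for r in resp)
            w[k] = nk / len(xs)
            mu[k] = sum(r[k] * x for r, x in zip(resp, xs)) / nk
            var[k] = sum(r[k] * (x - mu[k]) ** 2
                         for r, x in zip(resp, xs)) / nk + 1e-6
    return w, mu, var

rng = random.Random(42)
data = [rng.gauss(0, 1) for _ in range(200)] + \
       [rng.gauss(5, 1) for _ in range(200)]
w, mu, var = em_gmm_1d(data)   # means should approach 0 and 5
```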
Voice applications often require the ability to make user-friendly responses by judging the user or user type from an extremely short utterance, such as a single word. However, performance is expected to degrade as the utterance length decreases. In this paper, we examine the performance of speaker identification for extremely short utterances of less than two seconds and then study the...
Binaural features, namely the interaural level difference and the interaural phase difference, have proved very effective for training deep neural networks (DNNs) to generate time-frequency masks for target speech extraction in speech-speech mixtures. However, the effectiveness of binaural features is reduced in the more common speech-noise scenarios, since the noise may overshadow the speech in adverse conditions...
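As a small illustration of the two binaural cues named above, the interaural level difference (ILD) and interaural phase difference (IPD) of a single time-frequency bin can be computed from the complex left/right STFT values. The bin values below are made up for illustration.

```python
import cmath
import math

def ild_db(left, right, eps=1e-12):
    """Interaural level difference of one T-F bin, in dB."""
    return 20 * math.log10((abs(left) + eps) / (abs(right) + eps))

def ipd(left, right):
    """Interaural phase difference of one T-F bin, in (-pi, pi]."""
    return cmath.phase(left * right.conjugate())

# Toy STFT values: the left channel is twice as strong and in phase.
left, right = complex(1.0, 0.5), complex(0.5, 0.25)
level_diff = ild_db(left, right)   # about +6 dB
phase_diff = ipd(left, right)      # about 0 rad
```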
This paper targets a generalized vocal-mode classifier (speech/singing) that works on audio data from an arbitrary source. Previous studies on sound classification are commonly based on cross-validation using a single dataset, without considering training-recognition mismatch. In our study, two experimental setups are used: a matched training-recognition condition and a mismatched training-recognition...
The enhancement of speech degraded by the non-stationary noise types that typify real-world conditions has remained a challenging problem for several decades. However, the recent use of data-driven methods for this task has brought great performance improvements. In this paper, we develop a speech enhancement framework based on the extreme learning machine. Experimental results show that the proposed...
An accurate dialect identification technique helps improve the speech recognition systems that exist in most present-day electronic devices, and it is also expected to enable new services in the fields of e-health and telemedicine, which are especially important for older and homebound people. The accuracy of a dialect identification system is highly dependent on its speech corpora. Therefore,...
Emotional databases can be classified into spontaneous and simulated emotions. Spontaneous emotions can be identified based on two parameters, 1) arousal and 2) valence, represented in a two-dimensional plane. Arousal measures how calming or exciting the information is, whereas valence measures the positive or negative affectivity of the information. The objective of this paper is to predict the arousal...
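A minimal sketch of the two-dimensional arousal-valence plane just described: a (valence, arousal) point in [-1, 1]^2 maps to one of four quadrants. The emotion labels here are conventional illustrative examples, not categories taken from the paper.

```python
def quadrant(valence, arousal):
    """Map a point in the valence-arousal plane to a quadrant label."""
    if valence >= 0:
        return "excited" if arousal >= 0 else "calm"
    return "angry" if arousal >= 0 else "sad"

label = quadrant(0.7, 0.8)   # positive valence, high arousal
```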
In this paper, Transcriber, a tool that automatically transcribes interviews in Indonesian using speech-to-text and speaker diarization technology, is described. Its main features are generating interview transcriptions automatically and providing an option to group the transcript by speaker when required. Transcriber is designed to work in two modes that give users the freedom to provide...
In the presence of environmental noise, speaker verification systems inevitably see a decrease in performance. This paper proposes (1) the use of two parallel classifiers, (2) feature enhancement based on blind signal-to-noise ratio (SNR) estimation, and (3) fusion to improve the performance of speaker verification systems. The two classifiers are based on Gaussian mixture models and the partial least-squares...
The paper proposes the use of only mostly voiced speech (MVS) for speaker verification (SV). The speech is partitioned into an MVS part and a non-MVS part by a simple machine classifier. SV experiments were conducted with a standard Gaussian mixture model (GMM) with universal background model (UBM) system and a GMM with a computationally improved individual background model (IBM) system. They demonstrate...
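The decision rule commonly underlying GMM-UBM verification is the average log-likelihood ratio between the speaker model and the background model over the test frames. The 1-D toy models and frame values below are assumptions for illustration, not figures from the paper.

```python
import math

def gmm_logpdf(x, components):
    """Log-likelihood of a scalar frame under a 1-D GMM, with
    components given as (weight, mean, variance) tuples."""
    probs = [w / math.sqrt(2 * math.pi * v)
             * math.exp(-(x - m) ** 2 / (2 * v))
             for w, m, v in components]
    return math.log(sum(probs))

def llr_score(frames, speaker_gmm, background_gmm):
    """Average log-likelihood ratio over the test frames."""
    return sum(gmm_logpdf(x, speaker_gmm) - gmm_logpdf(x, background_gmm)
               for x in frames) / len(frames)

ubm = [(0.5, 0.0, 1.0), (0.5, 4.0, 1.0)]       # toy background model
spk = [(0.5, 0.2, 0.8), (0.5, 3.8, 0.8)]       # toy adapted speaker model
score = llr_score([0.1, 3.9, 0.3], spk, ubm)   # > 0 favors the speaker
```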
In this paper, telephone conversation test results are reported. The main goal of the research is to derive a quality assessment model for today's Voice over Internet Protocol (VoIP) communication, including the influence of end-point terminals with their internal signal processing. For this reason, two different terminals were used during the test, and as wide a range of impairments as possible was simulated,...
Good speaker recognition systems should identify the speaker irrespective of what is spoken, including non-speech sounds that are often produced during natural conversations. In this work, the inclusion of breath sounds in the training phase of the speaker recognition is analyzed using the popular Gaussian mixture model-universal background model (GMM-UBM) and deep neural network (DNN) based systems...
This paper aims to compare the Linear Predictive Cepstral Coefficients (LPCC) method, the Mel-frequency Cepstral Coefficients (MFCC) method, their concatenation (LPCC-MFCC), and a newly proposed feature fusion approach based on this concatenation with normalization by the respective averages, Linear Predictive and Mel-frequency Cepstral Coefficients (LMACC), through applying a multi-layer...
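A rough sketch of the fusion idea, assuming per-dimension average (mean) normalization of each stream followed by frame-wise concatenation; the tiny two-frame `mfcc`/`lpcc` arrays are placeholders, since real coefficients would come from a feature extractor.

```python
def mean_normalize(frames):
    """Subtract each dimension's average across frames."""
    dims = len(frames[0])
    means = [sum(f[d] for f in frames) / len(frames) for d in range(dims)]
    return [[f[d] - means[d] for d in range(dims)] for f in frames]

def concat_features(mfcc, lpcc):
    """Normalize both streams, then concatenate frame by frame."""
    a, b = mean_normalize(mfcc), mean_normalize(lpcc)
    return [fa + fb for fa, fb in zip(a, b)]

mfcc = [[1.0, 2.0], [3.0, 4.0]]       # 2 frames x 2 placeholder coefficients
lpcc = [[0.5, 0.1], [1.5, 0.3]]
fused = concat_features(mfcc, lpcc)   # 2 frames x 4 dimensions
```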
This paper compares the use of signal to noise ratio (SNR)-dependent and SNR-independent mixtures of probabilistic linear discriminant analysis (PLDA) versus conventional PLDA, under multi-noise and multi-SNR conditions for a small-set speaker verification system. Results indicate that conventional PLDA is more robust under multi-SNR conditions. The effect of the testing speech length is also examined...
The performance of speech emotion classifiers degrades greatly when the training conditions do not match the testing conditions. This problem is observed in cross-corpora evaluations, even when the corpora are similar. The lack of generalization is particularly problematic when the emotion classifiers are used in real applications. This study addresses this problem by combining active learning (AL)...
When emotion recognition systems are used in new domains, classification performance usually drops due to mismatches between training and testing conditions. Annotating new data in the new domain is expensive and time-consuming. Therefore, it is important to design strategies that efficiently use a limited amount of new data to improve the robustness of the classification system. The use of...
Dialect can be defined as a variety of a language that is distinguished from other varieties of the same language by pronunciation, grammar and vocabulary. The process of recognizing such dialects is called Dialect Identification. Kamrupi, although a dialect of the Assamese language, is spoken both in Assam (Kamrup district) and North Bengal. In this paper, we describe a method to identify not just...
This paper describes a method for Speech Emotion Recognition (SER) using a Deep Neural Network (DNN) architecture with convolutional, pooling and fully connected layers. We used a 3-class subset (angry, neutral, sad) of the German corpus (Berlin Database of Emotional Speech) containing 271 labeled recordings with a total length of 783 seconds. Raw audio data were standardized so that every audio file has zero mean...
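The per-file standardization step mentioned above (zero mean, and typically unit variance as well) can be sketched as follows; the short synthetic sample list stands in for raw audio.

```python
import math

def standardize(samples):
    """Scale one audio file to zero mean and unit variance."""
    n = len(samples)
    mean = sum(samples) / n
    var = sum((s - mean) ** 2 for s in samples) / n
    std = math.sqrt(var) or 1.0   # guard against a constant (silent) file
    return [(s - mean) / std for s in samples]

audio = [0.2, -0.1, 0.4, 0.3, -0.5]   # synthetic stand-in for raw samples
z = standardize(audio)
```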
This special plenary session will celebrate Prof. McCluskey (who passed away in 2016) through three keynote speeches by world-renowned scholars on the next wave of pioneering innovations, starting with a memorial speech by Prof. Jacob Abraham of the University of Texas at Austin.