Ground truth labels obtained by averaging or majority voting are commonly used to train automatic emotion classifiers. However, ground truth labels fail to encapsulate inter-annotator variability and ignore the subjectivity of emotions. In this paper, we propose two viable approaches to model the subjectiveness of emotions by incorporating inter-annotator variability, which are soft labels and model...
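The contrast between conventional majority-vote labels and soft labels can be sketched as follows (the annotator votes and class names below are hypothetical, assuming one categorical vote per annotator per utterance):

```python
import numpy as np

# Hypothetical annotations: 5 annotators each assign one of 4 emotion
# classes (0=neutral, 1=happy, 2=sad, 3=angry) to every utterance.
votes = np.array([
    [1, 1, 3, 1, 3],   # utterance 1
    [2, 2, 2, 2, 0],   # utterance 2
])

n_classes = 4
# per-utterance vote counts for each class
counts = np.apply_along_axis(np.bincount, 1, votes, minlength=n_classes)
# conventional "ground truth": the majority-vote class
hard_labels = counts.argmax(axis=1)
# soft labels: the empirical annotator distribution, which preserves
# inter-annotator variability instead of discarding it
soft_labels = counts / counts.sum(axis=1, keepdims=True)
```

Trained with cross-entropy against `soft_labels`, a classifier learns the full distribution of annotator judgments rather than a single collapsed class.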
Most existing Speech Emotion Recognition (SER) systems rely on turn-wise processing, which aims to recognize emotions from complete utterances, and on an overly complicated pipeline marred by many preprocessing steps and hand-engineered features. To overcome both drawbacks, we propose a real-time SER system based on end-to-end deep learning. Namely, a Deep Neural Network (DNN) that recognizes emotions...
The introduction of Gaussian mixture models (GMMs) in the field of speaker verification has led to very good results. This paper illustrates an evolution in state-of-the-art speaker verification by highlighting the contribution of recently established information-theoretic vector quantization techniques. We explore the novel application of three different vector quantization algorithms, namely...
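One classic vector quantization algorithm used for speaker modeling, Linde-Buzo-Gray (LBG), can be sketched as follows (a minimal NumPy implementation; the splitting factor and iteration count are illustrative choices, not values from the paper):

```python
import numpy as np

def lbg_codebook(data, n_codewords, eps=0.01, iters=20):
    """Linde-Buzo-Gray vector quantization: start from the global
    centroid, then repeatedly split every codeword and refine the
    doubled codebook with k-means-style updates."""
    codebook = data.mean(axis=0, keepdims=True)
    while len(codebook) < n_codewords:
        # perturb each codeword in two directions to double the codebook
        codebook = np.vstack([codebook * (1 + eps), codebook * (1 - eps)])
        for _ in range(iters):
            # assign each vector to its nearest codeword (Euclidean)
            dists = np.linalg.norm(data[:, None, :] - codebook[None, :, :], axis=2)
            nearest = dists.argmin(axis=1)
            for i in range(len(codebook)):
                members = data[nearest == i]
                if len(members) > 0:   # leave empty cells unchanged
                    codebook[i] = members.mean(axis=0)
    return codebook
```

A speaker model is then the codebook trained on that speaker's feature vectors; at test time, an utterance can be scored by its average quantization distortion against each speaker's codebook.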
This study presents automatic stress recognition methods based on acoustic speech analysis. Novel approaches to feature extraction based on the nonlinear Teager energy operator (TEO) calculated within critical bands, discrete wavelet transform bands, and wavelet packet bands are presented. The classification process was performed using two types of neural networks: the multilayer perceptron neural...
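The nonlinear Teager energy operator at the heart of these features has a simple discrete form, Psi[x(n)] = x(n)^2 - x(n-1) * x(n+1); a minimal NumPy sketch:

```python
import numpy as np

def teager_energy(x):
    """Discrete Teager energy operator:
    Psi[x(n)] = x(n)^2 - x(n-1) * x(n+1),
    defined for the interior samples of x."""
    x = np.asarray(x, dtype=float)
    return x[1:-1] ** 2 - x[:-2] * x[2:]
```

For a pure tone A*sin(omega*n) the operator returns the constant A^2 * sin(omega)^2, so it tracks both amplitude and frequency; applying it band-by-band (critical bands, DWT bands, wavelet packet bands) yields the stress-sensitive features described above.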
The speech signal is an important tool for conveying information between humans; at the same time, it is an indicator of a speaker's emotions. In this paper, the automatic identification of affect from speech containing spontaneously expressed (not acted) emotions within different environments was investigated. The Teager energy operator-perceptual wavelet packet (TEO-PWP) features as well as the...
This study investigates the effects of a clinical environment on speaker recognition rates. Two sets of speakers were used: a clinical set containing speech recordings of 70 clinically depressed speakers and a control set containing 68 non-depressed speakers. Mel frequency cepstral coefficient (MFCC) features were used to produce statistical models of the speakers using four modeling methods: GMM_EM, GMM_K-means, GMM_LBG, and...
With suicidal behavior being linked to depression that starts at an early age of a person's life, many investigators are trying to find early tell-tale signs to assist psychologists in detecting clinical depression through acoustic analysis of a patient's speech. The purpose of this paper was to study the effectiveness of Mel frequency cepstral coefficients (MFCCs) in capturing the overall mental...
This study proposes a classification-based facial expression recognition method using a bank of multilayer perceptron neural networks. Six different facial expressions were considered. Firstly, logarithmic Gabor filters were applied to extract the features. Optimal subsets of features were then selected for each expression, down-sampled and further reduced in size via principal component analysis...
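The PCA dimensionality-reduction step mentioned above can be sketched with a plain SVD (random data stands in for the selected log-Gabor features, which are not reproduced here; the target dimensionality is illustrative):

```python
import numpy as np

rng = np.random.default_rng(42)
X = rng.normal(size=(200, 64))    # hypothetical feature vectors (samples x features)
Xc = X - X.mean(axis=0)           # center the data
U, S, Vt = np.linalg.svd(Xc, full_matrices=False)
k = 10                            # illustrative reduced dimensionality
X_reduced = Xc @ Vt[:k].T         # project onto the top-k principal axes
```

The rows of `Vt` are the principal directions in decreasing order of explained variance, so keeping the first `k` retains the most informative subspace before classification.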
A novel method for facial expression recognition from sequences of image frames is described and tested. The expression recognition system is fully automatic, and consists of the following modules: face detection, maximum arousal detection, feature extraction, selection of optimal features, and facial expression recognition. The face detection is based on the AdaBoost algorithm and is followed by the...