In this paper, the efficiency of Support Vector Machine (SVM) and Binary Support Vector Machine (BSVM) techniques in utterance-based emotion recognition is compared. Acoustic features including energy, Mel-frequency cepstral coefficients (MFCC), perceptual linear prediction (PLP), filter bank (FBANK), pitch, and their first and second derivatives are used as frame-based features. Four basic emotions...
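A minimal sketch of the kind of pipeline this abstract describes, assuming librosa and scikit-learn: frame-based MFCCs with first and second derivatives are pooled into an utterance-level vector and classified with an SVM. The waveforms, labels, and pooling statistics are illustrative stand-ins, and the BSVM variant is not shown.

```python
import numpy as np
import librosa
from sklearn.svm import SVC

def utterance_features(y, sr=16000):
    mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=13)  # frame-based MFCCs
    d1 = librosa.feature.delta(mfcc)                    # first derivatives
    d2 = librosa.feature.delta(mfcc, order=2)           # second derivatives
    frames = np.vstack([mfcc, d1, d2])                  # (39, n_frames)
    # Pool frames into one utterance-level vector (mean + std statistics).
    return np.concatenate([frames.mean(axis=1), frames.std(axis=1)])

# Stand-in waveforms; real use would load labelled emotional utterances.
utts = [np.random.randn(16000).astype(np.float32) for _ in range(8)]
labels = ["anger", "joy", "sadness", "neutral"] * 2     # four basic emotions
X = np.stack([utterance_features(u) for u in utts])
clf = SVC(kernel="rbf").fit(X, labels)
```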
Emotion representations are psychological constructs for modelling, analysing, and recognising emotion, one essential element of affect. Owing to the complexity of emotion, the boundaries between different emotion concepts are often fuzzy, which is also reflected in the diversity of emotion databases and their inconsistent target labels. When facing data scarcity, an ever-present issue for acoustic...
Studies have shown that ranking emotional attributes through preference learning methods has significant advantages over conventional emotional classification/regression frameworks. Preference learning is particularly appealing for retrieval tasks, where the goal is to identify speech conveying target emotional behaviors (e.g., positive samples with low arousal). With recent advances in deep neural...
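The ranking approach can be sketched as a pairwise preference learner (RankNet-style): a scorer is trained so that the sample preferred on a target emotional attribute (e.g. higher arousal) receives the higher score. This is a hedged illustration in PyTorch; the network size and the random data are placeholders, not the paper's model.

```python
import torch
import torch.nn as nn

# Scores a 39-dim acoustic feature vector; dimensions are illustrative.
scorer = nn.Sequential(nn.Linear(39, 64), nn.ReLU(), nn.Linear(64, 1))

def pairwise_loss(x_preferred, x_other):
    # P(preferred > other) is modelled as the sigmoid of the score difference,
    # so maximising it pushes preferred samples above non-preferred ones.
    diff = scorer(x_preferred) - scorer(x_other)
    return nn.functional.binary_cross_entropy_with_logits(
        diff, torch.ones_like(diff))

loss = pairwise_loss(torch.randn(16, 39), torch.randn(16, 39))
loss.backward()  # gradients flow into the scorer's parameters
```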
Automatic emotion recognition from speech is a challenging task that relies heavily on the effectiveness of the speech features used for classification. In this work, we study the use of deep learning to automatically discover emotionally relevant features from speech. We show that, using a deep recurrent neural network, we can learn both the short-time frame-level acoustic features that are emotionally...
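A minimal sketch of an utterance-level emotion classifier built on a recurrent network over frame-level features, assuming PyTorch; the layer sizes and input dimensions are illustrative and do not reproduce the paper's architecture.

```python
import torch
import torch.nn as nn

class EmotionRNN(nn.Module):
    def __init__(self, n_features=39, hidden=128, n_emotions=4):
        super().__init__()
        self.lstm = nn.LSTM(n_features, hidden, num_layers=2, batch_first=True)
        self.out = nn.Linear(hidden, n_emotions)

    def forward(self, frames):            # frames: (batch, n_frames, n_features)
        _, (h, _) = self.lstm(frames)     # h: (num_layers, batch, hidden)
        return self.out(h[-1])            # logits from the last layer's state

logits = EmotionRNN()(torch.randn(8, 200, 39))  # 8 utterances, 200 frames each
```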
Affective computing, particularly emotion and personality trait recognition, is of increasing interest in many research disciplines. The interplay of emotion and personality shows itself in the first impression left on other people. Moreover, ambient information, e.g. the environment and objects surrounding the subject, also affects these impressions. In this work, we employ pre-trained Deep Convolutional...
As emotion recognition from speech has matured to a degree where it becomes suitable for real-life applications, it is time to develop techniques for matching different types of emotional data with multi-dimensional and category-based annotations. The categorical approach is usually applied to acted ‘full-blown’ emotions, and multi-dimensional annotation is often preferred for spontaneous, real...
Video affective content analysis is an active research area in computer vision. Live-streaming video has become one of the dominant modes of communication in the past decade, so video affective content analysis plays a vital role. Existing work on video affective content analysis focuses on predicting the current state of the user using either visual or acoustic features. In this paper,...
This study compared the perception of Chinese sentences conveying the attitudinal contrast of praising and blaming by five groups of subjects (Chinese natives, Japanese L2 learners of Mandarin, French L2 learners of Mandarin, and Japanese and French subjects without any Mandarin ability). Context-elicited target sentences conveying a praising, blaming, or neutral attitude were used as stimuli in the listening...
Data selection is an important component of cross-corpus training and semi-supervised/active learning. However, its effect on acoustic emotion recognition is still not well understood. In this work, we perform an in-depth exploration of various data selection strategies for emotion classification from speech using classifier agreement as the selection metric. Our methods span both the traditional...
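A hedged sketch of agreement-based selection as the abstract describes it: two differently-biased classifiers are trained on a labelled seed set, and candidate samples are kept only where their predictions agree. The models and synthetic data here are assumptions, not the paper's setup.

```python
import numpy as np
from sklearn.svm import SVC
from sklearn.ensemble import RandomForestClassifier

def select_by_agreement(X_seed, y_seed, X_cand):
    a = SVC().fit(X_seed, y_seed)
    b = RandomForestClassifier().fit(X_seed, y_seed)
    pa, pb = a.predict(X_cand), b.predict(X_cand)
    agree = pa == pb                  # selection metric: classifier agreement
    return X_cand[agree], pa[agree]   # pseudo-labelled samples to add

rng = np.random.default_rng(0)        # synthetic stand-in for real corpora
X_seed, y_seed = rng.normal(size=(40, 10)), rng.integers(0, 2, 40)
X_new, y_new = select_by_agreement(X_seed, y_seed, rng.normal(size=(100, 10)))
```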
Boosted by a wide spectrum of potential applications, emotional speech recognition, i.e., the automatic computer-aided identification of human emotional states from speech signals, is currently a popular field of research. However, many studies, especially those concentrating on the recognition of negative emotions, have neglected the specific requirements of real-world scenarios, for example,...
Recent years have witnessed a growing interest in recognizing emotions and events based on speech. One application of such systems is automatically detecting when a situation gets out of hand and human intervention is needed. Most studies have focused on increasing recognition accuracy using parts of the same dataset for training and testing. However, this says little about how such a trained...
As the recognition of emotion from speech has matured to a degree where it becomes applicable in real-life settings, it is time for a realistic view of obtainable performance. Most studies tend to overestimate performance in this respect: acted data is often used rather than spontaneous data, results are reported on pre-selected prototypical data, and truly speaker-disjoint partitioning is still less common...
There are two main emotion annotation techniques: multi-dimensional and category-based. In order to conduct experiments on emotional data annotated with different techniques, two-class emotion mapping strategies (e.g. high- vs. low-arousal) are commonly used. The “affective computing” community has not specified the location of the emotionally neutral area in multi-dimensional emotion space (e.g. valence-arousal-dominance...
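A minimal sketch of such a two-class mapping, assuming a 1-5 arousal scale with a hypothetical neutral midpoint and margin; samples falling in the ambiguous band around the (unspecified) neutral area are discarded rather than forced into a class.

```python
def arousal_class(arousal, neutral=3.0, margin=0.5):
    """Map a 1-5 arousal rating to 'high', 'low', or None (near-neutral)."""
    if arousal >= neutral + margin:
        return "high"
    if arousal <= neutral - margin:
        return "low"
    return None  # ambiguous samples near the assumed neutral area are dropped

labels = [arousal_class(a) for a in (1.5, 2.8, 3.1, 4.6)]
# -> ['low', None, None, 'high']
```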
Detection of affective states in speech could improve the way users interact with electronic devices. However, analysis of speech at the acoustic level alone may not be enough to determine the emotion of a user speaking in a realistic scenario. In this paper we analysed the spontaneous speech recordings of the FAU Aibo Corpus at the acoustic and linguistic levels to extract two sets of acoustic and...
This paper describes three categorical classification approaches to spontaneous children's emotion recognition based on acoustic features from speech. We also present a fourth approach that combines the two best classifiers by stacked generalisation. We used the FAU Aibo Corpus to work under real-life conditions, dealing with spontaneous speech and with low emotional expressiveness, unbalanced data,...
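A hedged sketch of stacked generalisation with scikit-learn's StackingClassifier; the base learners and meta-learner shown are placeholders, not the classifiers reported for the FAU Aibo experiments.

```python
from sklearn.ensemble import StackingClassifier, RandomForestClassifier
from sklearn.svm import SVC
from sklearn.linear_model import LogisticRegression

stack = StackingClassifier(
    estimators=[("svm", SVC(probability=True)),
                ("rf", RandomForestClassifier())],
    final_estimator=LogisticRegression(),  # meta-learner over base predictions
    cv=5,  # out-of-fold predictions avoid leaking training labels upward
)
# stack.fit(X_train, y_train); stack.predict(X_test)
```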
Articulation training with many kinds of stimuli, such as visual, voice, and articulatory information, can teach users to pronounce correctly and improve their articulatory ability. In this paper, an articulation training system with an intelligent interface and multimodal feedback is proposed to improve the performance of articulation training. Clinical knowledge of speech evaluation is...
This paper describes a system that deploys acoustic and linguistic information from speech in order to decide whether an utterance carries negative or non-negative meaning. An earlier version of this system was submitted to the Interspeech-2009 Emotion Challenge evaluation. The speech data consist of short utterances of children's speech, and the proposed system is designed to detect anger in...
Increasing effort has recently been devoted to research on emotional speech. Although we may sometimes be able to make a definite perceptual decision on an emotional state, emotion is actually a kind of cline in a large vector space: different emotions can be thought of as zones along an emotional vector. To resolve the ambiguity of emotion perception, the authors conducted an array of perception experiments...
Recognition of emotion in speech usually uses acoustic models that ignore the spoken content; likewise, one general model per emotion is trained independently of the phonetic structure. Given sufficient data, this approach seemingly works well enough. Yet, this paper tries to answer the question of whether acoustic emotion recognition strongly depends on phonetic content, and whether models tailored for the...
Facial expression recognition can be divided into three steps: face detection, expression feature extraction, and expression categorization. Feature extraction and categorization are the key issues. To address them, we propose a method combining local binary patterns (LBP) and an embedded hidden Markov model (EHMM), which is the key contribution of this paper. This paper first...
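The LBP feature-extraction step can be sketched with scikit-image: per-block uniform-LBP histograms over a face crop yield a spatially ordered feature vector. The grid size and parameters are illustrative, and the EHMM categorization stage, which has no standard library implementation, is not shown.

```python
import numpy as np
from skimage.feature import local_binary_pattern

def lbp_block_histograms(face, grid=(4, 4), P=8, R=1):
    codes = local_binary_pattern(face, P, R, method="uniform")  # per-pixel codes
    h, w = face.shape
    bh, bw = h // grid[0], w // grid[1]
    hists = []
    for i in range(grid[0]):
        for j in range(grid[1]):
            block = codes[i * bh:(i + 1) * bh, j * bw:(j + 1) * bw]
            # Uniform LBP with P neighbours produces P + 2 distinct codes.
            hist, _ = np.histogram(block, bins=P + 2, range=(0, P + 2))
            hists.append(hist / max(hist.sum(), 1))  # normalised block histogram
    return np.concatenate(hists)  # one spatially-ordered feature vector

feats = lbp_block_histograms(np.random.rand(64, 64))  # stand-in for a face crop
```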