Automatic voiceprint recognition, based on the human speech signal, serves many practical applications. A number of studies have been undertaken on the basis of normal speech. This research aims to develop an automatic voiceprint recognition system based on emotional speech signals in the Indonesian language. The study is limited to four different people with speeches of four distinctive emotional...
In this paper we present our approach to creating a voice-based interface for one of the leading Lithuanian bus route search systems, www.autobusubilietai.lt. We designed a hybrid speech recognition system based on one Lithuanian speech recognizer (LIEPA) and two foreign-language recognizers (German and Spanish). We experimented with different methods that may be used for combining outputs...
In this paper, a system based on support vector machines is proposed for content-based dialect classification and retrieval. This work is part of an ongoing effort to address the needs of under-resourced languages. The recognition system is intended to serve the interests of the Pashto-speaking community and to help keep the language's dialects alive. Voice samples are collected...
With the increase in human-machine interaction, speech analysis has become integral to bridging the gap between the physical and digital worlds. An important subfield within this domain is the recognition of emotion in speech signals, which was traditionally studied in linguistics and psychology. Speech emotion recognition has diverse applications. The prime objective of this paper...
To address the problem of low speech recognition rates, an improved method combining a Deep Belief Network (DBN) with a support vector machine (SVM) for analyzing small-sample speech signals is proposed. The speech signal data collected as training samples are used to train the DBN and obtain optimal parameter values. The trained DBN is then used for feature extraction, and these speech sample data signals...
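The DBN-as-feature-extractor-into-SVM pattern described in this abstract can be sketched as follows. This is a minimal illustration, not the paper's actual configuration: a single `BernoulliRBM` layer from scikit-learn stands in for a full DBN, and the data, labels, and hyperparameters are synthetic placeholders.

```python
# Sketch of the "DBN features fed to an SVM" pattern. A single RBM layer
# substitutes for a full DBN; all data and parameters are illustrative.
import numpy as np
from sklearn.neural_network import BernoulliRBM
from sklearn.pipeline import Pipeline
from sklearn.svm import SVC

rng = np.random.default_rng(0)
# Tiny synthetic "small-sample" dataset: 40 frames of 20-dim features in [0, 1].
X = rng.random((40, 20))
y = (X[:, 0] > 0.5).astype(int)  # toy labels

model = Pipeline([
    # Unsupervised feature learning on the raw frames.
    ("rbm", BernoulliRBM(n_components=10, learning_rate=0.05,
                         n_iter=20, random_state=0)),
    # SVM classifies in the learned feature space.
    ("svm", SVC(kernel="rbf")),
])
model.fit(X, y)
```

A real DBN would stack several RBM layers and fine-tune them before handing the activations to the SVM; the pipeline structure, however, stays the same.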
Traditional speech-related identity recognition commonly attends to a single aspect of the speech signal, but in reality speech signals comprise semantics, speaker-dependent features, and more. This paper therefore presents a new study that simultaneously recognizes multiple dimensions of speaker information. In order to extract sufficient relational features, both high-level and low-level features...
In this paper, we propose to classify pathological voices, in particular to discriminate among three organic pathologies (polyp, edema, and nodule) using new features. The principal contribution of this work is a new parameter that is more efficient than the classic MFCC: the MFCCs are calculated not from the speech signal itself but from the speech multiscale...
Several speech processing and audio data-mining applications rely on a description of the acoustic environment as a feature vector for classification. The discriminative properties of the feature domain play a crucial role in the effectiveness of these methods. In this work, we consider three environment identification tasks and the task of acoustic model selection for speech recognition. A set of...
Bidirectional long short-term memory (BLSTM) recurrent neural networks (RNNs) have achieved state-of-the-art performance in many sequence processing problems thanks to their capability of capturing contextual information. However, for languages with a limited amount of training data, it is still difficult to obtain a high-quality BLSTM model for emphasis detection, whose aim is to recognize the emphasized...
This paper addresses the problem of speech emotion recognition from movie audio tracks. The recently collected Acted Facial Expression in the Wild 5.0 database is used. The aim is to discriminate among angry, happy, and neutral. We extract a relatively small number of features, a subset of which is not commonly used for the emotion recognition task. Those features are fed as input to an ensemble classifier...
We propose a neural-network training algorithm that is robust to data imbalance in classification. In our proposed algorithm, weights are introduced for training examples, effectively modifying the trajectory traversed in parameter space during learning. Furthermore, the proposed algorithm reduces to normal stochastic gradient descent learning when the data are balanced. On the...
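One common instance of this idea, shown below as an illustrative sketch rather than the paper's exact algorithm, weights each example inversely to its class frequency and folds those weights into the SGD update. When the classes are balanced, every weight equals 1 and the update is plain stochastic gradient descent, matching the reduction the abstract describes.

```python
# Illustrative sketch: inverse-class-frequency example weights in SGD for
# logistic regression. Not the paper's algorithm; all names are our own.
import numpy as np

def weighted_sgd_logreg(X, y, epochs=200, lr=0.1, seed=0):
    rng = np.random.default_rng(seed)
    n, d = X.shape
    counts = np.bincount(y, minlength=2)
    w_ex = (n / (2.0 * counts))[y]  # weight = n / (2 * class count); 1 if balanced
    theta = np.zeros(d)
    for _ in range(epochs):
        for i in rng.permutation(n):
            p = 1.0 / (1.0 + np.exp(-X[i] @ theta))      # predicted probability
            theta -= lr * w_ex[i] * (p - y[i]) * X[i]    # weighted gradient step
    return theta

rng = np.random.default_rng(1)
# Imbalanced toy data: 90 negatives around (-1, -1), 10 positives around (1, 1).
X = np.vstack([rng.normal(-1, 0.5, (90, 2)), rng.normal(1, 0.5, (10, 2))])
y = np.array([0] * 90 + [1] * 10)
theta = weighted_sgd_logreg(X, y)
preds = (X @ theta > 0).astype(int)
```

The minority-class weight (here n/(2·10) = 5) keeps the rare positive examples from being swamped by the 90 negatives during training.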
Automatic spoken digit recognition is an important area of speech recognition, and recognition of digits spoken in local languages is the next stage in this technological advancement. This paper presents a new approach to Pashto digit recognition using spectral- and prosodic-based feature extraction. Very little work has been done on Pashto spoken digit recognition, which is why no standard...
Emotions exhibited by a speaker can be detected by analyzing his or her speech, facial expressions, and gestures, or by combining these properties. This paper concentrates on determining the emotional state from speech signals. Various acoustic features, such as energy, zero crossing rate (ZCR), fundamental frequency, Mel Frequency Cepstral Coefficients (MFCCs), etc., are extracted for short-term, overlapping...
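The short-term, overlapping-frame analysis this abstract refers to can be sketched with NumPy for two of the listed features, energy and ZCR. The frame and hop sizes below (25 ms and 10 ms at 16 kHz) are conventional choices, not the paper's; MFCC extraction would additionally need a mel filterbank and is omitted.

```python
# Hedged sketch: frame-wise short-term energy and zero crossing rate (ZCR)
# over overlapping windows. Frame/hop sizes are illustrative conventions.
import numpy as np

def frame_signal(x, frame_len, hop):
    """Split signal x into overlapping frames of length frame_len."""
    n_frames = 1 + (len(x) - frame_len) // hop
    return np.stack([x[i * hop: i * hop + frame_len] for i in range(n_frames)])

def short_term_features(x, frame_len=400, hop=160):
    frames = frame_signal(x, frame_len, hop)
    energy = np.sum(frames ** 2, axis=1)  # short-term energy per frame
    # Each sign change contributes 1 crossing; normalize by frame length.
    zcr = np.mean(np.abs(np.diff(np.sign(frames), axis=1)) / 2, axis=1)
    return energy, zcr

# 1 s of a 440 Hz tone at 16 kHz: ZCR should sit near 2 * 440 / 16000 = 0.055.
sr = 16000
t = np.arange(sr) / sr
x = np.sin(2 * np.pi * 440 * t)
energy, zcr = short_term_features(x)
```

A pure tone crosses zero twice per period, which is why the expected ZCR is twice the frequency divided by the sample rate.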
Many pattern recognition problems involve characterizing samples with continuous labels instead of discrete categories. While regression models are suitable for these learning tasks, these labels are often discretized into binary classes to formulate the problem as a conventional classification task (e.g., classes with low versus high values). This methodology brings intrinsic limitations on the classification...
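The discretization step this abstract critiques can be shown in two lines; the median split below is purely illustrative, as the thresholding rule varies by task.

```python
# Toy illustration of turning continuous labels into binary classes.
# Splitting at the median is one common (and lossy) convention.
import numpy as np

scores = np.array([0.1, 0.4, 0.45, 0.55, 0.6, 0.9])  # continuous labels
classes = (scores > np.median(scores)).astype(int)   # 0 = "low", 1 = "high"
```

The lossiness is visible here: 0.45 and 0.55 land in opposite classes despite being nearly identical, which is exactly the kind of intrinsic limitation the abstract mentions.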
This paper presents an approach that aims to recognize stressed speech utterances. Our work consists of extracting features using Mel Frequency Cepstral Coefficients (MFCC) and Gammatone Frequency Cepstral Coefficients (GFCC). These features are then classified with a One-Class Support Vector Machine (OC-SVM). The results of the proposed method are obtained on speech samples of four stressed...
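The one-class classification step can be illustrated with scikit-learn's `OneClassSVM`. In this sketch the MFCC/GFCC extraction is skipped and replaced by synthetic 13-dimensional vectors, and the `nu`/`gamma` values are defaults rather than the paper's settings.

```python
# Illustrative OC-SVM usage: fit on one condition's feature vectors, then
# flag vectors from a different condition. Features here are synthetic
# stand-ins for MFCC/GFCC frames.
import numpy as np
from sklearn.svm import OneClassSVM

rng = np.random.default_rng(0)
train = rng.normal(0, 1, (100, 13))     # vectors of the modeled (neutral) class
inliers = rng.normal(0, 1, (20, 13))    # held-out vectors of the same class
outliers = rng.normal(6, 1, (20, 13))   # clearly different "stressed" condition

ocsvm = OneClassSVM(kernel="rbf", nu=0.1, gamma="scale").fit(train)
# predict() returns +1 for the modeled class, -1 for everything else.
flags = ocsvm.predict(outliers)
```

Because only one class is modeled, no stressed-speech examples are needed at training time, which is the practical appeal of the OC-SVM formulation.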
Human facial expressions change with different states of health; therefore, a facial-expression recognition system can be beneficial to a healthcare framework. In this paper, a facial-expression recognition system is proposed to improve the service of the healthcare in a smart city. The proposed system applies a bandlet transform to a face image to extract sub-bands. Then, a weighted, center-symmetric...
The use of non-verbal vocal input (NVVI) as a hands-free trigger approach has proven valuable in previous work [7]. Nevertheless, BlowClick's original detection method is vulnerable to false positives and is thus limited in its potential uses, e.g., together with acoustic feedback for the trigger. Therefore, we extend the existing approach with common machine learning methods. We found...
In the field of Human-Computer Interaction (HCI), human emotion recognition from speech signals has emerged as an active research area. Speech is the most common means of communication among human beings. Speech consists of sentences, which can be segregated into words; words consist of phonemes, which are considered the primary elements of voice construction. This paper presents a classification...
This paper presents an approach that aims to recognize stress in speech. The proposed system is based on wavelet packet prosody features, extracted from speech according to the Mel, Bark, and ERB scales. Multiclass Support Vector Machines are used as the base classifiers to classify the stress states. The speech utterances used in this study are taken from Speech...
The field of emotion recognition (ER) is a part of human-computer interaction and has evolved very rapidly over the last decade. Considerable work has been done on emotion recognition using audio and video separately; more recently, research has turned to fusing the different modalities. The aim of this paper is to fuse the results of emotion detection obtained using audio and visual...