Search results

Items from 1 to 20 out of 129 results

chapter

A low complexity solution for epilepsy detection using an improved version of the reaction-diffusion transform

Radu Dogaru, Ioana Dogaru

2017 5th International Symposium on Electrical and Electronics Engineering (ISEEE) > 1 - 6

2017 5th International Symposium on Electrical and Electronics Engineering (ISEEE)

Recognition of epileptic seizures is an important issue and in certain circumstances it is desirable to have portable equipment implementing the algorithm in order to better monitor the patients. This work considers a widely used EEG database from the University of Bonn as a reference for comparing our recognition method with others previously reported. In order to recognize epileptic seizures we combine a...

chapter

A low complexity method based on reaction-diffusion transform for ultrasound echo-based shape object classification

Mihai Bucurica, Ioana Dogaru, Radu Dogaru

2017 5th International Symposium on Electrical and Electronics Engineering (ISEEE) > 1 - 5

2017 5th International Symposium on Electrical and Electronics Engineering (ISEEE)

This paper presents improvements in terms of accuracy for shape object classification using a new low complexity method compared to a previous implementation [1]. The method uses echoes generated by a Java platform capable of emulating sound propagation in a controlled 2D virtual environment [2][3]. Echoes originate from the ultrasonic waves generated inside a virtual environment which contains geometrical...

chapter

Research on voiceprint recognition based on weighted clustering recognition SVM algorithm

Yang Wu, Lihong Xu, Yandong Chen, Xueyang Zhang

2017 Chinese Automation Congress (CAC) > 1144 - 1148

2017 Chinese Automation Congress (CAC)

The support vector machine (SVM) algorithm has received much attention in the research of voiceprint recognition, especially for small sample datasets. However, with the increase of recognition number and speech features number, the rate of model training and recognition is significantly reduced. In order to solve the problem, a new weighted clustering algorithm is proposed, which uses “one to one” SVM model...

chapter

Voice biometrics: Deep learning-based voiceprint authentication system

Andrew Boles, Paul Rad

2017 12th System of Systems Engineering Conference (SoSE) > 1 - 6

2017 12th System of Systems Engineering Conference (SoSE)

Speaker identification systems are becoming more important in today's world. This is especially true as devices rely on the user to speak commands. In this article, an analysis of how a text-independent voice identification system can be built is presented. Extracting the Mel-Frequency Cepstral Coefficients is evaluated and a support vector machine is trained and tested on two different data sets,...

chapter

Novel Applications of Complexity Inspired RDT Transform for Low Complexity Embedded Speech Recognition in Automotive Environments

Mihai Bucurica, Ioana Dogaru, Radu Dogaru

2017 21st International Conference on Control Systems and Computer Science (CSCS) > 375 - 378

2017 21st International Conference on Control Systems and Computer Science (CSCS)

Embedded dictation, i.e. recognizing vocal commands in noisy environments, with good accuracy and using low complexity implementations is a desirable task with many applications. Such applications include automotive infotainment solutions particularly when no connectivity is available, personal assistants including embedded dictation solutions for disabled people, and so on. This paper reports our...

chapter

A study of support vector machines for emotional speech recognition

Nattapong Kurpukdee, Sawit Kasuriya, Vataya Chunwijitra, Chai Wutiwiwatchai, et al.

2017 8th International Conference of Information and Communication Technology for Embedded Systems (IC-ICTES) > 1 - 6

2017 8th International Conference of Information and Communication Technology for Embedded Systems (IC-ICTES)

In this paper, an efficiency comparison of Support Vector Machines (SVM) and Binary Support Vector Machines (BSVM) techniques in utterance-based emotion recognition is studied. Acoustic features including energy, Mel-frequency cepstral coefficients (MFCC), Perceptual linear predictive (PLP), Filter bank (FBANK), pitch, and their first and second derivatives are used as frame-based features. Four basic emotions...

chapter

Speech signals identification base on improved DBN

Cai Jun, Yao Qin, Zhang Yi

2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC) > 1144 - 1148

2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC)

To address the problem of low speech recognition rates, an improved method of combining a Deep Belief Network (DBN) with a support vector machine (SVM) for analyzing small-sample speech signals is proposed. The speech signal data collected as the training sample is used for training the DBN to get the optimal parameter values. The trained DBN is utilized for feature extraction, and these speech sample data signals...

chapter

Multidimensional speaker information recognition based on proposed baseline system

Shan Li, Longting Xu, Zhen Yang

2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC) > 1776 - 1780

2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC)

Traditional speech-related identity recognition commonly pays attention to individual aspects of speech signals, but in reality the speech signals are made up of semantics, speaker-dependent features, etc. This paper therefore presents a new study that simultaneously recognizes multidimensional speaker information. In order to extract sufficient relational features, both high-level and low-level features...

chapter

Speech emotion recognition with skew-robust neural networks

Po-Yuan Shih, Chia-Ping Chen, Hsin-Min Wang

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 2751 - 2755

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We propose a neural-network training algorithm that is robust to data imbalance in classification. In our proposed algorithm, weights are introduced to training examples, effectively modifying the trajectory traversed in the parameter space during the learning process. Furthermore, the proposed algorithm would reduce to the normal stochastic gradient descent learning if the data is balanced. On the...

chapter

Pashto spoken digits recognition using spectral and prosodic based feature extraction

Shibli Nisar, Ibrahim Shahzad, Muhammad Adnan Khan, Muhammad Tariq

2017 Ninth International Conference on Advanced Computational Intelligence (ICACI) > 74 - 78

2017 Ninth International Conference on Advanced Computational Intelligence (ICACI)

Automatic spoken digit recognition is one of the important areas in speech recognition. Local language spoken digits recognition is the next stage in this technological advancement. This paper presents a new approach for Pashto digits recognition using spectral and prosodic based feature extraction. Very little or almost no work has been done in Pashto spoken digit recognition. That's why no standard...

chapter

Automatic speech emotion detection system using multi-domain acoustic feature selection and classification models

Nancy Semwal, Abhijeet Kumar, Sakthivel Narayanan

2017 IEEE International Conference on Identity, Security and Behavior Analysis (ISBA) > 1 - 6

2017 IEEE International Conference on Identity, Security and Behavior Analysis (ISBA)

Emotions exhibited by a speaker can be detected by analyzing his/her speech, facial expressions and gestures or by combining these properties. This paper concentrates on determining the emotional state from speech signals. Various acoustic features such as energy, zero crossing rate (ZCR), fundamental frequency, Mel Frequency Cepstral Coefficients (MFCCs), etc. are extracted for short term, overlapping...

article

The Cost of Dichotomizing Continuous Labels for Binary Classification Problems: Deriving a Bayesian-Optimal Classifier

Soroosh Mariooryad, Carlos Busso

IEEE Transactions on Affective Computing > 2017 > 8 > 1 > 119 - 130

Many pattern recognition problems involve characterizing samples with continuous labels instead of discrete categories. While regression models are suitable for these learning tasks, these labels are often discretized into binary classes to formulate the problem as a conventional classification task (e.g., classes with low versus high values). This methodology brings intrinsic limitations on the classification...

chapter

Automatic opinion leader recognition in group discussions

Yu-Chang Ho, Hao-Min Liu, Hui-Hsin Hsu, Chun-Han Lin, et al.

2016 Conference on Technologies and Applications of Artificial Intelligence (TAAI) > 138 - 145

2016 Conference on Technologies and Applications of Artificial Intelligence (TAAI)

In this paper, we propose an efficient approach to identify the opinion leader in a group discussion. This approach is able to recognize the opinion leader without analyzing semantic and syntactic features, which may cost a lot more computing effort. We first propose algorithms to evaluate the degree of participation and the emotion expression in the speech of each member during group discussion...

chapter

Speech emotion recognition using kernel sparse representation based classifier

Pulkit Sharma, Vinayak Abrol, Abhijeet Sachdev, A. D. Dileep

2016 24th European Signal Processing Conference (EUSIPCO) > 374 - 377

2016 24th European Signal Processing Conference (EUSIPCO)

In this paper, we propose to use a kernel sparse representation based classifier (KSRC) for the task of speech emotion recognition. Further, the recognition performance using the KSRC is improved by imposing a group sparsity constraint. Speech utterances with the same emotion may have different durations, but the frame sequence information does not play a crucial role in this task. Hence, in this work,...

chapter

Speech to text conversion for multilingual languages

Yogita H. Ghadage, Sushama D. Shelke

2016 International Conference on Communication and Signal Processing (ICCSP) > 236 - 240

2016 International Conference on Communication and Signal Processing (ICCSP)

The current work presents a multilingual speech-to-text conversion system. Conversion is based on information in the speech signal. Speech is the natural and most important form of communication for human beings. A Speech-To-Text (STT) system takes a human speech utterance as input and produces a string of words as output. The objective of this system is to extract, characterize and recognize the information...

chapter

Combining ensemble methods of Bagging, Subagging and Random Subspace for phoneme recognition

Abir Bousmina, Chiraz Jlassi, Najet Arous

2016 2nd International Conference on Advanced Technologies for Signal and Image Processing (ATSIP) > 677 - 682

2016 2nd International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)

A ‘weak classifier’ is a classifier that performs badly for many reasons. In general, bad performance can be caused by the high dimensionality of the data and also the instability of the classifier. Ensemble methods have been developed in order to overcome these problems. The most popular are bagging and Random Subspace Methods (RSM). We propose to use a combination of concepts used in Bagging and...

chapter

Spontaneous Speech Emotion Recognition via Multiple Kernel Learning

Cheng Zha, Ping Yang, Xinran Zhang, Li Zhao

2016 Eighth International Conference on Measuring Technology and Mechatronics Automation (ICMTMA) > 621 - 623

2016 Eighth International Conference on Measuring Technology and Mechatronics Automation (ICMTMA)

Speech emotion recognition has become an active topic in pattern recognition. Specifically, the support vector machine (SVM) is an effective classifier due to the application of the nonlinear mapping function, which can map the data into a high- or even infinite-dimensional feature space. However, a single kernel function might not be sufficient to describe the different properties of spontaneous speech emotion...

chapter

Efficient speech emotion recognition using binary support vector machines & multiclass SVM

N. Ratna Kanth, S. Saraswathi

2015 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC) > 1 - 6

2015 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC)

This paper presents the construction of Binary Support Vector Machines and their significance for efficient Speech Emotion Recognition (SER). The German Emotional Speech Corpus EmoDB has been used in this study. Seven Binary Support Vector Machines (SVMs) corresponding to each of the seven emotions in the EmoDB, namely Anger-Not Anger, Boredom-Not Boredom, Disgust-Not Disgust, Fear-Not Fear, Happy-Not Happy,...

chapter

Speech emotion recognition using DWT

S. Lalitha, Anoop Mudupu, Bala Visali Nandyala, Renuka Munagala

2015 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC) > 1 - 4

2015 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC)

Emotion recognition from speech helps us in improving the effectiveness of human-machine interaction. This paper presents a method to identify suitable features in the DWT domain and achieve good accuracy. In this work, 7 emotions (Berlin Database) are recognized using a Support Vector Machine (SVM) classifier. Entropy of Teager Energy operated Discrete Wavelet Transform (DWT) coefficients, Linear Predictive...

chapter

Speaker Authentication for an Assistive Domotic System

Virginia Sandulescu, Ionela Halcu

2015 20th International Conference on Control Systems and Computer Science (CSCS) > 337 - 340

2015 20th International Conference on Control Systems and Computer Science (CSCS)

This paper presents different approaches for developing a speaker recognition system to be used in a voice control interface for an assistive domotic system. Experimental research has been carried out in MATLAB, using the Voicebox toolbox for data preparation and feature extraction and using LIBSVM for speaker recognition.

Content availability:
Available
Data set:
ieee
Keywords:
TRAINING
SUPPORT VECTOR MACHINES
SPEECH RECOGNITION

Publication type

book (117)
article (12)

Keywords

SPEECH (89)
FEATURE EXTRACTION (47)
HIDDEN MARKOV MODELS (40)
SUPPORT VECTOR MACHINE (34)
KERNEL (25)
ACCURACY (23)
ACOUSTICS (22)
EMOTION RECOGNITION (21)
PATTERN CLASSIFICATION (19)
NATURAL LANGUAGE PROCESSING (17)
SPEECH PROCESSING (16)
SVM (16)
DATABASES (15)
MEL FREQUENCY CEPSTRAL COEFFICIENT (15)
SPEAKER RECOGNITION (15)
LEARNING (ARTIFICIAL INTELLIGENCE) (14)
CLASSIFICATION ALGORITHMS (11)
ARTIFICIAL NEURAL NETWORKS (10)
SUPPORT VECTOR MACHINE CLASSIFICATION (10)
GAUSSIAN PROCESSES (9)
MACHINE LEARNING (9)
NIST (9)
SIGNAL CLASSIFICATION (8)
SPEECH EMOTION RECOGNITION (8)
TESTING (8)
ROBUSTNESS (7)
TRAINING DATA (7)
ACOUSTIC SIGNAL PROCESSING (6)
ADAPTATION MODEL (6)
DATA MINING (6)
GAUSSIAN MIXTURE MODEL (6)
NEURAL NETWORKS (6)
OPTIMIZATION (6)
NOISE (5)
AUTOMATIC SPEECH RECOGNITION (4)
COMPLEXITY THEORY (4)
CORRELATION (4)
DICTIONARIES (4)
ENCODING (4)
ENTROPY (4)
ERROR ANALYSIS (4)
ESTIMATION (4)
GAUSSIAN MIXTURE MODELS (4)
INFORMATION RETRIEVAL (4)
LANGUAGE IDENTIFICATION (4)
LANGUAGE RECOGNITION (4)
LATTICES (4)
MATHEMATICAL MODEL (4)
NATURAL LANGUAGES (4)
PROSODY (4)
TRANSFORMS (4)
ADAPTATION MODELS (3)
ALGORITHM DESIGN AND ANALYSIS (3)
CEPSTRAL ANALYSIS (3)
COMPLEX NONLINEAR NETWORKS (3)
COMPUTATIONAL MODELING (3)
COVARIANCE MATRIX (3)
DATA MODELS (3)
DECODING (3)
DEEP LEARNING (3)
DETECTORS (3)
DISCRIMINATIVE TRAINING (3)
EQUAL ERROR RATE (3)
EQUATIONS (3)
FEATURE SELECTION (3)
HMM (3)
HUMANS (3)
MAXIMUM LIKELIHOOD ESTIMATION (3)
PARAMETER ESTIMATION (3)
PRINCIPAL COMPONENT ANALYSIS (3)
PROTOTYPES (3)
QUERY PROCESSING (3)
RADIAL BASIS NEURAL NETWORKS (3)
SPEAKER SEGMENTATION (3)
SPOKEN LANGUAGE RECOGNITION (3)
SUPPORT VECTOR CLASSIFIERS (3)
SVM CLASSIFIER (3)
TEXT ANALYSIS (3)
TOPIC IDENTIFICATION (3)
UNSUPERVISED LEARNING (3)
VECTORS (3)
ACOUSTIC MODELING (2)
ACOUSTIC SYSTEMS (2)
ACOUSTIC WAVEFORMS (2)
ACTIVE LEARNING (2)
AURORA 2.0 (2)
AUTOMATIC LANGUAGE IDENTIFICATION (2)
BAGGING (2)
BAYES METHODS (2)
BAYESIAN METHODS (2)
BERLIN DATABASE (2)
BINARY CLASSIFIERS (2)
BIOMETRICS (2)
CALIBRATION (2)
CHINESE ISOLATED WORDS (2)
CLASSIFICATION (2)
CLUSTERING ALGORITHMS (2)

INFONA - science communication portal
