Search results

Items from 1 to 20 out of 1,621 results

chapter

A Deep Transfer Learning Approach for Improved Post-Traumatic Stress Disorder Diagnosis

Debrup Banerjee, Kazi Islam, Gang Mei, Lemin Xiao, et al.

2017 IEEE International Conference on Data Mining (ICDM) > 11 - 20

Post-traumatic stress disorder (PTSD) is a traumatic-stressor related disorder developed by exposure to a traumatic or adverse environmental event that caused serious harm or injury. The structured interview is the only widely accepted clinical practice for PTSD diagnosis, but it suffers from several limitations, including the stigma associated with the disease. Diagnosis of PTSD patients by analyzing speech...

chapter

Marathi digit recognition using lip geometric shape features and dynamic time warping

Aparna Brahme, Umesh Bhadade

TENCON 2017 - 2017 IEEE Region 10 Conference > 974 - 979

The aim of our proposed research work is to identify the language of a spoken utterance using visual speech recognition and to include the Marathi language in a language identification (LID) system. In this paper we focus on the task of identifying the first three digits in Marathi. First, the lips are extracted from video frames of face images, and then landmark points on the lips are detected. Then...

chapter

Detecting depression in speech: Comparison and combination between different speech types

Hailiang Long, Zhenghao Guo, Xia Wu, Bin Hu, et al.

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 1052 - 1058

Depression is a mental disorder of high prevalence, leading to a negative effect on individuals, their families, society and the economy. In recent years, the problem of automatic detection of depression from the speech signal has gained more interest. In this paper, a new multiple classifier system for depression recognition was developed and tested. The novel aspect of this methodology is the combination...

chapter

A novel speech endpoint detection based on multiple complexities and fuzzy C means

Chuanyan Wu, Rui Gao, Bentao Lin

2017 Chinese Automation Congress (CAC) > 6369 - 6372

Accurate speech endpoint detection is important for speaker recognition, speech recognition, coding, transmission, and so on. In this paper, a fusion feature is proposed for speech endpoint detection, which uses the zero-crossing rate, Lempel-Ziv complexity (LZC), C₀ complexity and fluctuation complexity to represent the speech signal. In order to classify speech signal and background signal,...

chapter

An Improved Tibetan Lhasa Speech Recognition Method Based on Deep Neural Network

Wenbin Ruan, Zhenye Gan, Bin Liu, Yin Guo

2017 10th International Conference on Intelligent Computation Technology and Automation (ICICTA) > 303 - 306

Deep Neural Networks (DNN) are currently the dominant technique in English and Chinese speech recognition. However, Tibetan speech recognition research started late and mainly uses the Hidden Markov Model (HMM). In this paper, we show a better method that replaces Gaussian Mixture Models (GMM) with a DNN in a Tibetan Lhasa dialect speech recognition system. The system contains seven layers of features...

chapter

Speech emotion recognition based on Gaussian kernel nonlinear proximal support vector machine

Zhiyan Han, Jian Wang

2017 Chinese Automation Congress (CAC) > 2513 - 2516

To improve the precision of speech emotion recognition, this paper proposes a novel approach based on the Gaussian Kernel Nonlinear Proximal Support Vector Machine (PSVM) to recognize four basic human emotions (anger, joy, sadness, surprise). First, the speech signal is preprocessed, including sampling, quantization, pre-emphasis, framing, windowing and endpoint...

chapter

Comparing statistical classifiers for emotion classification

Raseeda Hamzah, Nursuriati Jamil, Khyrina Airin Fariza Abu Samah, Nur Nabilah Abu Mangshor, et al.

2017 7th IEEE International Conference on System Engineering and Technology (ICSET) > 183 - 188

Speech emotion recognition has been widely used in human computer interaction and applications. This paper classifies emotion into two classes: happy and angry. All speech signals from a Malay spoken speech database are preprocessed. Emotional information is obtained by applying two well-established acoustic features, Mel Frequency Cepstral Coefficients (MFCC) and Short Time Energy (STE)...

chapter

An incremental intelligent object recognition system based on deep learning

Long Yan, Yongxiong Wang, Tianzhong Song, Zhong Yin

2017 Chinese Automation Congress (CAC) > 7135 - 7138

The accuracy of object recognition has been greatly improved by the rapid development of deep learning, but deep learning generally requires a lot of training data, and the training process is very slow and complex. We propose an incremental object recognition system based on deep learning techniques and speech recognition technology with high learning speed and wide applicability. The system...

chapter

Application of convolution neural network to flow pattern identification of gas-liquid two-phase flow in small-size pipe

Zhiyong Yang, Haifeng Ji, Zhiyao Huang, Baoliang Wang, et al.

2017 Chinese Automation Congress (CAC) > 1389 - 1393

Flow pattern is one of the most important parameters for gas-liquid two-phase flow. In this work, a new flow pattern identification method based on Convolution Neural Network (CNN) is presented. A 7-layer CNN structure is chosen, and the parameters of this network are determined by a training set. In order to verify the feasibility, experiments were carried out in horizontal pipe with the inner diameter...

chapter

PitchKeywordExtractor: Prosody-based automatic keyword extraction for speech content

Iurii Lezhenin, Artyom Zhuikov, Natalia Bogach, Elena Boitsova, et al.

2017 Federated Conference on Computer Science and Information Systems (FedCSIS) > 265 - 269

Keyword extraction is widely used for information indexing, compressing, summarizing, etc. Existing keyword extraction techniques apply various text-based algorithms and metrics to locate the keywords. At the same time, some types of audio and audiovisual content, e.g. lectures, talks, interviews and other speech-oriented information, allow keyword search by the prosodic accents made by a...

chapter

Implementation of accent recognition methods subsystem for eLearning systems

Eugen Tverdokhleb, Hennadii Dobrovolskyi, Nataliya Keberle, Natalia Myronova

2017 9th IEEE International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications (IDAACS) > 2 > 1037 - 1041

The results of the implementation of an external accent recognition system and its integration into the massive open online course platform Moodle are reported. Accent recognition becomes important in foreign language learning, where it gives a student feedback on the presence of a certain unwanted accent in foreign language pronunciation. Implementation of several accent recognition methods and their...

chapter

Improvement of speech recognition results by a combination of systems

Rama Hasan, Hussein Hussein, Pavlos Lazaridis, Sinan Khwandah, et al.

2017 23rd International Conference on Automation and Computing (ICAC) > 1 - 4

The aim of this study is to suggest an algorithm that combines two speech recognition systems. These systems differ in the methods used in the feature extraction stage, but they share the same classifier, a Hidden Markov Model (HMM). The first system uses Mel-Frequency Cepstrum Coefficients (MFCC), the second one uses Linear Prediction Cepstrum Coefficients (LPCC), and the third system uses Perceptual...

chapter

Speaker-Dependent Isolated-Word Speech Recognition System Based on Vector Quantization

Yinyin Zhao, Lei Zhu

2017 International Conference on Computer Network, Electronic and Automation (ICCNEA) > 133 - 137

A speaker-dependent speech recognition system must not only recognize speech but also recognize the speaker of the segment. In this paper, two indicators, the short-time average zero-crossing rate and the dual-threshold endpoint, are selected to test the signal endpoint through the study of speaker-dependent isolated-word speech characteristics, and MFCC parameters are taken...

chapter

Does speech enhancement work with end-to-end ASR objectives?: Experimental analysis of multichannel end-to-end ASR

Tsubasa Ochiai, Shinji Watanabe, Shigeru Katagiri

2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP) > 1 - 6

Recently we proposed a novel multichannel end-to-end speech recognition architecture that integrates the components of multichannel speech enhancement and speech recognition into a single neural-network-based architecture and demonstrated its fundamental utility for automatic speech recognition (ASR). However, the behavior of the proposed integrated system remains insufficiently clarified. An open...

chapter

Automated rating of recorded classroom presentations using speech analysis in Kazakh

Akzharkyn Izbassarova, Aidana Irmanova, Alex Pappachen James

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 393 - 397

Effective presentation skills can help one succeed in business, career and academia. This paper presents the design of speech assessment during an oral presentation and an algorithm for speech evaluation based on criteria of optimal intonation. As the pace of speech and its optimal intonation vary from language to language, developing an automatic identification of language during the presentation...

chapter

Relative spectral-perceptual linear prediction (RASTA-PLP) speech signals analysis using singular value decomposition (SVD)

Muhammad Amirul Azzim Zulkifly, Norashikin Yahya

2017 IEEE 3rd International Symposium in Robotics and Manufacturing Automation (ROMA) > 1 - 5

Speech recognition systems have applications in many areas, such as customer call centers, and serve as a medium for helping those with learning disabilities. There are three main stages in speech recognition: signal analysis, feature extraction and modeling. Feature extraction plays an important role in a speech recognition system, and a good speech feature extraction technique will allow the systems to...

chapter

Speech classification based on cuckoo algorithm and support vector machines

Wenlei Shi, Xinhai Fan

2017 2nd IEEE International Conference on Computational Intelligence and Applications (ICCIA) > 98 - 102

Speech classification is an important part of speech signal processing. It is significant to classify speech accurately and quickly in speech coding and speech synthesis. Because of the diversity and uncertainty of the speech signals, the traditional classification method is slow and not so accurate in the large-scale application of real speech classification. In order to improve the accuracy and...

chapter

Development of speech emotion recognition system using deep belief networks in Malayalam language

Athira Chandran, D. Pravena, D. Govind

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 676 - 680

The goal of this work is to validate the impact of natural elicitation of emotions by the speakers during the development of speech emotion databases for the Malayalam language. The work also proposes a Gaussian Mixture Model-Deep Belief Networks (GMM-DBN) based speech emotion recognition system. To test the effect of emotion elicitation by the speakers, two independent datasets with emotionally biased...

chapter

Speech recognition using facial sEMG

Mok Win Soon, Muhammad Ikmal Hanafi Anuar, Mohamad Hafizat Zainal Abidin, Ahmad Syukri Azaman, et al.

2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA) > 1 - 5

This paper presents a study of speech recognition based on electromyographic biosignals captured from the articulatory muscles in the face using surface electrodes. This paper compares the speech recognition system for spoken English and Malay words by a group of Malay native speakers. Feature extraction was done in both temporal and time-frequency domains. Temporal features used are integrated EMG...

chapter

Affective computing using speech processing for call centre applications

Rakshith K. Gowda, Vandana Nimbalker, R. Lavanya, S. Lalitha, et al.

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 766 - 771

The paper deals with affective computing to improve the performance of human-machine interaction. The focus of this work is to detect the affective state of a human using speech processing techniques, primarily intended for call centre applications. Limited work has been reported to date on affect detection using phase-derived features. A unique combination of Group delay (GD), Phase delay (PD), One Sided...

Keywords:
FEATURE EXTRACTION
SPEECH RECOGNITION

