Most present-day Speaker Identification (SID) systems focus on the speech features used for modeling the speakers without any concern for the speech being input to the system. Knowing how reliable the input speech is can be very important and useful. The idea of SID-usable speech is to identify and extract those portions of corrupted input speech that are more reliable for SID systems,...
Traditional speech recognition systems use Gaussian mixture models to obtain the likelihoods of individual phonemes, which are then used as state emission probabilities in hidden Markov models representing the words. In hybrid systems, the Gaussian mixtures are replaced by more discriminant classifiers, leading to improved performance. Most of the time, the classifiers used in such systems are neural...
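The GMM-as-emission-model idea described in this abstract can be sketched in a few lines: each phoneme has its own Gaussian mixture, and the mixture likelihood of the current frame's feature serves as the HMM state's emission probability. The phoneme labels, mixture parameters, and feature value below are invented toy numbers, not values from the paper.

```python
import math

def gaussian_pdf(x, mean, var):
    """Univariate Gaussian density."""
    return math.exp(-(x - mean) ** 2 / (2 * var)) / math.sqrt(2 * math.pi * var)

def gmm_likelihood(x, mixture):
    """Likelihood of a scalar feature under a mixture of (weight, mean, var)."""
    return sum(w * gaussian_pdf(x, m, v) for w, m, v in mixture)

# Toy acoustic models: one 2-component GMM per phoneme (hypothetical values).
phoneme_gmms = {
    "aa": [(0.6, 1.0, 0.5), (0.4, 1.5, 0.3)],
    "iy": [(0.5, -1.0, 0.4), (0.5, -0.5, 0.6)],
}

frame_feature = 1.2  # e.g. one cepstral coefficient for the current frame

# These likelihoods would serve as the HMM state emission probabilities.
emissions = {ph: gmm_likelihood(frame_feature, gmm) for ph, gmm in phoneme_gmms.items()}
best = max(emissions, key=emissions.get)
print(best)  # → "aa": the phoneme whose GMM assigns the highest likelihood
```

In a hybrid system, `gmm_likelihood` would be replaced by a neural network's (scaled) posterior for each state, with the rest of the HMM machinery unchanged.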
This paper investigates the use of MultiDimensional Voice Program (MDVP) parameters to automatically detect voice pathology in the Arabic voice pathology database (AVPD). MDVP parameters are very popular among physicians and clinicians for detecting voice pathology; however, MDVP is commercial software. The AVPD is a newly developed speech database designed to suit a wide range of experiments in the field of...
Automatic Speech Emotion Recognition (SER) is a current research topic in the field of Human-Computer Interaction (HCI) with a wide range of applications. Speech features such as Mel Frequency Cepstral Coefficients (MFCC) and Mel Energy Spectrum Dynamic Coefficients (MEDC) are extracted from the speech utterance. LIBSVM is used as the classifier to identify different emotional states such as anger,...
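The pipeline this abstract outlines — frame-level feature extraction followed by a trained classifier — can be sketched as below. This is a toy stand-in, not the paper's method: log-energy and zero-crossing rate replace MFCC/MEDC, and a nearest-centroid rule replaces LIBSVM; all centroid values are invented.

```python
import math

def frame_features(frame):
    """Log energy and zero-crossing rate of one frame of samples
    (stand-ins here for MFCC/MEDC features)."""
    energy = sum(s * s for s in frame) / len(frame)
    zcr = sum(1 for a, b in zip(frame, frame[1:]) if a * b < 0) / (len(frame) - 1)
    return (math.log(energy + 1e-10), zcr)

def classify(features, centroids):
    """Nearest-centroid decision (stand-in for an SVM trained on labelled data)."""
    def dist(c):
        return sum((f - v) ** 2 for f, v in zip(features, c))
    return min(centroids, key=lambda label: dist(centroids[label]))

# Hypothetical class centroids "learned" from labelled training utterances.
centroids = {"anger": (2.0, 0.45), "neutral": (-1.0, 0.10)}

# A loud, rapidly alternating toy frame that lies near the "anger" centroid.
frame = [(-1) ** i * 2.0 for i in range(160)]
feats = frame_features(frame)
print(classify(feats, centroids))  # → anger
```

A real SER system would extract full cepstral feature vectors per frame, pool them over the utterance, and train the SVM on a labelled emotional speech corpus.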
A biometric system makes a pattern recognition decision based on biometric features extracted from a human being. This paper presents a text-independent speaker verification system using support vector machines (SVMs), which identifies the speaker from their voice. Speaker verification thus determines whether a test utterance is spoken by a target speaker and...
Over time, human-computer interaction has extended its branches into many other fields, such as engineering, cognition, and medicine. Speech analysis has also become an important area of concern. People use this mode of interaction with machines to bridge the gap between the physical and digital worlds. Speech emotion recognition has become an integral subfield in the domain...
Research on speech/music classification of digital audio has been popular in academia and is increasingly utilized in industry. Most of the usual methods use carefully hand-crafted features with Gaussian Mixture Models. To get the best performance, some of the features necessitate a long latency due to look-ahead, and/or a long onset error. This paper takes a different approach to the problem...
There is an increasing use of sensor networks capable of sensing multimedia data, including audio data. Unfortunately, public use of these data is not allowed because they contain sensitive private information such as person and location names. Person name extraction (PNE), a widely investigated research topic, is an effective technique for resolving this problem. However, there is an important difference...
Detecting a user's emotion can be used for business development and psychological analysis. The motivation of this paper is to build a Tamil emotional corpus, to serve as a basis for emotion analysis based on the acoustic variations present, and to make the corpus available in the public domain. Tamil Play will be used as the main resource for building the emotional corpus. Basically, emotions...
Speaker age recognition is an essential technique in automatic speech recognition, based on the speech waveform parameters of a speaker's voice. However, there are several challenges in speaker age recognition, such as innate differences between speakers' voices, fuzziness in subjective classification, etc. Speaker age recognition based on isolated words is addressed in this paper, including support vector machine...
Recently, Voice Activity Detection (VAD) algorithms based on machine learning techniques have shown impressive results in the area of speech recognition. In this paper, we present a case study and discuss the performance of VAD based on Support Vector Machines (SVM) for a Distributed Speech Recognition (DSR) system. In this case study, the speech and non-speech frames are detected from the...
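The frame-level speech/non-speech decision described here can be sketched as follows. At test time a trained linear SVM reduces to a linear decision function over the frame features; the weights, bias, and feature choices (log energy and zero-crossing rate) below are illustrative assumptions, not trained values from the paper.

```python
import math

def vad_decision(frame, w=(1.0, -0.5), b=-0.5):
    """Label one frame as speech (True) or non-speech (False) via w . x + b > 0,
    the form a trained linear SVM takes at test time. w and b are toy values."""
    energy = math.log(sum(s * s for s in frame) / len(frame) + 1e-10)
    zcr = sum(1 for a, c in zip(frame, frame[1:]) if a * c < 0) / (len(frame) - 1)
    score = w[0] * energy + w[1] * zcr + b
    return score > 0

speech_like = [math.sin(0.2 * i) * 3.0 for i in range(200)]   # loud tone-like frame
silence_like = [0.001 * ((-1) ** i) for i in range(200)]      # near-zero noise frame
print(vad_decision(speech_like), vad_decision(silence_like))  # → True False
```

In a DSR setting, such a decision would run on the client so that only frames labelled as speech are encoded and transmitted to the recognition server.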
This paper presents an application of deep learning to the extraction of emotions from Chinese speech using a deep belief network (DBN). Eight features, such as pitch and Mel Frequency Cepstral Coefficients (MFCC), are extracted from Mandarin speech and used as network inputs, and a DBN classifier is used instead of traditional shallow learning methods to recognize emotions. Experiment...
The main goal of this paper is to establish the relevance of nonlinear parameters (Lyapunov exponents) in the automatic classification of emotions for the Romanian language. The Largest Lyapunov Exponent (LLE) was computed for the MFCC (mel frequency cepstral coefficients) and the LPCC (linear prediction cepstral coefficients). The Support Vector Machine (SVM) classifier provides better results than...
Automatic sentence segmentation of speech is the process of identifying the end of a sentence. It is used to improve the output after speech recognition and helps make the recognition output more readable. It is generally a two-class problem involving the identification of a boundary between the sentence part and the non-sentence part. An Automatic Speech Recognition (ASR) system...
In the past decade, a lot of research has gone into Automatic Speech Emotion Recognition (SER). The primary objective of SER is to improve the man-machine interface. It can also be used to monitor the psychophysiological state of a person in lie detectors. More recently, speech emotion recognition has also found applications in medicine and forensics. In this paper, seven emotions are recognized using pitch...
This work presents a low-cost, fast-trainable automatic speaker-speech recognition (ASSR) system based on a proposed binary halved clustering (BHC) method, for human-machine interfaces (HMI) on an embedded platform, since low cost is essential for making ASSR affordable in real-world applications. In addition, fast training enables fast response times. The reduction of waiting...
This paper presents a modification of a speech emotion recognition system for a social robot. Using speaker-dependent classifiers with a prior speaker identification step is proposed. Emotion recognition is performed using global acoustic features of the speech. Six speech signal parameters are computed with specialised software. The feature extraction is based on calculating global statistics of those...
Emotion recognition from speech plays an important role in developing affective and intelligent Human-Computer Interaction. The goal of this work is to build an Automatic Emotion Variation Detection (AEVD) system to determine each emotionally salient segment in continuous speech. We focus on emotion detection in angry-neutral speech, which is common in recent studies of AEVD. This study proposes a novel...
Recognition of human emotion from speech has become one of the most challenging and attractive fields of research in the speech processing area. The present study aimed to detect the valence of emotions using Non-Linear Dynamic features (NLDs). NLDs are extracted from the Discrete Cosine Transform (DCT) of descriptor contours computed from the Phase Space Reconstruction (PSR) of speech. These features are...
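The Phase Space Reconstruction step mentioned in this abstract is standardly done by delay embedding: a scalar signal s is mapped to vectors x_i = (s_i, s_{i+tau}, ..., s_{i+(m-1)tau}). A minimal sketch, assuming toy values of the delay tau and embedding dimension m (real systems choose them, e.g., via mutual information and false nearest neighbours):

```python
def delay_embed(signal, m=3, tau=2):
    """Return the m-dimensional delay vectors of a scalar signal:
    the i-th vector is (s_i, s_{i+tau}, ..., s_{i+(m-1)*tau})."""
    n = len(signal) - (m - 1) * tau
    return [tuple(signal[i + k * tau] for k in range(m)) for i in range(n)]

signal = [0, 1, 2, 3, 4, 5, 6, 7]
vectors = delay_embed(signal)
print(vectors[0], len(vectors))  # → (0, 2, 4) 4
```

The descriptor contours (and then their DCT) would be computed over these reconstructed trajectories rather than over the raw waveform.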
Emotional speech recognition is an interesting application that is able to recognize different emotional states from the speech signal. In Human-Robot Interaction (HRI), emotion recognition is being applied to intelligent robots so that they can understand the emotional states of users and interact in a more human-like manner. However, it is not easy to apply emotion recognition algorithms in real applications...