In this paper we present recent results from two monitoring applications based on acoustic signal classification. The first case study addresses context awareness through acoustic analysis for a service robot. The second concerns acoustic classification for wildlife intruder detection. Previous results are briefly recalled and new experimental results are provided.
In this paper, two models, the I-vector and the Gaussian Mixture Model-Universal Background Model (GMM-UBM), are compared for the speaker identification task. Four feature combinations of I-vectors with seven fusion techniques are considered: maximum, mean, weighted sum, cumulative, interleaving, and concatenation for both two and four features. In addition, an Extreme Learning Machine (ELM) is exploited...
The vulnerability of automatic speaker verification (ASV) systems to spoofing attacks is an important security concern regarding the reliability of ASV technology. Recently, various countermeasures have been developed for spoofing detection. In this paper, we propose to use features derived from the linear prediction (LP) residual signal for spoofing detection using a simple Gaussian mixture model (GMM)...
Biometrics exploits physiological and behavioural human characteristics, and performance metrics are used to measure characteristics of an individual. The measurement may lead to a one-to-one match, called authentication, or a one-from-N match, called identification. In this paper, we exploit a speech biometric I-vector with a low, fixed dimension of 100 to identify speakers...
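As a minimal illustration of the one-from-N identification step with fixed-dimension embeddings, the sketch below scores a test i-vector against enrolled speakers by cosine similarity. The function names and the random 100-dimensional toy vectors are illustrative assumptions, not the paper's implementation:

```python
import numpy as np

def cosine_score(a, b):
    """Cosine similarity between two i-vectors."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

def identify(test_ivec, enrolled):
    """One-from-N: return the enrolled speaker whose i-vector scores highest."""
    return max(enrolled, key=lambda spk: cosine_score(test_ivec, enrolled[spk]))

# Toy example: three enrolled speakers with 100-dimensional i-vectors.
rng = np.random.default_rng(0)
enrolled = {f"spk{i}": rng.standard_normal(100) for i in range(3)}
test = enrolled["spk1"] + 0.1 * rng.standard_normal(100)  # noisy copy of spk1
print(identify(test, enrolled))
```

In a real system the enrolled vectors would come from a trained i-vector extractor, and the cosine score is often preceded by length normalization or LDA/PLDA scoring.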
Speech uttered by human beings contains information about the speaker, the language, and the content. The language of an utterance can be identified by extracting language-specific information from it. Identifying the language of speech is known as Language Identification (LID). Identification of language from speech is helpful in translation, speech recognition, and speech-activated automatic...
Voice-based biometric systems are highly prone to spoofing attacks. Recently, various countermeasures have been developed for detecting different kinds of attacks such as replay, speech synthesis (SS) and voice conversion (VC). Most of the existing studies are conducted with a specific training set defined by the evaluation protocol. However, for realistic scenarios, selecting appropriate training...
The performance of a speaker verification system is severely degraded by spoofing attacks generated from artificial speech synthesizers. Recently, several approaches have been proposed for classifying natural and synthetic speech (spoof detection) which can be used in conjunction with a speaker verification system. In this paper, we attempt to develop a joint modelling approach which can detect the...
Emotions exhibited by a speaker can be detected by analyzing his/her speech, facial expressions, and gestures, or by combining these cues. This paper concentrates on determining the emotional state from speech signals. Various acoustic features such as energy, zero crossing rate (ZCR), fundamental frequency, Mel Frequency Cepstral Coefficients (MFCCs), etc., are extracted for short-term, overlapping...
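Two of the features named above, short-term energy and zero crossing rate, are simple enough to sketch directly. The snippet below computes them over overlapping frames; the frame length, hop size, and the synthetic sine-wave signal are illustrative assumptions, not the paper's settings:

```python
import numpy as np

def frame_signal(x, frame_len=400, hop=160):
    """Split a 1-D signal into overlapping frames (25 ms / 10 ms at 16 kHz)."""
    n_frames = 1 + (len(x) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return x[idx]

def short_term_energy(frames):
    """Per-frame energy: sum of squared samples."""
    return np.sum(frames ** 2, axis=1)

def zero_crossing_rate(frames):
    """Fraction of consecutive sample pairs whose sign differs, per frame."""
    signs = np.sign(frames)
    return np.mean(np.abs(np.diff(signs, axis=1)) > 0, axis=1)

# Toy signal: one second of a 440 Hz sine wave at 16 kHz.
sr = 16000
t = np.arange(sr) / sr
x = np.sin(2 * np.pi * 440 * t)
frames = frame_signal(x)  # 98 frames of 400 samples
print(short_term_energy(frames)[:3], zero_crossing_rate(frames)[:3])
```

For a pure tone, the ZCR is roughly twice the tone frequency divided by the sample rate (here about 0.055), which is why ZCR helps separate voiced, unvoiced, and silent regions.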
Speech is the natural, vocalized, and primary means of communication. It is easy, hands-free, fast, and requires no technical knowledge. Communicating with a computer using speech is a simple and comfortable way for human beings, and speech recognition systems make this possible. The acoustic and language models for such systems are available, but mostly for the English language. In India there are so many people...
Identification of musical instruments from the acoustic signal using speech signal processing methods is a challenging problem. Further, whether this identification can be carried out by a single musical note, like humans are able to do, is an interesting research issue that has several potential applications in the music industry. Attempts have been made earlier using the spectral and temporal features...
This paper presents text-dependent speaker verification using Mel-Frequency Cepstral Coefficients (MFCC) and a Support Vector Machine (SVM). The MFCC technique is used to extract characteristics from the voice recordings spoken by the user, and the SVM is used to classify the models of the speakers and impostors. A Malay spoken-digit database is utilized for the training...
Phonetic Engine (PE) is a system used to determine the sequence of phones in a spoken utterance. The International Phonetic Alphabet (IPA) is used to transcribe the speech database. This work focuses on developing a multilingual PE for four Indian languages, namely Bengali, Hindi, Urdu, and Telugu; the approach can be extended to any number of languages. For developing the PE, read speech...
This work highlights the importance of the phonetic match between training and test sessions for a text-independent framework under limited test-data conditions. The robustness of text-independent speaker verification (SV) tends to degrade as the amount of speech involved is reduced. From the point of view of a deployable, application-oriented system, the amount of speech involved is expected to be less to...
Emotion recognition plays a significant role in affective computing and adds value to machine intelligence. While the emotional state of a person can be manifested in different ways such as facial expressions, gestures, movements, and postures, recognition of emotion from speech has attracted more interest than the others. However, after years of research, recognizing the emotional state of individuals...
The current work presents a multilingual speech-to-text conversion system based on the information in the speech signal. Speech is the natural and most important form of communication for human beings. A Speech-To-Text (STT) system takes a human speech utterance as input and produces a string of words as output. The objective of this system is to extract, characterize, and recognize the information...
The paper presents speaker verification results for six basic emotional states. A database of emotional speech (six acted states: anger, sadness, happiness, fear, disgust, and surprise) plus the neutral state was examined with a typical speaker verification system based on MFCC features and GMM classifiers. The obtained results were compared with subjective and objective emotion recognition scores...
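The scoring step of such a GMM-based verification system can be sketched compactly: each utterance's MFCC frames are scored against a claimed speaker's GMM and a background model, and the log-likelihood ratio decides accept/reject. The diagonal-covariance models and toy parameters below are illustrative assumptions, not the paper's trained models:

```python
import numpy as np

def gmm_log_likelihood(X, weights, means, variances):
    """Average per-frame log-likelihood of feature frames X (T x D)
    under a K-component diagonal-covariance GMM."""
    d = X[:, None, :] - means[None, :, :]                               # (T, K, D)
    log_norm = -0.5 * np.sum(np.log(2 * np.pi * variances), axis=1)     # (K,)
    log_comp = log_norm[None, :] - 0.5 * np.sum(d ** 2 / variances[None, :, :], axis=2)
    # log sum_k w_k N(x | mu_k, var_k), via log-sum-exp for stability
    a = log_comp + np.log(weights)[None, :]
    m = a.max(axis=1, keepdims=True)
    return float(np.mean(m[:, 0] + np.log(np.sum(np.exp(a - m), axis=1))))

def verification_score(X, speaker_gmm, ubm):
    """Log-likelihood ratio: claimed-speaker model vs. background model."""
    return gmm_log_likelihood(X, *speaker_gmm) - gmm_log_likelihood(X, *ubm)

# Toy 2-component, 3-dimensional models (weights, means, variances).
spk = (np.array([0.5, 0.5]),
       np.array([[1.0, 1.0, 1.0], [2.0, 2.0, 2.0]]),
       np.ones((2, 3)))
ubm = (np.array([0.5, 0.5]), np.zeros((2, 3)), 4.0 * np.ones((2, 3)))
rng = np.random.default_rng(1)
X = 1.0 + 0.5 * rng.standard_normal((50, 3))  # frames near the speaker's mean
print(verification_score(X, spk, ubm))        # positive score -> accept
```

In practice the speaker model is usually obtained by MAP adaptation of a universal background model, and the score is compared against a tuned threshold rather than zero.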
Acoustic vocalizations are common in marine mammals and can be used for classification purposes. Pinnipeds are a group of carnivorous mammals comprising seals, sea lions, and walruses. Although there is great interest in the research literature in acoustic monitoring of marine mammals, the identification of pinnipeds through expert systems has been poorly studied. This paper presents a novel...
In this paper, we present our work on speech-smile/shaking-vowel classification. An efficient classification system would be a first step towards estimating (from speech signals only) amusement levels beyond smiling, as shaking vowels represent a transition from smile to laughter superimposed on speech. A database containing examples of both classes has been collected from acted and spontaneous...
This paper develops an algorithm, "Discrete Wavelet Transform with Adaptive Filter" (DWTAF), to transform neutral speech into emotional speech such as angry, happy, or sad, and compares it with two other emotion-transformation algorithms: "Speech Transformation using Statistical Parameters and Pitch Contours" (STSPPC) and "Speech Transformation using Mel Frequency Cepstral...
In this paper, hierarchical video genres are classified and indexed using Support Vector Machines (SVMs) based on audio features only. Segmentation parameters are extracted at the block level, which has the major benefit of capturing local temporal information. The main contribution of our study is a powerful combination of the two employed audio descriptors: Mel Frequency Cepstral...