Cry segmentation is an essential preprocessing step in any infant crying diagnosis system. In addition to crying sounds, which consist of expiration phases followed by short inspiration episodes, each recording of newborn cries also includes silence sections as well as other sounds such as caregiver speech, noise and the sound of medical equipment. This paper is devoted to a newly developed Empirical...
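The silence-removal step described above can be sketched with a simple short-time-energy detector. This is a generic illustration, not the Empirical-mode method the paper develops; the frame size, hop and threshold are arbitrary choices.

```python
import math

def short_time_energy(signal, frame_len=256, hop=128):
    """Frame-wise mean-square energy of a 1-D signal (plain Python lists)."""
    energies = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = signal[start:start + frame_len]
        energies.append(sum(x * x for x in frame) / frame_len)
    return energies

def segment_active(energies, threshold):
    """Return (start_frame, end_frame) pairs where energy exceeds threshold."""
    segments, start = [], None
    for i, e in enumerate(energies):
        if e >= threshold and start is None:
            start = i
        elif e < threshold and start is not None:
            segments.append((start, i))
            start = None
    if start is not None:
        segments.append((start, len(energies)))
    return segments

# Synthetic example: silence, then a "cry" burst, then silence.
sig = [0.0] * 1024 + [math.sin(0.3 * n) for n in range(2048)] + [0.0] * 1024
e = short_time_energy(sig)
print(segment_active(e, threshold=0.1))  # one active segment in the middle
```

A real cry segmenter would add hysteresis and a minimum-duration rule so brief pauses inside one expiration are not split into separate segments.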
Research indicates that electroencephalography (EEG) can be used to classify data of imagined speech. It can be further utilized to develop speech prostheses and synthetic telepathy systems. The objective of this paper is to improve the classification performance in imagined speech by selecting the features that extract maximum discriminatory information from the data. The features extracted are...
Speech recognition systems are based on either a parametric or a non-parametric approach. Parametric systems such as HMMs have been the dominant technology for speech recognition over the past decade. Despite many advancements and enhancements in the design of these systems, key problems such as long-term temporal dependence have not yet been solved. Recently, due to availability...
In this paper, a vowel recognition scheme using visual information is proposed based on the two-dimensional discrete wavelet transform (2D-DWT). First, a video frame corresponding to a steady vowel zone is selected utilizing the speech characteristics of the audio frames. Next, a pixel-based method is proposed to identify the lip region of a given video frame, where intensity variation of different color...
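As a rough illustration of the 2D-DWT step, here is a minimal one-level 2-D Haar transform in plain Python. A real system would use a wavelet library and possibly higher-order wavelets, so treat this as a sketch only.

```python
def haar_1d(row):
    """One level of the 1-D Haar transform: pairwise averages then differences."""
    avg = [(row[2 * i] + row[2 * i + 1]) / 2 for i in range(len(row) // 2)]
    diff = [(row[2 * i] - row[2 * i + 1]) / 2 for i in range(len(row) // 2)]
    return avg + diff

def haar_2d(img):
    """One level of the separable 2-D Haar DWT: rows first, then columns.
    The top-left quadrant of the result is the LL (approximation) band."""
    rows = [haar_1d(r) for r in img]
    cols = [haar_1d([rows[i][j] for i in range(len(rows))])
            for j in range(len(rows[0]))]
    # Transpose back so out[i][j] indexes row i, column j.
    return [[cols[j][i] for j in range(len(cols))] for i in range(len(cols[0]))]

img = [[1, 1, 2, 2],
       [1, 1, 2, 2],
       [3, 3, 4, 4],
       [3, 3, 4, 4]]
out = haar_2d(img)
print(out)  # LL quadrant (top-left 2x2) holds the 2x2 block averages
```

The LL band is the low-resolution approximation usually kept as the visual feature; the other three quadrants carry horizontal, vertical and diagonal detail.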
The goal of this paper is to identify the gender of blog authors. Features such as POS tags, unigrams (words + punctuation), bigrams and word classes are considered. To select and rank features we use mutual information, chi-square and information gain methods. The dataset is a collection of 3227 blogs originally derived from a blog set, and among them 1679 were written by male authors and 1548 were written...
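The mutual-information ranking mentioned above can be illustrated with a small sketch; the toy features and labels below are invented for demonstration and do not come from the blog dataset.

```python
import math
from collections import Counter

def mutual_information(feature, labels):
    """Mutual information (in bits) between a discrete feature and class labels."""
    n = len(labels)
    joint = Counter(zip(feature, labels))
    pf = Counter(feature)
    pl = Counter(labels)
    mi = 0.0
    for (f, l), c in joint.items():
        # p(f,l) * log2( p(f,l) / (p(f) * p(l)) )
        mi += (c / n) * math.log2(c * n / (pf[f] * pl[l]))
    return mi

# Toy gender-classification data: feature A tracks the label, feature B is noise.
labels    = ['m', 'm', 'f', 'f', 'm', 'f']
feature_a = [1, 1, 0, 0, 1, 0]   # perfectly informative
feature_b = [1, 0, 1, 0, 1, 0]   # mostly uninformative
print(mutual_information(feature_a, labels))  # 1.0 bit
print(mutual_information(feature_b, labels))
```

Ranking then amounts to computing this score for every candidate feature and keeping the top-k; chi-square and information gain are used the same way with different scoring functions.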
This paper investigates the relationship between rhythm metrics and the ability to classify speakers by gender and/or social environment, which may be affected by factors such as second-language effects and ways of living as expressed through speech. The BBN/AUB (BBN Technologies and American University of Beirut) corpus was used; it contains four subsets of native Levantine dialect...
The GRBAS scale is a widely used subjective measure of voice quality. The aim of this paper is to investigate the correlation between the 'grade', 'roughness', 'breathiness', 'asthenia' and 'strain' dimensions of this scale and the objective measurements provided by the 'Analysis of Dysphonia in Speech and Voice' (ADSV) software package. To do this, 107 voice samples were collected in...
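The dimension-by-dimension correlation analysis can be sketched with a plain Pearson correlation; the ratings and measurements below are hypothetical placeholders, not values from the 107-sample study.

```python
import math

def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

# Hypothetical data: GRBAS 'grade' ratings (0-3) vs. one objective measure.
grade   = [0, 1, 1, 2, 3, 3, 2]
measure = [1.1, 2.0, 2.2, 3.1, 4.2, 3.9, 2.8]
print(round(pearson_r(grade, measure), 3))
```

Since GRBAS ratings are ordinal, a rank correlation such as Spearman's rho is often reported alongside Pearson's r in this kind of study.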
Audio classification is one of the most important tasks in content-based analysis and can be applied in many audio applications, such as indexing and retrieval. This paper addresses the problem of broadcast news audio classification, using a support vector machine binary tree (SVM-BT) architecture, into five classes: pure speech, speech with music, speech with environment sound, pure music and...
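A minimal sketch of how an SVM-BT routes a sample through a tree of binary decisions, with trivial threshold functions standing in for trained binary SVMs. The scores, thresholds and tree layout are illustrative only (only four of the five classes are shown).

```python
class SVMBTNode:
    """Node of a binary classification tree. Internal nodes hold a binary
    classifier that routes a sample left or right; leaves hold a class label."""
    def __init__(self, label=None, classifier=None, left=None, right=None):
        self.label, self.classifier = label, classifier
        self.left, self.right = left, right

    def predict(self, x):
        if self.label is not None:          # leaf: final class
            return self.label
        branch = self.left if self.classifier(x) else self.right
        return branch.predict(x)

# Stand-ins for trained binary SVMs (True routes left).
# A sample x = (speech_score, music_score); real systems would learn these.
tree = SVMBTNode(
    classifier=lambda x: x[0] > 0.5,                    # speech present?
    left=SVMBTNode(
        classifier=lambda x: x[1] > 0.5,                # music too?
        left=SVMBTNode(label="speech with music"),
        right=SVMBTNode(label="pure speech")),
    right=SVMBTNode(
        classifier=lambda x: x[1] > 0.5,
        left=SVMBTNode(label="pure music"),
        right=SVMBTNode(label="environment sound")))

print(tree.predict((0.9, 0.1)))  # pure speech
print(tree.predict((0.2, 0.8)))  # pure music
```

The appeal of the tree layout is that an N-class problem needs only N-1 binary classifiers and each sample traverses at most the tree depth, rather than voting over all pairwise SVMs.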
Unvoiced-voiced portions of cochannel speech contain considerable amounts of both voiced and unvoiced speech and play a significant role in separation. Motivated by recent developments in separation of speech from nonspeech noise, we propose a classification-based approach for unvoiced-voiced speech separation. A new feature set consisting of pitch-based features and gammatone frequency cepstral coefficients...
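The pitch-based part of such a feature set can be illustrated with a basic autocorrelation pitch estimator; this is a generic sketch, not the paper's feature extractor, and the search range is a common assumption for human voicing.

```python
import math

def autocorr_pitch(frame, sample_rate, fmin=80.0, fmax=400.0):
    """Estimate pitch (Hz) of a frame by locating the autocorrelation peak
    within the plausible lag range for human voicing."""
    lag_min = int(sample_rate / fmax)
    lag_max = int(sample_rate / fmin)
    best_lag, best_val = lag_min, float('-inf')
    for lag in range(lag_min, min(lag_max, len(frame) - 1) + 1):
        val = sum(frame[i] * frame[i + lag] for i in range(len(frame) - lag))
        if val > best_val:
            best_val, best_lag = val, lag
    return sample_rate / best_lag

sr = 8000
frame = [math.sin(2 * math.pi * 200 * n / sr) for n in range(800)]
print(round(autocorr_pitch(frame, sr)))  # ~200
```

In a cochannel mixture the autocorrelation typically shows peaks from both talkers, which is why pitch features are combined with spectral features such as gammatone frequency cepstral coefficients rather than used alone.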
A classification system that accurately categorizes caller behavior within Interactive Voice Response (IVR) systems would assist in developing good automated self-service applications. This paper details the implementation of such a classification system for a pay-beneficiary application. Adaptive Neuro-Fuzzy Inference System (ANFIS), feedforward Artificial Neural Network (ANN) and Support Vector Machine...
Most of the current automatic speech-based cognitive load measurement systems utilize acoustic features estimated using a mel filterbank. However, a previous study showed that a non-uniform filterbank designed specifically to emphasize cognitive load information present in low frequencies was more effective than a mel filterbank under noise-free conditions. This paper investigates the effectiveness...
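For reference, a mel filterbank of the kind such systems start from can be sketched as follows (triangular filters spaced uniformly on the mel scale). The parameter values are arbitrary, and a non-uniform filterbank would simply place the band edges differently.

```python
import math

def hz_to_mel(f):
    return 2595.0 * math.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters with centers uniformly spaced on the mel scale.
    Returns n_filters rows, each covering the n_fft // 2 + 1 FFT bins."""
    mel_max = hz_to_mel(sample_rate / 2)
    mel_points = [i * mel_max / (n_filters + 1) for i in range(n_filters + 2)]
    bins = [int((n_fft + 1) * mel_to_hz(m) / sample_rate) for m in mel_points]
    bank = []
    for f in range(1, n_filters + 1):
        row = [0.0] * (n_fft // 2 + 1)
        for k in range(bins[f - 1], bins[f]):              # rising slope
            row[k] = (k - bins[f - 1]) / (bins[f] - bins[f - 1])
        for k in range(bins[f], bins[f + 1]):              # falling slope
            row[k] = (bins[f + 1] - k) / (bins[f + 1] - bins[f])
        bank.append(row)
    return bank

fb = mel_filterbank(n_filters=10, n_fft=512, sample_rate=16000)
print(len(fb), len(fb[0]))  # 10 filters over 257 FFT bins
```

Features are obtained by multiplying a power spectrum by each row and taking the log of the resulting band energies; the contrast drawn in the abstract is entirely about where those band edges sit.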
We present in this paper a framework for audio concept identification based on audio stream analysis and binary classifier encapsulation. The system consists of three stages. The first stage is the audio pre-processing level, where the audio stream is segmented and silence segments are detected. In the second stage, speech, music and environmental sounds are automatically separated and further classified...
This paper proposes a new approach to improve the amount of information extracted from the speech aiming to increase the accuracy of a system developed for the automatic detection of pathological voices. The paper addresses the discrimination capabilities of 11 features extracted using nonlinear analysis of time series. Two of these features are based on conventional nonlinear statistics (largest...
The field of Text Mining has evolved over the past years to analyze textual resources, but it can also be used in several other applications. In this research, we are particularly interested in applying text mining techniques to audio materials after transcribing them into text in order to detect speakers' emotions. We describe our overall methodology and present our experimental results. In...
Automatic recognition of emotional states via speech signal has attracted increasing attention in recent years. A number of techniques have been proposed which are capable of providing reasonably high accuracy for controlled studio settings. However, their performance is considerably degraded when the speech signal is contaminated by noise. In this paper, we present a framework with adaptive noise...
Speech production and phonetic features gradually improve in children through the audio feedback obtained after cochlear implantation or with a hearing aid. In this study, voice disorders in children with cochlear implants and hearing aids are classified. 30 Persian children participated in the study, including 6 children in each of levels 1 to 3 and 12 in level 4. Voice samples of 5 isolated Persian words...
This paper presents the building of a part-of-speech (POS) tagger for the Malayalam language using a Support Vector Machine (SVM). A POS tagger plays an important role in natural language applications such as speech recognition, natural language parsing, information retrieval and information extraction. This supervised machine learning POS tagging approach requires a large annotated training corpus to tag...
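A common way to featurize tokens for an SVM POS tagger is a context window around each word. The sketch below is a generic illustration; the tokens and feature names are invented, not taken from the paper's corpus.

```python
def window_features(words, i, size=2):
    """Feature dict for word i: the word itself plus its neighbors in a
    +/-size context window, padded with boundary markers."""
    padded = ["<s>"] * size + words + ["</s>"] * size
    feats = {"word": words[i]}
    for off in range(-size, size + 1):
        if off != 0:
            feats["w%+d" % off] = padded[i + size + off]
    feats["suffix3"] = words[i][-3:]   # simple morphological cue
    return feats

sent = ["avan", "veettil", "pokunnu"]   # hypothetical Malayalam tokens
print(window_features(sent, 1))
```

Each feature dict is then vectorized (e.g. one-hot) and fed to the SVM; suffix features matter particularly for a morphologically rich language like Malayalam, where unseen word forms are frequent.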
Research in time-frequency distributions (TFDs) is limited in terms of its use of the available signal domains and its target applications. Most work up to now has concentrated mainly on the t-f domain. This work presents a detailed study of the ambiguity domain (AD), its resemblance to the t-f space and the significance of using such a representation. Further, a novel...
The Partitioned Feature-based Classifier (PFC) is proposed in this paper. PFC does not use the entire feature vector extracted from the original data at once to classify each datum, but instead uses groups of related features to classify data separately. In the training stage, the contribution rate of each feature vector group is derived from the accuracy of each feature...
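Based on the description above, PFC-style prediction can be sketched as a contribution-rate-weighted vote over per-group classifiers. The groups, stand-in classifiers and rates below are hypothetical; real contribution rates would come from each group's training accuracy.

```python
def pfc_predict(feature_groups, group_classifiers, contribution_rates):
    """Classify each feature group separately, then take a weighted vote:
    each group's vote is scaled by its contribution rate."""
    votes = {}
    for group, clf, rate in zip(feature_groups, group_classifiers,
                                contribution_rates):
        label = clf(group)
        votes[label] = votes.get(label, 0.0) + rate
    return max(votes, key=votes.get)

# Hypothetical feature groups and trivial stand-in classifiers.
groups = [[0.9, 0.8], [0.2], [0.7, 0.6, 0.5]]
clfs = [
    lambda g: "A" if sum(g) > 1.0 else "B",
    lambda g: "B",
    lambda g: "A" if g[0] > 0.5 else "B",
]
rates = [0.9, 0.6, 0.7]   # contribution rates, e.g. per-group training accuracy
print(pfc_predict(groups, clfs, rates))  # "A" (0.9 + 0.7 outweighs 0.6)
```

The design choice is that an unreliable feature group can disagree without dominating the decision, since its vote is discounted by its low contribution rate.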
The human voice is primarily a carrier of speech, but it also contains non-linguistic features unique to a speaker and indicative of various speaker demographics, e.g. gender, nativity, ethnicity. Such characteristics are helpful cues for audio/video search and retrieval. In this paper, we evaluate the effects of various low-, mid-, and high-level features for effective classification of speaker characteristics...