Search results

Items from 1 to 20 out of 30 results

chapter

Emotion Detection through Speech and Facial Expressions

Krishna Mohan Kudiri, Abas Md. Said, M. Yunus Nayan

2014 International Conference on Computer Assisted System in Health > 26 - 31

2014 International Conference on Computer Assisted System in Health (CASH)

Human machine interaction is one of the most burgeoning area of research in the field of information technology. To date a majority of research in this field has been conducted using unimodal and multimodal systems with asynchronous data. Because of the above, the improper synchronization, which has become a common problem, due to that, the system complexity increases and the system response time...

chapter

Feature selection techniques for gender prediction from blogs

Shahana P. H, Bini Omman

2014 First International Conference on Networks & Soft Computing (ICNSC2014) > 355 - 359

2014 International Conference on Networks & Soft Computing (ICNSC)

The goal of this paper is to identify gender of blog authors. Features such as POS tags, unigram (words+punctuations), bigrams and word classes are considered. To synthesis/rank features we are using Mutual information, Chi-square and Information gain methods. The dataset is the collection of 3227 blogs originally derived from blogs set, and among them 1679 were written by male and 1548 were written...

chapter

Automatic age recommendation system for children's video content

Joseph Santarcangelo, Xiao-Ping Zhang

2014 IEEE International Symposium on Circuits and Systems (ISCAS) > 750 - 753

2014 IEEE International Symposium on Circuits and Systems (ISCAS)

This paper presents a novel automatic method to de- termine the appropriate age of video content in a video database geared to children. When combined with classical features the system improves accuracy rate for more than 0.13 for the same type of classifier in determining the age category of content for children between the ages of three to six years old. The main novelty of the system is that it...

chapter

Integration of MKL-Based and I-Vector-Based Speaker Verification by Short Utterances

Hideitsu Hino, Tetsuji Ogawa

2013 2nd IAPR Asian Conference on Pattern Recognition > 562 - 566

2013 2nd IAPR Asian Conference on Pattern Recognition (ACPR)

We developed a speaker verification system that is efficient for short utterances. The i-vector-based speaker representation has helped realize highly accurate speaker verification systems, however, it might be not robust against short utterances because the reliability of statistics required for extracting i-vectors is low. On the other hand, multiple kernel learning based on conditional entropy...

chapter

Discrete wavelet transforms with multiclass SVM for phoneme recognition

M. Cutajar, E. Gatt, I. Grech, O. Casha, more

Eurocon 2013 > 1695 - 1700

IEEE EUROCON 2013

A phoneme recognition system based on Discrete Wavelet Transforms (DWT) and Support Vector Machines (SVMs), is designed for multi-speaker continuous speech environments. Phonemes are divided into frames, and the DWTs are adopted, to obtain fixed dimensional feature vectors. For the multiclass SVM, the One-against-one method with the RBF kernel was implemented. To further improve the accuracies obtained,...

chapter

Application of fast learning neural-networks to identification of mixed anuran vocalizations

Chenn-Jung Huang, Chin-Fa Lin, Po-An Hsu, Yu-Wei Lee, more

IEEE Conference Anthology > 1 - 5

2013 IEEE Conference Anthology

The proposed identification system for mixed anuran vocalizations is to provide the public to easily consult online. The raw mixed anuran vocalization samples are first filtered by noise removal, high frequency compensation, and discrete wavelet transform techniques in order. An adaptive end-point detection segmentation algorithm is proposed to effectively separate the individual syllables from the...

chapter

Comparison of vector normalization methods in multi-level speaker verification

Szymon Drgas, Adam Dabrowski

2012 International Conference on Signals and Electronic Systems (ICSES) > 1 - 6

2012 International Conference on Signals and Electronic Systems (ICSES 2012)

In this article a text-independent speaker verification problem is considered. After the feature extraction, each conversation side has been represented as a vector in a fixed dimensional space. In order to reduce an influence of the lengths of utterances and also the channel properties, various vector normalization techniques have been selected from the literature, modified, and tested. Additionally,...

chapter

Comparison of different multiclass SVM methods for speaker independent phoneme recognition

M. Cutajar, E. Gatt, I. Grech, O. Casha, more

2012 5th International Symposium on Communications, Control and Signal Processing > 1 - 5

2012 5th International Symposium on Communications, Control and Signal Processing (ISCCSP)

Four multiclass Support Vector Machines (SVMs) methods were designed for the task of speaker independent phoneme recognition. These are the All-at-once, One-against-all, One-against-one, and the Directed Acyclic Graph SVM (DAGSVM). The Discrete Wavelet Transform (DWT) 8 frequency band power percentages are used for feature extraction. All tests were carried out on the TIMIT database. Comparable recognition...

chapter

New techniques for improving the practicality of an SVM-based speech/music classifier

Chungsoo Lim, Seong-Ro Lee, Yeon-Woo Lee, Joon-Hyuk Chang

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1657 - 1660

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

Variable bit-rate coding introduced for effective utilization of limited communication bandwidth requires accurate classification of input signals. This paper investigates implementation of a support vector machine (SVM)-based speech/music classifier in the selectable mode vocoder (SMV) framework, which is a standard codec adopted by the Third-Generation Partnership Project 2 (3GPP2). A support vector...

chapter

A new multiple-kernel-learning weighting method for localizing human brain magnetic activity

T. Takiguchi, T. Imada, R. Takashima, Y. Ariki, more

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 761 - 764

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

This paper shows that pattern classification based on machine learning is a powerful tool to analyze human brain activity data obtained by magnetoencephalography (MEG). We propose a new weighting method using a multiple kernel learning (MKL) algorithm to localize the brain area contributing to the accurate vowel discrimination. Our MKL simultaneously estimates both the classification boundary and...

chapter

Cluster aware normalization for enhancing audio similarity

Mathieu Lagrange, Luis Gustavo Martins, George Tzanetakis

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1969 - 1972

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

An important task in Music Information Retrieval is content-based similarity retrieval in which given a query music track, a set of tracks that are similar in terms of musical content are retrieved. A variety of audio features that attempt to model different aspects of the music have been proposed. In most cases the resulting audio feature vector used to represent each music track is high dimensional...

chapter

Adaptive Neuro Fuzzy Inference System, Neural Network and Support Vector Machine for Caller Behavior Classification

Pretesh B. Patel, Tshilidzi Marwala

2011 10th International Conference on Machine Learning and Applications and Workshops > 1 > 298 - 303

2011 Tenth International Conference on Machine Learning and Applications (ICMLA 2011)

A classification system that accurately categorizes caller behavior within Interactive Voice Response systems would assist in developing good automated self service applications. This paper details the implementation of such a classification system for a pay beneficiary application. Adaptive Neuro-Fuzzy Inference System (ANFIS), Feed forward Artificial Neural Network (ANN) and Support Vector Machine...

chapter

Short Text Categorization via Coherence Constraints

Anca Dinu

2011 13th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing > 247 - 250

2011 13th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing (SYNASC)

In this article we propose a quantitative approach to a relatively new problem: categorizing text as pragmatically correct or pragmatically incorrect (forcing the notion, coherent/incoherent). The typical text categorization criterions comprise categorization by topic, by style (genre classification, authorship identification), by expressed opinion (opinion mining, sentiment classification), etc....

chapter

The Application of Improved Sparse Least-Squares Support Vector Machine in Speaker Identification

Ruiling Luo, Wenqing Cai, Min Chen, Zhongling Han

2011 3rd International Workshop on Intelligent Systems and Applications > 1 - 4

2011 3rd International Workshop on Intelligent Systems and Applications (ISA)

SVM is a novel type of statistical learning method that has been successfully used in speaker recognition. However, training SVM consumes long computing time and large storage space with all training examples. This paper proposes an improved sparse least-squares support vector machine (LS-SVM) for speaker identification. Firstly KPCA is exploited to reduce the dimension of input vectors and to denoise...

chapter

Recognition of repetitions using Support Vector Machines

Juraj Palfy, Jiri Pospichal

Signal Processing Algorithms, Architectures, Arrangements, and Applications SPA 2011 > 1 - 6

2011 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)

The goal of this paper is to present experimental results for the automatic recognition of dysfluencies in the stuttered speech. Mel Frequency Cepstral Coeficients reduce the dimensionality of data and models of acoustic waves of human speech. The acoustic model contains the feature vectors of speech used for further processing with Support Vector Machine. SVM classifier with kernel functions efficiently...

chapter

A sub-Nyquist sampling method for computing the level-crossing-times of an analog signal: Theory and applications

C S Seelamantula

2010 International Conference on Signal Processing and Communications (SPCOM) > 1 - 5

2010 International Conference on Signal Processing and Communications (SPCOM 2010)

We address the problem of computing the level-crossings of an analog signal from samples measured on a uniform grid. Such a problem is important, for example, in multilevel analog-to-digital (A/D) converters. The first operation in such sampling modalities is a comparator, which gives rise to a bilevel waveform. Since bilevel signals are not bandlimited, measuring the level-crossing times exactly...

chapter

Lipreading Recognition Based on SVM and DTAK

He Jun, Zhang Hua

2010 4th International Conference on Bioinformatics and Biomedical Engineering > 1 - 4

2010 4th International Conference on Bioinformatics and Biomedical Engineering (iCBBE 2010)

To enhance recognition accuracy of isolated words identification with small samples in lipreading, SVM is first introduced to act as classifier in this paper. As SVM is based on structural risk minimization, it solves the problem of pattern recognition under small samples, on the other hand, it avoids the unreasonable hypothesis in traditional classifier. To meet the requirement of fixed input feature...

chapter

Experimental Research on Hiding Capacity of Echo Hiding in Voice

Li Li

2010 International Conference on Challenges in Environmental Science and Computer Engineering > 1 > 305 - 308

2010 International Conference on Challenges in Environmental Science and Computer Engineering (CESCE 2010)

Many improvements are developed for echo hiding system, but there is not an intensive study on the hiding capacity of echo hiding. Based on the speech signals with various sampling rate and single echo hiding scheme, this work explores the regular pattern of recovery accuracy and fragment length, and presents that the hiding capacity of speech clip is 55bit/s, and the capacity is not related to the...

chapter

Combining regression and classification methods for improving automatic speaker age recognition

C van Heerden, E Barnard, M Davel, C van der Walt, more

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 5174 - 5177

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

We present a novel approach to automatic speaker age classification, which combines regression and classification to achieve competitive classification accuracy on telephone speech. Support vector machine regression is used to generate finer age estimates, which are combined with the posterior probabilities of well-trained discriminative gender classifiers to predict both the age and gender of a speaker...

article

Automatic height estimation from speech in real-world setup

Todor Ganchev, Iosif Mporas, Nikos Fakotakis

02010 00018th European Signal Processing Conference > 2010 > 800 - 804

2010 18th European Signal Processing Conference

We propose a Gaussian process based regression scheme that provides a direct estimation of the height of unknown speakers and is applicable to real-world autonomous surveillance applications. This scheme relies on utterance-level speech parameterization followed by regression modelling, which estimates the height of the speaker and the uncertainty interval of that estimation. Experiments on the TIMIT...

Data set:
ieee
Keywords:
KERNEL
ACCURACY
SPEECH

Publication date

Set your own date range

Publication type

book (29)
article (1)

Keywords

SUPPORT VECTOR MACHINES (20)
FEATURE EXTRACTION (10)
TRAINING (10)
SPEECH RECOGNITION (9)
SUPPORT VECTOR MACHINE (7)
SPEAKER RECOGNITION (5)
DATA MINING (4)
VECTORS (4)
CEPSTRUM (3)
CLASSIFICATION (3)
HIDDEN MARKOV MODELS (3)
LEARNING (ARTIFICIAL INTELLIGENCE) (3)
OPTIMIZATION (3)
PATTERN CLASSIFICATION (3)
RADIAL BASIS FUNCTION NETWORKS (3)
ROBUSTNESS (3)
SPEECH PROCESSING (3)
SVM (3)
ARTIFICIAL NEURAL NETWORKS (2)
DATABASES (2)
DELAY (2)
DISCRETE WAVELET TRANSFORMS (2)
ECHO (2)
EDUCATIONAL INSTITUTIONS (2)
INFORMATION HIDING (2)
MATHEMATICAL MODEL (2)
MEL FREQUENCY CEPSTRAL COEFFICIENT (2)
PARTICLE SWARM OPTIMISATION (2)
PARTICLE SWARM OPTIMIZATION (2)
PHONEME RECOGNITION (2)
SPEAKER IDENTIFICATION (2)
SPEAKER-INDEPENDENT (2)
STATISTICAL ANALYSIS (2)
STATISTICAL LEARNING METHOD (2)
STREAMING MEDIA (2)
TAGGING (2)
TESTING (2)
ADAPTATION MODEL (1)
ADAPTIVE FILTERS (1)
ADAPTIVE NEURO-FUZZY INFERENCE SYSTEM (1)
AGE CLASSIFICATION (1)
AGE CLASSIFICATION METHOD (1)
ANALOG SIGNAL (1)
ANALOGUE-DIGITAL CONVERSION (1)
ANURAN VOCALIZATIONS (1)
ARABIC PART-OF-SPEECH TAGGER (1)
ARTICULATORY LIMITATIONS (1)
ARTIFICIAL NEURAL NERWORK (1)
ASSISTIVE SOFTWARE (1)
AUDIO (1)
AUDIO ANNOTATION (1)
AUDIO CODING (1)
AUDIO DATA STREAM (1)
AUDIO SIGNAL PROCESSING (1)
AUDIO-BASED SPEAKER CHARACTERISTIC CLASSIFICATION (1)
AUDIO/VIDEO SEARCH CUE (1)
AUDIO/VIDEO SEARCH RETRIEVAL (1)
AUTOMATIC SPEAKER AGE RECOGNITION (1)
BENGALI (1)
BLIND DETECTION (1)
BLIND SOURCE SEPARATION (1)
BLOGS (1)
BRAIN (1)
BRAIN ACTIVITY (1)
BRAIN AREA (1)
CAMERAS (1)
CAPACITY (1)
CEPSTRAL ANALYSIS (1)
CEPSTRUMS (1)
CLASSIFICATION ALGORITHM (1)
CLASSIFICATION ALGORITHMS (1)
CLASSIFIER (1)
CLUSTERING ANALYSIS (1)
CODECS (1)
COHERENCE (1)
COLOR (1)
COMPLEX CEPSTRUM (1)
COMPUTATIONAL COMPLEXITY (1)
CONFERENCES (1)
CONVERTERS (1)
DATA ENCAPSULATION (1)
DECAY RATE (1)
DECISION TREES (1)
DELAY TIME (1)
DISABLED SPEECH (1)
DISCRETE COSINE TRANSFORMS (1)
DISCRIMINANT FEATURES (1)
DISCRIMINATIVE DYSARTHRIC SPEECH CLASSIFICATION (1)
DISTANCE NORMALIZATION (1)
DTAK (1)
DYNAMIC TIME ALIGNMENT KERNEL (1)
DYSARTHRIA (1)
DYSFLUENCIES (1)
ECHO HIDING (1)
ECHO HIDING SYSTEM (1)
ECHO SUPPRESSION (1)
ELECTRONIC MAIL (1)
more

INFONA - science communication portal

Search results

Emotion Detection through Speech and Facial Expressions

Feature selection techniques for gender prediction from blogs

Automatic age recommendation system for children's video content

Integration of MKL-Based and I-Vector-Based Speaker Verification by Short Utterances

Discrete wavelet transforms with multiclass SVM for phoneme recognition

Application of fast learning neural-networks to identification of mixed anuran vocalizations

Comparison of vector normalization methods in multi-level speaker verification

Comparison of different multiclass SVM methods for speaker independent phoneme recognition

New techniques for improving the practicality of an SVM-based speech/music classifier

A new multiple-kernel-learning weighting method for localizing human brain magnetic activity

Cluster aware normalization for enhancing audio similarity

Adaptive Neuro Fuzzy Inference System, Neural Network and Support Vector Machine for Caller Behavior Classification

Short Text Categorization via Coherence Constraints

The Application of Improved Sparse Least-Squares Support Vector Machine in Speaker Identification

Recognition of repetitions using Support Vector Machines

A sub-Nyquist sampling method for computing the level-crossing-times of an analog signal: Theory and applications

Lipreading Recognition Based on SVM and DTAK

Experimental Research on Hiding Capacity of Echo Hiding in Voice

Combining regression and classification methods for improving automatic speaker age recognition

Automatic height estimation from speech in real-world setup

Filter options

Publication date

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options