Search results

Items from 1 to 20 out of 23 results

chapter

A convolutional neural network approach for acoustic scene classification

Michele Valenti, Stefano Squartini, Aleksandr Diment, Giambattista Parascandolo, more

2017 International Joint Conference on Neural Networks (IJCNN) > 1547 - 1554

2017 International Joint Conference on Neural Networks (IJCNN)

This paper presents a novel application of convolutional neural networks (CNNs) for the task of acoustic scene classification (ASC). We here propose the use of a CNN trained to classify short sequences of audio, represented by their log-mel spectrogram. We also introduce a training method that can be used under particular circumstances in order to make full use of small datasets. The proposed system...

chapter

Multimodal fusion of audio, scene, and face features for first impression estimation

Furkan Gurpinar, Heysem Kaya, Albert Ali Salah

2016 23rd International Conference on Pattern Recognition (ICPR) > 43 - 48

2016 23rd International Conference on Pattern Recognition (ICPR)

Affective computing, particularly emotion and personality trait recognition, is of increasing interest in many research disciplines. The interplay of emotion and personality shows itself in the first impression left on other people. Moreover, the ambient information, e.g. the environment and objects surrounding the subject, also affect these impressions. In this work, we employ pre-trained Deep Convolutional...

chapter

Feature extraction and target classification of side-scan sonar images

Jason Rhinelander

2016 IEEE Symposium Series on Computational Intelligence (SSCI) > 1 - 6

2016 IEEE Symposium Series on Computational Intelligence (SSCI)

Side-scan sonar technology has been used over the last three decades for underwater surveying and imaging. Application areas of side-scan sonar include archaeology, security and defence, seabed classification, and environmental surveying. In recent years the use of autonomous underwater systems has allowed for automatic collection of data. Along with automatic collection of data comes the need to...

chapter

Distance metric learning for kernel density-based acoustic model under limited training data conditions

Van Hai Do, Xiong Xiao, Eng Siong Chng, Haizhou Li

2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 54 - 58

2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

Kernel density model works well for limited training data in acoustic modeling. In this paper, we improve the kernel density-based acoustic model for low resource language speech recognition. In our previous study, we demonstrated the effectiveness of the kernel density-based acoustic model on discriminative features such as cross-lingual bottleneck features. In this paper, we propose to learn a Mahalanobis-based...

chapter

Multilingual exemplar-based acoustic model for the NIST Open KWS 2015 evaluation

Van Hai Do, Xiong Xiao, Haihua Xu, Eng Siong Chng, more

2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 594 - 98

2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

In this paper, we investigate the use of the proposed non-parametric exemplar-based acoustic modeling for the NIST Open Keyword Search 2015 Evaluation. Specifically, kernel-density model is used to replace GMM in HMM/GMM (Hidden Markov Model / Gaussian Mixture Model) or DNN in HMM/DNN (Hidden Markov Model / Deep Neural Network) acoustic model to predict the emission probability of HMM states. To get...

chapter

GA based selection and parameter optimization for an SVM based underwater target classifier

B. M. Sherin, Supriya M. H.

2015 International Symposium on Ocean Electronics (SYMPOL) > 1 - 7

2015 International Symposium on Ocean Electronics (SYMPOL)

Underwater target classification is a very demanding task owing to ever changing complicated nature of the underwater communication channels. Underwater target classification system identifies targets from a mixture of underwater events by its characteristic signature. The characteristic signatures pertaining to each target are patterned by feature recognition algorithms operating on hydrophone captured...

chapter

Improved language identification using deep bottleneck network

Yan Song, Ruilian Cui, Xinhai Hong, Ian Mcloughlin, more

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4200 - 4204

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Effective representation plays an important role in automatic spoken language identification (LID). Recently, several representations that employ a pre-trained deep neural network (DNN) as the front-end feature extractor, have achieved state-of-the-art performance. However the performance is still far from satisfactory for dialect and short-duration utterance identification tasks, due to the deficiency...

chapter

Step-size control for acoustic echo cancellers applying an RBF network

Andreas Mader

2000 10th European Signal Processing Conference > 1 - 4

2000 10th European Signal Processing Conference

The control of an acoustic echo canceller (AEC) is an essential part of hands-free telephone sets. Due to the fact that no single estimator is yet known to reliably control the AEC, various estimators should be implemented. Nevertheless, the combination of several estimators is quite difficult and usually determined heuristically. In this paper, an approach for automatic combination of estimators,...

chapter

Selection and parameter optimization of SVM kernel function for underwater target classification

Sherin B. M., Supriya M. H.

2015 IEEE Underwater Technology (UT) > 1 - 5

2015 IEEE Underwater Technology (UT)

The identification and classification of noise sources in the ocean has become a key task of modern underwater acoustic signal processing and because of the ever changing and complicated oceanic environment, underwater target classification has become a demanding task. An underwater acoustic target classification system identifies the acoustic target from the characteristic acoustic signature. The...

chapter

Experimental and Computational Materials Defects Investigation

Michele Buonsanti, Matteo Cacciola, Francis Cirianni, Giovanni Leonardi, more

2013 8th EUROSIM Congress on Modelling and Simulation > 167 - 172

2013 8th EUROSIM Congress on Modelling and Simulation (EUROSIM)

Production of railway axles (i.e., one of the basic material of the modern train) is an elaborate process unfree from faults and problems. Errors during the manufacturing or the plies' overlapping, in fact, can cause particular flaws in the resulting material, so compromising its same integrity. Within this framework, ultrasonic tests could be useful to characterize the presence of defect, depending...

chapter

Discriminative training of weighted polynomial vector for acoustic language recognition

Ce Zhang, Rong Zheng, Bo Xu

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4849 - 4852

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

In this paper, we propose a discriminative method for the acoustic feature based language recognizer, which is a modification of the polynomial expansion in generalized linear discriminant sequence (GLDS) kernel. It is inspired by the Gaussian mixture model-support vector machine (GMM-SVM) system which has been successfully used in both speaker and language recognition. Because of the restriction...

chapter

Kernel multi-metric learning for multi-channel transient acoustic signal classification

Haichao Zhang, Yanning Zhang, Nasser M. Nasrabadi, Thomas S. Huang

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1989 - 1992

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

In this paper, we propose a kernel multi-metric learning algorithm for multi-channel transient acoustic signal classification. The proposed method learns a set of metrics jointly for multi-channel transient acoustic signals in a kernel-induced feature space to exploit the non-linearity of the data for improving the classification performance. An effective algorithm is developed for the task of learning...

chapter

Acoustic model topology optimization using evolutionary methods

Xirimo Bao, Guanglai Gao

The First Asian Conference on Pattern Recognition > 355 - 361

2011 First Asian Conference on Pattern Recognition (ACPR 2011)

Currently, most of the acoustic model selection work is done empirically or heuristically or even arbitrarily. In this paper, Genetic Algorithm (GA) based and Particle Swarm Optimization (PSO) based algorithms that consider the number of states and the kernel numbers for the states simultaneously and reject the uniform allocation of Gaussian kernels are proposed to automatically optimize acoustic...

chapter

Supervised source localization using diffusion kernels

Ronen Talmon, Israel Cohen, Sharon Gannot

2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) > 245 - 248

2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

Recently, we introduced a method to recover the controlling parameters of linear systems using diffusion kernels. In this paper, we apply our approach to the problem of source localization in a reverberant room using measurements from a single microphone. Prior recordings of signals from various known locations in the room are required for training and calibration. The proposed algorithm relies on...

chapter

Transient acoustic signal classification using joint sparse representation

Haichao Zhang, Nasser M. Nasrabadi, Thomas S. Huang, Yanning Zhang

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 2220 - 2223

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, we present a novel joint sparse representation based method for acoustic signal classification with multiple measurements. The proposed method exploits the correlations among the multiple measurements with the notion of joint sparsity for improving the classification accuracy. Extensive experiments are carried out on real acoustic data sets and the results are compared with the conventional...

chapter

PAC-Bayesian approach for minimization of phoneme error rate

Joseph Keshet, David McAllester, Tamir Hazan

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 2224 - 2227

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We describe a new approach for phoneme recognition which aims at minimizing the phoneme error rate. Building on structured prediction techniques, we formulate the phoneme recognizer as a linear combination of feature functions. We state a PAC-Bayesian generalization bound, which gives an upper-bound on the expected phoneme error rate in terms of the empirical phoneme error rate. Our algorithm is derived...

chapter

Whole word discriminative point process models

Aren Jansen

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5180 - 5183

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper introduces a discriminative extension to whole-word point process modeling techniques. Meant to circumvent the strong independence assumptions of their generative predecessors, discriminative point process models (DPPM) are trained to distinguish the composite temporal patterns of phonetic events produced for a given word from those of its impostors. Using correct and incorrect word hypotheses...

chapter

Arccosine kernels: Acoustic modeling with infinite neural networks

Chih-Chieh Cheng, Brian Kingsbury

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5200 - 5203

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Neural networks are a useful alternative to Gaussian mixture models for acoustic modeling; however, training multilayer networks involves a difficult, nonconvex optimization that requires some “art” to make work well in practice. In this paper we investigate the use of arccosine kernels for speech recognition, using these kernels in a hybrid support vector machine/hidden Markov model recognition system...

chapter

Hidden defects diagnosis using parameter optimization based support vector machine (SVM)

W.Y. Leong, Wen Xiang Chun

2010 IEEE 15th Conference on Emerging Technologies&Factory Automation (ETFA 2010) > 1 - 4

2010 IEEE 15th Conference on Emerging Technologies & Factory Automation (ETFA 2010)

In this paper, the issue of composite defects diagnosis by applying the support vector machine (SVM) was addressed. The component analysis was performed initially to extract the features and to reduce the dimensionality of original data features. Kernel parameters selection of support vector machine which has great influence on the performance of defects classification has been discussed in this work...

chapter

Support vector machines for noise robust ASR

M.J.F. Gales, A. Ragni, H. AlDamarki, C. Gautier

2009 IEEE Workshop on Automatic Speech Recognition&Understanding > 205 - 210

2009 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU 2009)

Using discriminative classifiers, such as Support Vector Machines (SVMs) in combination with, or as an alternative to, Hidden Markov Models (HMMs) has a number of advantages for difficult speech recognition tasks. For example, the models can make use of additional dependencies in the observation sequences than HMMs provided the appropriate form of kernel is used. However standard SVMs are binary classifiers,...

Keywords:
TRAINING
ACOUSTICS
KERNEL

Publication date

Set your own date range

Keywords

SUPPORT VECTOR MACHINES (11)
HIDDEN MARKOV MODELS (7)
FEATURE EXTRACTION (6)
SPEECH (6)
SPEECH RECOGNITION (6)
DATA MODELS (3)
NEURAL NETWORKS (3)
SPEECH PROCESSING (3)
SUPPORT VECTOR MACHINE (3)
VECTORS (3)
CLASSIFICATION ALGORITHMS (2)
DISCRIMINATIVE TRAINING (2)
ERROR ANALYSIS (2)
GENETIC ALGORITHM (2)
GENETIC ALGORITHMS (2)
KERNEL FUNCTION (2)
LEARNING (ARTIFICIAL INTELLIGENCE) (2)
MEASUREMENT (2)
OPTIMIZATION (2)
PARTICLE SWARM OPTIMIZATION (2)
RADIAL BASIS FUNCTION NETWORKS (2)
SUPPORT VECTOR MACHINE CLASSIFICATION (2)
SVM (2)
UNDERWATER TARGET CLASSIFIER (2)
ACCURACY (1)
ACOUSTIC EMISSION (1)
ACOUSTIC LOCALIZATION (1)
ACOUSTIC MODEL (1)
AQUACULTURE (1)
ARTIFICIAL NEURAL NETWORKS (1)
AURORA 2.0 (1)
AUTOMATIC LANGUAGE IDENTIFICATION (1)
AUTONOMOUS SYSTEMS (1)
AXLES (1)
AZIMUTH (1)
BAT ALGORITHM (1)
BINARY CLASSIFIERS (1)
BIOLOGICAL NEURAL NETWORKS (1)
BOOKS (1)
BOTTLENECK FEATURE (1)
CODEBOOK (1)
CODEBOOK BASED MODEL (1)
COMPUTATIONAL MODELING (1)
DATA MINING (1)
DEEP NEURAL NETWORK (1)
DIFFUSION GEOMETRY (1)
DIFFUSION KERNEL (1)
EDUCATIONAL INSTITUTIONS (1)
EMOTION RECOGNITION (1)
EQUAL ERROR RATE (1)
EVOLUTIONARY METHODS (1)
FACE (1)
FISH SCHOOL RECOGNITION (1)
FISHERIES ACOUSTICS (1)
GAUSSIAN BASIS FUNCTION (1)
GMM (1)
HISTOGRAMS (1)
HMM (1)
HYBRID SYSTEMS (1)
IMAGE LEVEL INFERENCE (1)
INFERENCE MECHANISMS (1)
INTONATION PATTERN (1)
JOINT SPARSE RECOVERY (1)
JOINT SPARSITY CLASSIFICATION (1)
JOINTS (1)
KERNEL LEARNING (1)
KERNEL METHODS (1)
KERNEL PARAMETER (1)
KERNELS (1)
LANGUAGE IDENTIFICATION (1)
LANGUAGE RECOGNITION (1)
LATTICES (1)
LINE SPECTRAL FREQUENCIES (1)
LOGISTICS (1)
MANIFOLD LEARNING (1)
MATERIALS (1)
MATHEMATICAL MODEL (1)
MAXIMUM MUTUAL INFORMATION (1)
METRIC LEARNING (1)
MICROPHONES (1)
MODEL SELECTION (1)
MULTI-CLASS LOGISTIC REGRESSION (1)
MULTICHANNEL ACOUSTIC SIGNAL CLASSIFICATION (1)
N-GRAM PROBABILITY (1)
NDT/E (1)
NEURONS (1)
NIST (1)
NOISE (1)
NOISE CORRUPTED DIGIT SEQUENCE TASKS (1)
NOISE MEASUREMENT (1)
NOISE ROBUST ASR (1)
NONLINEAR FUNCTIONS (1)
NONLINEAR MAPPING FUNCTION (1)
OBJECT LEVEL INFERENCE (1)
OBJECT RECOGNITION (1)
OBJECT RECOGNITION MODEL TRAINING (1)
PAC-BAYESIAN THEOREM (1)
more

INFONA - science communication portal

Search results

A convolutional neural network approach for acoustic scene classification

Multimodal fusion of audio, scene, and face features for first impression estimation

Feature extraction and target classification of side-scan sonar images

Distance metric learning for kernel density-based acoustic model under limited training data conditions

Multilingual exemplar-based acoustic model for the NIST Open KWS 2015 evaluation

GA based selection and parameter optimization for an SVM based underwater target classifier

Improved language identification using deep bottleneck network

Step-size control for acoustic echo cancellers applying an RBF network

Selection and parameter optimization of SVM kernel function for underwater target classification

Experimental and Computational Materials Defects Investigation

Discriminative training of weighted polynomial vector for acoustic language recognition

Kernel multi-metric learning for multi-channel transient acoustic signal classification

Acoustic model topology optimization using evolutionary methods

Supervised source localization using diffusion kernels

Transient acoustic signal classification using joint sparse representation

PAC-Bayesian approach for minimization of phoneme error rate

Whole word discriminative point process models

Arccosine kernels: Acoustic modeling with infinite neural networks

Hidden defects diagnosis using parameter optimization based support vector machine (SVM)

Support vector machines for noise robust ASR

Filter options

Publication date

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options