Search results

Items from 61 to 80 out of 996 results

chapter

Bird-phrase segmentation and verification: A noise-robust template-based approach

Kantapon Kaewtip, Lee Ngee Tan, Charles E. Taylor, Abeer Alwan

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 758 - 762

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, we present a birdsong-phrase segmentation and verification algorithm that is robust to limited training data, class variability, and noise. The algorithm comprises a noise-robust, Dynamic-Time-Warping (DTW)-based segmentation and a discriminative classifier for outlier rejection. The algorithm utilizes DTW and prominent (high energy) time-frequency regions of training spectrograms to...

chapter

Weighted training for speech under Lombard Effect for speaker recognition

Muhammad Muneeb Saleem, Gang Liu, John H.L. Hansen

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4350 - 4354

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The presence of Lombard Effect in speech is proven to have severe effects on the performance of speech systems, especially speaker recognition. Varying kinds of Lombard speech are produced by speakers under influence of varying noise types [1]. This study proposes a high-accuracy classifier using deep neural networks for detecting various kinds of Lombard speech against neutral speech, independent...

chapter

Regularizing DNN acoustic models with Gaussian stochastic neurons

Hao Zhang, Yajie Miao, Florian Metze

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4964 - 4968

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Dropout and DropConnect can be viewed as regularization methods for deep neural network (DNN) training. In DNN acoustic modeling, the huge number of speech samples makes it expensive to sample the neuron mask (Dropout) or the weight mask (DropConnect) repetitively from a high dimensional distribution. In this paper we investigate the effect of Gaussian stochastic neurons on DNN acoustic modeling....

chapter

Deep NMF for speech separation

Jonathan Le Roux, John R. Hershey, Felix Weninger

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 66 - 70

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Non-negative matrix factorization (NMF) has been widely used for challenging single-channel audio source separation tasks. However, inference in NMF-based models relies on iterative inference methods, typically formulated as multiplicative updates. We propose “deep NMF”, a novel non-negative deep network architecture which results from unfolding the NMF iterations and untying its parameters. This...

chapter

Averaging random projection: A fast online solution for large-scale constrained stochastic optimization

Jialin Liu, Yuantao Gu, Mengdi Wang

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 3586 - 3590

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Stochastic optimization finds wide application in signal processing, online learning, and network problems, especially problems processing large-scale data. We propose an Incremental Constraint Averaging Projection Method (ICAPM) that is tailored to optimization problems involving a large number of constraints. The ICAPM makes fast updates by taking sample gradients and averaging over random constraint...

chapter

Approximate best linear unbiased channel estimation with CFAR detection for frequency selective sparse multipath channels with long delay spreads

Serdar Ozen

2005 13th European Signal Processing Conference > 1 - 4

2005 13th European Signal Processing Conference

We provide a non-iterative channel impulse response (CIR) estimation algorithm for communication systems which utilize a periodically transmitted training sequence within a continuous stream of information symbols. The non-iterative channel estimate is an approximation to the Best Linear Unbiased Estimate (BLUE) of the CIR, achieving almost similar performance, with much lower complexity. We first...

chapter

Multimodal biometric score fusion: The Mean Rule vs. support vector classifiers

Sonia Garcia-Salicetti, Mohamed Anouar Mellakh, Lorene Allano, Bernadette Dorizzi

2005 13th European Signal Processing Conference > 1 - 4

2005 13th European Signal Processing Conference

Recently, a discrepancy in results has appeared in the literature concerning score fusion methods, classified into “combination methods” and “classification methods” [1]. Some works suggest that a simple Arithmetic Mean Rule (AMR) can outperform some training-based methods on multimodal data [2], while others favour, among other trained classifiers, a Support Vector Machine [3]. This paper makes a comparative...

chapter

Noise power spectral density estimation from noisy speech using on-line trained hidden Markov models

Karsten Vandborg Sorensen, Soren Vang Andersen

2005 13th European Signal Processing Conference > 1 - 4

2005 13th European Signal Processing Conference

In this paper we describe a method for estimation of noise power spectral densities from a noisy speech signal. The method is used in conjunction with a time-frequency domain speech presence detection method that provides connected time-frequency regions of each decision type. In speech absence regions hidden Markov models are trained on-line and in speech presence regions the trained models are used...

chapter

Sparse Representation Based Distributed Multisensor Track Fusion

Wang Huan, Sun Jinping

2015 2nd International Conference on Information Science and Control Engineering > 476 - 479

2015 2nd International Conference on Information Science and Control Engineering (ICISCE)

In the distributed multisensory information fusion system, each local sensor independently forms local tracks, and multisensory track fusion refers to fusing multiple local tracks that represent the same target into one global track. By studying the theory of multisensory track fusion and signal sparse representation, a sparse representation based multisensory track fusion algorithm is proposed. This...

chapter

Solving optimization tasks in construction of FDI systems: An evolutionary approach

A. Obuchowicz, J. Korbicz

2001 European Control Conference (ECC) > 1647 - 1652

2001 European Control Conference (ECC)

Model-based FDI systems are considered here. The problem of constructing the diagnosed system model as well as the automatic search for the best rule base of the residual analyzer is reduced to a set of global optimization tasks. Various optimization problems are considered depending on the chosen technology of the non-analytical model construction as well as that of the residual evaluation. Most...

chapter

Robust decision feedback equalizer design via the solution of a regularized least squares problem

P.R. Fraanje, M. Verhaegen, N.J. Doelman

2001 European Control Conference (ECC) > 906 - 911

2001 European Control Conference (ECC)

This paper presents a method to estimate a Decision Feedback Equalizer (DFE) directly from training data, which is robust w.r.t. time-variations in the communication channel. It is based on the indirect method proposed in [15], where the time variations in the channel are modeled as a probabilistic uncertainty. The robust DFE optimizes the performance by minimizing the mean squared error averaged...

chapter

Using models of the Human Visual System in the design of stack filters for the enhancement of color images

Jen Huang, Edward J. Coyle

2000 10th European Signal Processing Conference > 1 - 4

2000 10th European Signal Processing Conference

A technique is developed for utilizing models of the Human Visual System to improve the design of filters for the enhancement of color images. The technique uses an image fidelity measure based on models of the human visual system — such as the Visible Differences Predictor (VDP) — in a nested loop training algorithm. In the inner loop of the algorithm, a stack filter is trained under a Weighted Mean...

chapter

Anatomical structure labeling in apical four-chamber view echocardiogram images

Yu Cao, Colin B Compas, Hongzhi Wang, Tanveer F Syeda-Mahmood

Computing in Cardiology 2014 > 317 - 320

2014 Computing in Cardiology Conference (CinC)

Anatomical structure labeling in echocardiogram images will assist cardiac disease diagnosis by providing a framework for doing geometrical statistics. General labeling algorithms often focus on stationary body structures and do not perform well in echocardiography due to cardiac motion, low signal to noise ratio, and structural deformation caused by diseases. In this paper, we propose a new method...

chapter

Bispectral Gammatone Cepstral Coefficient based Neural Network Classifier

Mohankumar K., Supriya M.H., P.R. Saseendran Pillai

2015 IEEE Underwater Technology (UT) > 1 - 5

2015 IEEE Underwater Technology (UT)

The estimation of the power spectrum of discrete-time signals is one of the most fundamental and useful tools in signal processing. However, there are practical situations where one needs to look beyond the power spectrum, especially to extract information regarding the phase relations and deviations from Gaussianity. This has created considerable interest in the use of higher order spectra such as...

chapter

A Robust Adaptive Classifier for Detector Adaptation in a Video

Pramod Sharma, Ram Nevatia

2015 IEEE Winter Conference on Applications of Computer Vision > 921 - 928

2015 IEEE Winter Conference on Applications of Computer Vision (WACV)

We propose a novel method for improved object detection in a video. Our approach adapts a generic offline trained detector (OTD) to a specific test video by collecting online samples in an unsupervised manner. Most of the existing adaptation methods focus on collecting confident online samples and do not address how to deal with ambiguous and noisy online samples. We address the importance of collecting...

chapter

Enhanced local feature approach for overlapping sound event recognition

Jonathan Dennis, Huy Dat Tran

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific > 1 - 4

2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

In this paper, we propose a feature-based approach to address the challenging task of recognising overlapping sound events from single channel audio. Our approach is based on our previous work on Local Spectrogram Features (LSFs), where we combined a local spectral representation of the spectrogram with the Generalised Hough Transform (GHT) voting system for recognition. Here we propose to take the...

chapter

Joint time synchronization and channel estimation for two-way amplify-and-forward relay systems

Chin-Liang Wang, Po-Chun Chiu, Hung-Chin Wang

2014 IEEE Global Communications Conference > 3543 - 3548

GLOBECOM 2014 - 2014 IEEE Global Communications Conference

In this paper, we consider a two-way relay system where two terminals exchange their information via an amplify-and-forward relay in a bi-directional manner. Due to the two-way relay protocol, signals from both terminals travel through different cascaded channels, and this makes synchronization and channel estimation much more complicated than those in conventional one-way relay systems. To cope with...

chapter

Noisy speech recognition using blind spatial subtraction array technique and deep bottleneck features

Norihide Kitaoka, Tomoki Hayashi, Kazuya Takeda

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific > 1 - 5

2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

In this study, we investigate the effect of blind spatial subtraction arrays (BSSA) on speech recognition systems by comparing the performance of a method using Mel-Frequency Cepstral Coefficients (MFCCs) with a method using Deep Bottleneck Features (DBNF) based on Deep Neural Networks (DNN). Performance is evaluated under various conditions, including noisy, in-vehicle conditions. Although performance...

chapter

Extreme learning machine with dead zone and its application to WiFi based indoor positioning

Xiaoxuan Lu, Chengpu Yu, Han Zou, Hao Jiang, et al.

2014 13th International Conference on Control Automation Robotics & Vision (ICARCV) > 625 - 630

2014 13th International Conference on Control Automation Robotics & Vision (ICARCV)

Extreme learning machine (ELM) as an emergent technology has shown its good performance in regression applications as well as in large dataset classification applications. It has been broadly embedded in many applications due to its fast speed of computation and accuracy. How to make good use of machine learning techniques in Indoor Positioning System (IPS) is a hot research topic in recent years...

chapter

Robust impaired speech segmentation using neural network mixture model

Sunday Iliya, Dylan Menzies, Ferrante Neri, Pip Cornelius, et al.

2014 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT) > 444 - 449

2014 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT)

This paper presents a signal processing technique for segmenting short speech utterances into unvoiced and voiced sections and identifying points where the spectrum becomes steady. The segmentation process is part of a system for deriving musculoskeletal articulation data from disordered utterances, in order to provide training feedback for people with speech articulation problems. The approach implement...

Keywords:
TRAINING
NOISE

Content availability

Available (988)
None (8)

Keywords

ARTIFICIAL NEURAL NETWORKS (201)
FEATURE EXTRACTION (195)
SUPPORT VECTOR MACHINES (180)
ACCURACY (168)
NOISE MEASUREMENT (137)
DATA MINING (122)
SPEECH (121)
CLASSIFICATION ALGORITHMS (109)
ROBUSTNESS (103)
LEARNING (ARTIFICIAL INTELLIGENCE) (102)
TESTING (96)
MATHEMATICAL MODEL (91)
SIGNAL PROCESSING (89)
DATA MODELS (83)
ESTIMATION (83)
NEURAL NETS (80)
SPEECH RECOGNITION (78)
ALGORITHM DESIGN AND ANALYSIS (77)
PATTERN CLASSIFICATION (77)
HIDDEN MARKOV MODELS (76)
TRAINING DATA (70)
DATABASES (69)
VECTORS (69)
EQUATIONS (67)
KERNEL (66)
OPTIMIZATION (64)
PATTERN RECOGNITION (63)
COMPUTATIONAL MODELING (61)
CORRELATION (61)
MACHINE LEARNING (57)
NEURONS (56)
EDUCATIONAL INSTITUTIONS (54)
IMAGE EDGE DETECTION (54)
SIGNAL PROCESSING ALGORITHMS (54)
SUPPORT VECTOR MACHINE CLASSIFICATION (54)
IMAGE PROCESSING (53)
IMAGE SEGMENTATION (53)
PIXEL (51)
PRINCIPAL COMPONENT ANALYSIS (51)
IMAGE RECOGNITION (50)
NEURAL NETWORKS (50)
TRANSFORMS (50)
COMPLEXITY THEORY (48)
COMPUTERS (45)
WAVELET TRANSFORMS (41)
NOISE REDUCTION (40)
SIGNAL TO NOISE RATIO (40)
CONFERENCES (38)
FACE RECOGNITION (38)
IMAGE COLOR ANALYSIS (38)
IMAGE CLASSIFICATION (37)
NEURAL NETWORK (37)
ACOUSTICS (36)
CHANNEL ESTIMATION (35)
DETECTORS (35)
SHAPE (35)
ELECTRONIC MAIL (34)
CHARACTER RECOGNITION (33)
REGRESSION ANALYSIS (33)
BACKPROPAGATION (32)
CONVERGENCE (32)
INDEXES (32)
PREDICTIVE MODELS (32)
SUPPORT VECTOR MACHINE (32)
ANALYTICAL MODELS (31)
CAMERAS (31)
FILTERING (31)
FILTERING THEORY (31)
IMAGE DENOISING (30)
MONITORING (30)
PREDICTION ALGORITHMS (30)
CLUSTERING ALGORITHMS (29)
FACE (29)
CLASSIFICATION (28)
SVM (28)
ADAPTATION MODEL (26)
APPROXIMATION METHODS (26)
DICTIONARIES (26)
GENETIC ALGORITHMS (26)
MEL FREQUENCY CEPSTRAL COEFFICIENT (26)
PRESSES (26)
RADIAL BASIS FUNCTION NETWORKS (26)
ARTIFICIAL INTELLIGENCE (25)
COMPUTER VISION (25)
REAL TIME SYSTEMS (25)
STATISTICAL ANALYSIS (25)
EIGENVALUES AND EIGENFUNCTIONS (24)
FUZZY SET THEORY (24)
IMAGE RECONSTRUCTION (24)
IMAGE RESOLUTION (24)
INTERFERENCE (24)
SPEECH PROCESSING (24)
ARTIFICIAL NEURAL NETWORK (23)
COVARIANCE MATRIX (23)
SENSORS (23)
SPEAKER RECOGNITION (23)
BIOLOGICAL SYSTEM MODELING (22)
GAUSSIAN PROCESSES (22)

INFONA - science communication portal
