Search results

Items from 101 to 120 out of 1,200 results


chapter

A pairwise algorithm for pitch estimation and speech separation using deep stacking network

Hui Zhang, Xueliang Zhang, Shuai Nie, Guanglai Gao, more

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 246 - 250

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pitch information is an important cue for speech separation. However, pitch estimation in noisy condition is also a task as challenging as speech separation. In this paper, we propose a supervised learning architecture which combines these two problems concisely. The proposed algorithm is based on deep stacking network (DSN) which provides a method of stacking simple processing modules in building...

chapter

Representation models in single channel source separation

Matthias Zohrer, Franz Pernkopf

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 713 - 717

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Model-based single-channel source separation (SCSS) is an ill-posed problem requiring source-specific prior knowledge. In this paper, we use representation learning and compare general stochastic networks (GSNs), Gauss Bernoulli restricted Boltzmann machines (GBRBMs), conditional Gauss Bernoulli restricted Boltzmann machines (CGBRBMs), and higher order contractive autoencoders (HCAEs) for modeling...

chapter

Leveraging automatic speech recognition in cochlear implants for improved speech intelligibility under reverberation

Oldooz Hazrati, Shabnam Ghaffarzadegan, John H.L. Hansen

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5093 - 5097

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Despite recent advancements in digital signal processing technology for cochlear implant (CI) devices, there still remains a significant gap between speech identification performance of CI users in reverberation compared to that in anechoic quiet conditions. Alternatively, automatic speech recognition (ASR) systems have seen significant improvements in recent years resulting in robust speech recognition...

chapter

Combining sparse NMF with deep neural network: A new classification-based approach for speech enhancement

Hung-Wei Tseng, Mingyi Hong, Zhi-Quan Luo

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 2145 - 2149

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this work, we consider enhancing a target speech from a single-channel noisy observation corrupted by non-stationary noises at low signal-to-noise ratios (SNRs). We take a classification-based approach, where the objective is to estimate an Ideal Binary Mask (IBM) that classifies each time-frequency (T-F) unit of the noisy observation into one of the two categories: speech-dominant unit or noise-dominant...

chapter

On automatic drum transcription using non-negative matrix deconvolution and itakura saito divergence

Axel Roebel, Jordi Pons, Marco Liuni, Mathieu Lagrange

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 414 - 418

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper presents an investigation into the detection and classification of drum sounds in polyphonic music and drum loops using non-negative matrix deconvolution (NMD) and the Itakura Saito divergence. The Itakura Saito divergence has recently been proposed as especially appropriate for decomposing audio spectra due to the fact that it is scale invariant, but it has not yet been widely adopted...

chapter

A novel ranking method for multiple classifier systems

Anurag Kumar, Bhiksha Raj

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1931 - 1935

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We introduce an unsupervised optimization method for optimal fusion of multiple classifiers in retrieval problems. The method is based on a ranking loss called the “clarity” index, which does not depend on the label of the test instances. The technique optimizes the weights with which individual classifier scores must be combined to maximize this clarity. Our method is instance-specific; the weights...

chapter

Bird-phrase segmentation and verification: A noise-robust template-based approach

Kantapon Kaewtip, Lee Ngee Tan, Charles E. Taylor, Abeer Alwan

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 758 - 762

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, we present a birdsong-phrase segmentation and verification algorithm that is robust to limited training data, class variability, and noise. The algorithm comprises a noise-robust, Dynamic-Time-Warping (DTW)-based segmentation and a discriminative classifier for outlier rejection. The algorithm utilizes DTW and prominent (high energy) time-frequency regions of training spectrograms to...

chapter

Weighted training for speech under Lombard Effect for speaker recognition

Muhammad Muneeb Saleem, Gang Liu, John H.L. Hansen

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4350 - 4354

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The presence of Lombard Effect in speech is proven to have severe effects on the performance of speech systems, especially speaker recognition. Varying kinds of Lombard speech are produced by speakers under influence of varying noise types [1]. This study proposes a high-accuracy classifier using deep neural networks for detecting various kinds of Lombard speech against neutral speech, independent...

chapter

Regularizing DNN acoustic models with Gaussian stochastic neurons

Hao Zhang, Yajie Miao, Florian Metze

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4964 - 4968

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Dropout and DropConnect can be viewed as regularization methods for deep neural network (DNN) training. In DNN acoustic modeling, the huge number of speech samples makes it expensive to sample the neuron mask (Dropout) or the weight mask (DropConnect) repetitively from a high dimensional distribution. In this paper we investigate the effect of Gaussian stochastic neurons on DNN acoustic modeling....

chapter

Deep NMF for speech separation

Jonathan Le Roux, John R. Hershey, Felix Weninger

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 66 - 70

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Non-negative matrix factorization (NMF) has been widely used for challenging single-channel audio source separation tasks. However, inference in NMF-based models relies on iterative inference methods, typically formulated as multiplicative updates. We propose “deep NMF”, a novel non-negative deep network architecture which results from unfolding the NMF iterations and untying its parameters. This...

chapter

Averaging random projection: A fast online solution for large-scale constrained stochastic optimization

Jialin Liu, Yuantao Gu, Mengdi Wang

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 3586 - 3590

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Stochastic optimization finds wide application in signal processing, online learning, and network problems, especially problems processing large-scale data. We propose an Incremental Constraint Averaging Projection Method (ICAPM) that is tailored to optimization problems involving a large number of constraints. The ICAPM makes fast updates by taking sample gradients and averaging over random constraint...

chapter

Approximate best linear unbiased channel estimation with CFAR detection for frequency selective sparse multipath channels with long delay spreads

Serdar Ozen

2005 13th European Signal Processing Conference > 1 - 4

2005 13th European Signal Processing Conference

We provide a non-iterative channel impulse response (CIR) estimation algorithm for communication systems which utilize a periodically transmitted training sequence within a continuous stream of information symbols. The non-iterative channel estimate is an approximation to the Best Linear Unbiased Estimate (BLUE) of the CIR, achieving almost similar performance, with much lower complexity. We first...

chapter

Multimodal biometric score fusion: The Mean Rule vs. support vector classifiers

Sonia Garcia-Salicetti, Mohamed Anouar Mellakh, Lorene Allano, Bernadette Dorizzi

2005 13th European Signal Processing Conference > 1 - 4

2005 13th European Signal Processing Conference

Recently, a discrepancy in results has appeared in the literature concerning score fusion methods, classified in “combination methods” and “classification methods” [1]. Some works suggest that a simple Arithmetic Mean Rule (AMR) can outperform some training-based methods on multimodal data [2], while others favour, among other trained classifiers, a Support Vector Machine [3]. This paper makes a comparative...

chapter

Noise power spectral density estimation from noisy speech using on-line trained hidden Markov models

Karsten Vandborg Sorensen, Soren Vang Andersen

2005 13th European Signal Processing Conference > 1 - 4

2005 13th European Signal Processing Conference

In this paper we describe a method for estimation of noise power spectral densities from a noisy speech signal. The method is used in conjunction with a time-frequency domain speech presence detection method that provides connected time-frequency regions of each decision type. In speech absence regions hidden Markov models are trained on-line and in speech presence regions the trained models are used...

chapter

Sparse Representation Based Distributed Multisensor Track Fusion

Wang Huan, Sun Jinping

2015 2nd International Conference on Information Science and Control Engineering > 476 - 479

2015 2nd International Conference on Information Science and Control Engineering (ICISCE)

In the distributed multisensory information fusion system, each local sensor independently forms local tracks, and multisensory track fusion refers to fusing multiple local tracks that represent the same target into one global track. By studying the theory of multisensory track fusion and signal sparse representation, a sparse representation based multisensory track fusion algorithm is proposed. This...

chapter

Solving optimization tasks in construction of FDI systems: An evolutionary approach

A. Obuchowicz, J. Korbicz

2001 European Control Conference (ECC) > 1647 - 1652

2001 European Control Conference (ECC)

Model-based FDI systems are considered here. The problem of constructing the diagnosed system model as well as the automatic search for the best rule base of the residual analyzer is reduced to a set of global optimization tasks. Various optimization problems are considered depending on the chosen technology of the non-analytical model construction as well as that of the residual evaluation. Most...

chapter

Robust decision feedback equalizer design via the solution of a regularized least squares problem

P.R. Fraanje, M. Verhaegen, N.J. Doelman

2001 European Control Conference (ECC) > 906 - 911

2001 European Control Conference (ECC)

This paper presents a method to estimate a Decision Feedback Equalizer (DFE) directly from training data, which is robust w.r.t. time-variations in the communication channel. It is based on the indirect method proposed in [15], where the time variations in the channel are modeled as a probabilistic uncertainty. The robust DFE optimizes the performance by minimizing the mean squared error averaged...

chapter

Using models of the Human Visual System in the design of stack filters for the enhancement of color images

Jen Huang, Edward J. Coyle

2000 10th European Signal Processing Conference > 1 - 4

2000 10th European Signal Processing Conference

A technique is developed for utilizing models of the Human Visual System to improve the design of filters for the enhancement of color images. The technique uses an image fidelity measure based on models of the human visual system — such as the Visible Differences Predictor (VDP) — in a nested loop training algorithm. In the inner loop of the algorithm, a stack filter is trained under a Weighted Mean...

chapter

Anatomical structure labeling in apical four-chamber view echocardiogram images

Yu Cao, Colin B Compas, Hongzhi Wang, Tanveer F Syeda-Mahmood

Computing in Cardiology 2014 > 317 - 320

2014 Computing in Cardiology Conference (CinC)

Anatomical structure labeling in echocardiogram images will assist cardiac disease diagnosis by providing a framework for doing geometrical statistics. General labeling algorithms often focus on stationary body structures and do not perform well in echocardiography due to cardiac motion, low signal to noise ratio, and structural deformation caused by diseases. In this paper, we propose a new method...

article

Highly Efficient Known-Plaintext Attacks Against Orthogonal Blinding Based Physical Layer Security

Yao Zheng, Matthias Schulz, Wenjing Lou, Y. Thomas Hou, more

IEEE Wireless Communications Letters > 2015 > 4 > 1 > 34 - 37

In this letter, we describe highly effective known-plaintext attacks against physical layer security schemes. We substantially reduce the amount of required known-plaintext symbols and lower the symbol error rate (SER) for the attacker. In particular, we analyze the security of orthogonal blinding schemes that disturb an eavesdropper's signal reception using artificial noise transmission. We improve...


Data set: ieee
Keywords: NOISE, TRAINING


INFONA - science communication portal
