ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Items from 81 to 100 out of 1,693 results

chapter

Reduction of acoustic model training time and required data passes via stochastic approaches to maximum likelihood and discriminative training

Petr Novak, Roman Otec, Antonio Lee, Vaibhava Goel

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5577 - 5581

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The recent boom in use of speech recognition technology has made the access to potentially large amounts of training data easier. This, however, also constitutes a challenge in processing such large, continuously growing amount of information. Here we present a stochastic modification of traditional iterative training approach which leads to the same or even better accuracy of acoustic models and...

chapter

Projection onto the cosparse set is NP-hard

Andreas M. Tillmann, Remi Gribonval, Marc E. Pfetsch

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 7148 - 7152

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The computational complexity of a problem arising in the context of sparse optimization is considered, namely, the projection onto the set of k-cosparse vectors w.r.t. some given matrix Ω. It is shown that this projection problem is (strongly) NP-hard, even in the special cases in which the matrix Ω contains only ternary or bipolar coefficients. Interestingly, this is in contrast to the projection...

chapter

A low-complexity and lossless reference frame encoder algorithm for video coding

Dieison Silveira, Guilherme Povala, Livia Amaral, Bruno Zatt, more

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 7358 - 7362

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper presents a lossless coding solution to reduce the large overhead of external memory communication during the motion estimation process in current video coders. Our solution is called Differential Reference Frame Coder (DRFC), and uses two techniques together to compress the reference frame: a differential coding based on a simplified intra-prediction process to reduce the spatial redundancy...

chapter

Smart decoder: A new paradigm for video coding

D-K. Vo-Nguyen, J. Jung, J-M. Thiesse, M. Antonini

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 7382 - 7386

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The coding efficiency of the new video coding standard, High Efficiency Video Coding (HEVC), is strongly associated with better use of spatio-temporal redundancies thanks to an increased number of competing coding modes. However, this competition involves a massive increase in signaling bitrate which becomes a possible limit for the next generation of encoder. This paper proposesa new coding scheme...

chapter

Joint inter-intra prediction based on mode-variant and edge-directed weighting approaches in video coding

Yue Chen, Debargha Mukherjee, Jingning Ean, Kenneth Rose

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 7372 - 7376

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Most modern video compression codecs, like VP9, HEVC and H.264, encode square or rectangular blocks either by inter prediction or intra prediction. A joint inter-intra predictor that combines motion compensation and intra extrapolation by two novel weighting schemes is proposed to improve compression quality. Prior work on joint prediction employs inter-intra weights that only rely on the pixel locations...

chapter

Intervention framework for counteracting collusion in spectrum leasing systems

Juan J. Alcaraz, Mihaela van der Schaar

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 7318 - 7322

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We consider a spectrum leasing system in which secondary networks offer offload services to a primary network (PN) in exchange of temporary access to the PN's spectrum. When the SANs collude and coordinate their prices, forming a cartel, the PN experiences cartel overcharge, which in our scenario implies lower transmission rates for the serviced PUs. To protect the spectrum owner's interests and possibly...

chapter

A convex-optimization framework for frame-level optimal rate allocation in predictive video coding

Aniello Fiengo, Giovanni Chierchia, Marco Cagnazzo, Beatrice Pesquet-Popescu

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 7328 - 7332

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Optimal rate allocation is among the most challenging tasks to perform in the context of predictive video coding, because of the dependencies between frames induced by motion compensation. In this paper, we derive an analytical rate-distortion model that explicitly takes into account the dependencies between frames. The proposed approach allows us to formulate the frame-level optimal rate allocation...

chapter

Outlier removal for improved source estimation in atmospheric inverse problems

Marta Martinez-Camara, Martin Vetterli, Andreas Stohl

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6820 - 6824

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Estimation of the quantities of harmful substances emitted into the atmosphere is one of the main challenges in modern environmental sciences. In most of the cases, this estimation requires solving a linear inverse problem. A key difficulty in evaluating the performance of any algorithm to solve this linear inverse problem is that the ground truth is typically unknown. In this paper we show that the...

chapter

Automatic language identification using deep neural networks

Ignacio Lopez-Moreno, Javier Gonzalez-Dominguez, Oldrich Plchot, David Martinez, more

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5337 - 5341

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This work studies the use of deep neural networks (DNNs) to address automatic language identification (LID). Motivated by their recent success in acoustic modelling, we adapt DNNs to the problem of identifying the language of a given spoken utterance from short-term acoustic features. The proposed approach is compared to state-of-the-art i-vector based acoustic systems on two different datasets: Google...

chapter

New bivariate statistical model of natural image correlations

Che-Chun Su, Lawrence K. Cormack, Alan C. Bovik

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5362 - 5366

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We perform bivariate statistical analysis and modeling of the joint distributions of spatially adjacent sub-band responses for both luminance/chrominance and range data in natural scenes. In particular, we introduce a multivariate generalized Gaussian distribution and an exponentiated sine function to model the underlying statistics and correlations. The experimental results show that the bivariate...

chapter

Sequence classification using the high-level features extracted from deep neural networks

Li Deng, Jianshu Chen

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6844 - 6848

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The recent success of deep neural networks (DNNs) in speech recognition can be attributed largely to their ability to extract a specific form of high-level features from raw acoustic data for subsequent sequence classification or recognition tasks. Among the many possible forms of DNN features, what forms are more useful than others and how effective these DNN features are in connection with the different...

chapter

Improvements to filterbank and delta learning within a deep neural network framework

Tara N. Sainath, Brian Kingsbury, Abdel-rahman Mohamed, George Saon, more

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6839 - 6843

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Many features used in speech recognition tasks are hand-crafted and are not always related to the objective at hand, that is minimizing word error rate. Recently, we showed that replacing a perceptually motivated mel-filter bank with a filter bank layer that is learned jointly with the rest of a deep neural network was promising. In this paper, we extend filter learning to a speaker-adapted, state-of-the-art...

chapter

Physics-based sea clutter model for improved detection of low radar cross-section targets

Brian O'Donnell, Richard LeBaron, Rodolfo Diaz, Antonia Papandreou-Suppappola

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6830 - 6833

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper studies the challenging problem of detecting a low radar cross-section target in heavy sea clutter by proposing a physics-based sea clutter generation model. The model includes a process that generates random dynamic sea clutter based on the governing physics of water gravity and capillary waves and a finite-difference time-domain electromagnetics simulations process based on Maxwells equations...

chapter

Tactile tomographic fluid-flow imaging with a robotic whisker array

Cagdas Tuna, Douglas L. Jones, Farzad Kamalabadi

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6815 - 6819

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Current sensory array systems do not fully exploit tactile sensing strategies widely used by vibrissal sensing animals to explore their surroundings. We develop a new tactile fluid-flow imaging technique, which relates rat's whisker movements to tomographic imaging to extract fluid-flow characteristics with a robotic whisker array. At high Reynolds numbers, the drag force on a whisker segment is proportional...

chapter

Sub-Nyquist sampling of OFDM signals for cognitive radios

Tom Zahavy, Oran Shayer, Deborah Cohen, Alex Tolmachev, more

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 8092 - 8096

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We investigate sampling and detection of orthogonal frequency-division multiplexing (OFDM) signals with unknown carriers at sub-Nyquist rates. Efficient acquisition and processing of such broadcast signals is a challenge but constitutes a crucial part of enabling cognitive radios. In order to alleviate both the analog and digital burden when treating wideband signals, we adapt the modulated wideband...

chapter

Dynamic upstream power back-off for mixtures of vectored and non-vectored VDSL

Ming-Yang Chen, Georgios Ginis, Mehdi Mohseni

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 8068 - 8072

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Far-end crosstalk severely degrades upstream rates in mixtures of vectored and non-vectored Very high-speed Digital Subscriber Loops (VDSL). As replacement of non-vectored VDSL systems by vectored VDSL systems is expected to be gradual, a crucial problem is the upstream rate optimization of vectored lines while maintaining the rate targets of non-vectored lines. To address this problem, this paper...

chapter

A clustering approach for detecting moving objects captured by a moving aerial camera

Joseph DeGol, Myra Nam

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6538 - 6542

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We propose a novel approach to motion detection in scenes captured from a camera onboard an aerial vehicle. In particular, we are interested in detecting small objects such as cars or people that move slowly and independently in the scene. Slow motion detection in an aerial video is challenging because it is difficult to differentiate object motion from camera motion. We adopt an unsupervised learning...

chapter

Video background subtraction using semi-supervised robust matrix completion

Hassan Mansour, Anthony Vetro

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6528 - 6532

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We propose a factorized robust matrix completion (FRMC) algorithm with global motion compensation to solve the video background subtraction problem. The algorithm decomposes a sequence of video frames into the sum of a low rank background component and a sparse motion component. The algorithm alternates between the solution of each component following a Pareto curve trajectory for each subproblem...

chapter

Visual object tracking via random ferns based classification

Acharya K. Aniruddha, R. Venkatesh Babu

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6533 - 6537

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Designing a robust algorithm for visual object tracking has been a challenging task since many years. There are trackers in the literature that are reasonably accurate for many tracking scenarios but most of them are computationally expensive. This narrows down their applicability as many tracking applications demand real time response. In this paper, we present a tracker based on random ferns. Tracking...

chapter

Automatic foreground extraction in video

Haoqian Wang, Bowen Deng, Kai Li, Yongbing Zhang, more

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6553 - 6557

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper presents an automatic and efficient system for extracting dynamic objects of interest from videos. We take advantage of a saliency map and an optimization-based segmentation algorithm to extract the foreground objects automatically in some key frames. Then, the segmentation results in those key frames are propagated to other frames via an error map-based propagation scheme. Finally, a Bayesian...

Publication date

Set your own date range

Content availability

Available (1,692)
None (1)

Keywords

SPEECH RECOGNITION (35)
COMPRESSED SENSING (34)
SPARSITY (26)
DEEP NEURAL NETWORKS (24)
COMPRESSIVE SENSING (22)
DEEP NEURAL NETWORK (21)
CLASSIFICATION (20)
SPARSE REPRESENTATION (20)
SPEECH ENHANCEMENT (20)
CONVEX OPTIMIZATION (19)
DICTIONARY LEARNING (19)
SPEAKER RECOGNITION (18)
AUTOMATIC SPEECH RECOGNITION (17)
BEAMFORMING (14)
COGNITIVE RADIO (14)
DEEP LEARNING (14)
I-VECTOR (14)
NON-NEGATIVE MATRIX FACTORIZATION (13)
MIMO (12)
NEURAL NETWORKS (12)
SPEECH SYNTHESIS (12)
CLUSTERING (11)
INTERFERENCE ALIGNMENT (11)
MUSIC INFORMATION RETRIEVAL (11)
OPTIMIZATION (11)
SPARSE CODING (11)
SPOKEN TERM DETECTION (11)
HEVC (10)
HIDDEN MARKOV MODEL (10)
I-VECTORS (10)
OFDM (10)
SOURCE SEPARATION (10)
SPEAKER ADAPTATION (10)
SPEAKER VERIFICATION (10)
ADAPTIVE FILTERING (9)
CHANNEL ESTIMATION (9)
DETECTION (9)
DISTRIBUTED ESTIMATION (9)
EEG (9)
KALMAN FILTER (9)
NONNEGATIVE MATRIX FACTORIZATION (9)
VOICE CONVERSION (9)
BLIND SOURCE SEPARATION (8)
FACE RECOGNITION (8)
GAUSSIAN MIXTURE MODEL (8)
HMM (8)
KERNEL METHODS (8)
KEYWORD SEARCH (8)
NOISE REDUCTION (8)
NOISE ROBUSTNESS (8)
ROBUSTNESS (8)
SPARSE RECOVERY (8)
SPEECH ANALYSIS (8)
TRACKING (8)
WIRELESS SENSOR NETWORKS (8)
ARRAY PROCESSING (7)
CONSENSUS (7)
ELECTROENCEPHALOGRAPHY (7)
EMOTION RECOGNITION (7)
FEATURE EXTRACTION (7)
GAUSSIAN PROCESS (7)
HIDDEN MARKOV MODELS (7)
HMM-BASED SPEECH SYNTHESIS (7)
KEYWORD SPOTTING (7)
LINEAR PREDICTION (7)
MACHINE LEARNING (7)
MAXIMUM LIKELIHOOD (7)
PLDA (7)
RECURRENT NEURAL NETWORKS (7)
ROBUST SPEECH RECOGNITION (7)
SEGMENTATION (7)
SOURCE LOCALIZATION (7)
SPARSE REPRESENTATIONS (7)
SPECTRAL ESTIMATION (7)
SPECTRUM SENSING (7)
SPEECH SEPARATION (7)
SUPER-RESOLUTION (7)
UNSUPERVISED LEARNING (7)
ACOUSTIC MODELING (6)
ALTERNATING DIRECTION METHOD OF MULTIPLIERS (6)
ARRAY SIGNAL PROCESSING (6)
DENOISING (6)
DISCRIMINATIVE TRAINING (6)
ENTROPY (6)
ESTIMATION (6)
INDEPENDENT COMPONENT ANALYSIS (6)
INVERSE PROBLEM (6)
LOCALIZATION (6)
MICROPHONE ARRAY (6)
MICROPHONE ARRAYS (6)
MIMO RADAR (6)
MULTITASK LEARNING (6)
PARALLEL PROCESSING (6)
PARAMETER ESTIMATION (6)
PARTICLE FILTER (6)
PARTICLE FILTERING (6)
PARTICLE FILTERS (6)
REVERBERATION (6)
SCORE NORMALIZATION (6)
SIGNAL RECONSTRUCTION (6)
more

INFONA - science communication portal

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Reduction of acoustic model training time and required data passes via stochastic approaches to maximum likelihood and discriminative training

Projection onto the cosparse set is NP-hard

A low-complexity and lossless reference frame encoder algorithm for video coding

Smart decoder: A new paradigm for video coding

Joint inter-intra prediction based on mode-variant and edge-directed weighting approaches in video coding

Intervention framework for counteracting collusion in spectrum leasing systems

A convex-optimization framework for frame-level optimal rate allocation in predictive video coding

Outlier removal for improved source estimation in atmospheric inverse problems

Automatic language identification using deep neural networks

New bivariate statistical model of natural image correlations

Sequence classification using the high-level features extracted from deep neural networks

Improvements to filterbank and delta learning within a deep neural network framework

Physics-based sea clutter model for improved detection of low radar cross-section targets

Tactile tomographic fluid-flow imaging with a robotic whisker array

Sub-Nyquist sampling of OFDM signals for cognitive radios

Dynamic upstream power back-off for mixtures of vectored and non-vectored VDSL

A clustering approach for detecting moving objects captured by a moving aerial camera

Video background subtraction using semi-supervised robust matrix completion

Visual object tracking via random ferns based classification

Automatic foreground extraction in video

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)