Search results

Items from 141 to 160 out of 1,206 results

1 ...
5
6
7
8
9
10
11

chapter

Performance evaluation of mixtures of PLDA and conventional PLDA for a small-set speaker verification system

Qianhui Wan, Martin Bouchard

2017 IEEE 30th Canadian Conference on Electrical and Computer Engineering (CCECE) > 1 - 4

2017 IEEE 30th Canadian Conference on Electrical and Computer Engineering (CCECE)

This paper compares the use of signal to noise ratio (SNR)-dependent and SNR-independent mixtures of probabilistic linear discriminant analysis (PLDA) versus conventional PLDA, under multi-noise and multi-SNR conditions for a small-set speaker verification system. Results indicate that conventional PLDA is more robust under multi-SNR conditions. The effect of the testing speech length is also examined...

chapter

Handcrafted features vs ConvNets in 2D echocardiographic images

C. Raynaud, H. Langet, M.S. Amzulescu, E. Saloux, more

2017 IEEE 14th International Symposium on Biomedical Imaging (ISBI 2017) > 1116 - 1119

2017 IEEE 14th International Symposium on Biomedical Imaging (ISBI 2017)

In this paper, we address the problem of automated pose classification and segmentation of the left ventricle (LV) in 2D echocardiographic images. For this purpose, we compare two complementary approaches. The first one is based on engineering ad-hoc features according to the traditional machine learning paradigm. Namely, we extract phase features to build an unsupervised LV pose estimator, as well...

chapter

Learning size adaptive local maxima selection for robust nuclei detection in histopathology images

N. Brieu, G. Schmidt

2017 IEEE 14th International Symposium on Biomedical Imaging (ISBI 2017) > 937 - 941

2017 IEEE 14th International Symposium on Biomedical Imaging (ISBI 2017)

The detection of cells and nuclei is a crucial step for the automatic analysis of digital pathology slides and as such for the quantification of the phenotypic information contained in tissue sections. This task is however challenging because of high variability in size, shape and textural appearance of the objects to be detected and of the high variability of tissue appearance. In this work, we propose...

chapter

LTD-RBM: Robust and Fast Latent Truth Discovery Using Restricted Boltzmann Machines

Klaus Broelemann, Thomas Gottron, Gjergji Kasneci

2017 IEEE 33rd International Conference on Data Engineering (ICDE) > 143 - 146

2017 IEEE 33rd International Conference on Data Engineering (ICDE)

We address the problem of latent truth discovery, LTD for short, where the goal is to discover the underlying true values of entity attributes in the presence of noisy, conflicting or incomplete information. Despite a multitude of algorithms addressing the LTD problem, only little is known about their overall performance with respect to effectiveness, efficiency and robustness. The LTD model proposed...

chapter

Weighted non-negative sparse low-rank representation classification

Jingshan Li, Caikou Chen, Xielian Hou, Tianchen Dai, more

2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC) > 2153 - 2157

2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC)

In the calculation of rank minimization, the non-negative sparse low-rank representation classification (NSLRRC) regularizes nuclear norm's each singular value equally, but this limits its flexibility and ability to solve many practical problems, where the singular values with clear physical meanings ought to be treated differently. In this paper, a weighted non-negative sparse low-rank representation...

chapter

Local similarity and community paradigm: The robust methods toward link prediction

Ratha Pech, Hao Dong

2017 IEEE 2nd International Conference on Big Data Analysis (ICBDA)( > 827 - 831

2017 IEEE 2nd International Conference on Big Data Analysis (ICBDA)

In this work, we propose a method collaborating the local similarity and local community paradigm with a tunable parameter to balance the contribution of the energy from these two sources. We show that local similarity e.g., common neighbors and local community paradigm e.g., local community links both play significant roles in network evolution; therefore, one cannot ignore or penalize anyone of...

chapter

Rotation invariance through structured sparsity for robust hyperspectral image classification

Saurabh Prasad, Demetrio Labate, Minshan Cui, Yuhang Zhang

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6205 - 6209

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Sparse representation based classification has gained popularity with geospatial image analysis in general and hyperspectral image analysis in particular. A central idea with such classification approaches is that a test pixel (spectral reflectance vector) can be sparsely represented in a training dictionary of pixels from all classes - in particular, only training pixels in the dictionary that bear...

chapter

Robust and compact video descriptor learned by deep neural network

Yue Nan Li, Xue Piao Chen

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 2162 - 2166

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, we propose to extract robust video descriptor by training deep neural network to automatically capture the intrinsic visual characteristics of digital video. More specifically, we first train a conditional generative model to capture the spatio-temporal correlations among visual contents and represent them as an intermediate descriptor. A nonlinear encoder, with the functions of dimension...

chapter

Learning concepts through conversations in spoken dialogue systems

Robin Jia, Larry Heck, Dilek Hakkani-Tur, Georgi Nikolov

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5725 - 5729

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Spoken dialogue systems must be able to recover gracefully from unexpected user inputs. In many cases, these unexpected utterances may be within the scope of the system, but include previously unseen phrases that the system cannot interpret. In this work, we augment a spoken dialogue system with the ability to learn about new concepts by conversing with the user in natural language. We present a novel...

chapter

Encoder-decoder with focus-mechanism for sequence labelling based spoken language understanding

Su Zhu, Kai Yu

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5675 - 5679

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper investigates the framework of encoder-decoder with attention for sequence labelling based spoken language understanding. We introduce Bidirectional Long Short Term Memory - Long Short Term Memory networks (BLSTM-LSTM) as the encoder-decoder model to fully utilize the power of deep learning. In the sequence labelling task, the input and output sequences are aligned word by word, while the...

chapter

Robust linear discriminant analysis with a Laplacian assumption on projection distribution

Shujian Yu, Zheng Cao, Xiubao Jiang

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 2567 - 2571

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Linear discriminant analysis (LDA) is typically carried out using Fisher's method, which relies heavily on the estimation of sample mean vectors and covariance matrices. However, Fisher LDA is vulnerable to outliers as it happens to other multivariate statistical methods. In this paper, we analyzed the optimal discriminant design based on the criterion of minimizing total misclassification rate, assuming...

chapter

Ensemble feature selection for domain adaptation in speech emotion recognition

Mohammed Abdelwahab, Carlos Busso

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5000 - 5004

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

When emotion recognition systems are used in new domains, the classification performance usually drops due to mismatches between training and testing conditions. Annotations of new data in the new domain is expensive and time demanding. Therefore, it is important to design strategies that efficiently use limited amount of new data to improve the robustness of the classification system. The use of...

chapter

Facial attractiveness prediction using psychologically inspired convolutional neural network (PI-CNN)

Jie Xu, Lianwen Jin, Lingyu Liang, Ziyong Feng, more

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1657 - 1661

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper proposes a psychologically inspired convolutional neural network (PI-CNN) to achieve automatic facial beauty prediction. Different from the previous methods, the PI-CNN is a hierarchical model that facilitates both the facial beauty representation learning and predictor training. Inspired by the recent psychological studies, significant appearance features of facial detail, lighting and...

chapter

On time-frequency mask estimation for MVDR beamforming with application in robust speech recognition

Xiong Xiao, Shengkui Zhao, Douglas L. Jones, Eng Siong Chng, more

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 3246 - 3250

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Acoustic beamforming has played a key role in the robust automatic speech recognition (ASR) applications. Accurate estimates of the speech and noise spatial covariance matrices (SCM) are crucial for successfully applying the minimum variance distortionless response (MVDR) beamforming. Reliable estimation of time-frequency (TF) masks can improve the estimation of the SCMs and significantly improve...

chapter

Deep multi-view robust representation learning

Zhenyu Jiao, Chao Xu

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 2851 - 2855

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Multi-view representations are widely existed in practical applications, the quality of latent representation learned from multi-view observations often suffer from noise and outliers in original data. In this work, we propose an auto encoder based deep multi-view robust representation learning (DMRRL) algorithm, which can learn a shared representation from multi-view observations and the algorithm...

chapter

An investigation into learning effective speaker subspaces for robust unsupervised DNN adaptation

Lahiru Samarakoon, Khe Chai Sim, Brian Mak

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5035 - 5039

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Subspace methods are used for deep neural network (DNN)-based acoustic model adaptation. These methods first construct a subspace and then perform the speaker adaptation as a point in the subspace. This paper aims to investigate the effectiveness of subspace methods for robust unsupervised adaptation. For the analysis, we compare two state-of-the-art subspace methods, namely, the singular value decomposition...

chapter

Human action recognition using Adaptive Hierarchical Depth Motion Maps and Gabor filter

Hong Liu, Qinqin He, Mengyuan Liu

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1432 - 1436

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Depth motion maps (DMMs) have shown effectiveness in human action recognition, however, they lose the temporal information and suffer from intra-class variations caused by action speed variations. To address these challenges, we propose a novel method for human action recognition. Firstly, Adaptive Hierarchical Depth Motion Maps (AH-DMMs) are calculated over temporal hierarchical windows of video...

chapter

Solving Occlusion Problem in Pedestrian Detection by Constructing Discriminative Part Layers

Cong Cao, Yu Wang, Jien Kato, Guanwen Zhang, more

2017 IEEE Winter Conference on Applications of Computer Vision (WACV) > 91 - 99

2017 IEEE Winter Conference on Applications of Computer Vision (WACV)

Occlusion handling is one of the most challenging issues for pedestrian detection, and no satisfactory achievement has been found in this issue yet. Using human body parts has been considered as a reasonable way to overcome such an issue. In this paper, we propose a brand new approach based on the fusion of Mid-level body part mining and Convolutional Neural Network (CNN) to solve this problem, named...

chapter

DeepText: A new approach for text proposal generation and text detection in natural images

Zhuoyao Zhong, Lianwen Jin, Shuangping Huang

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1208 - 1212

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, we develop a new approach called DeepText for text region proposal generation and text detection in natural images via a fully convolutional neural network (CNN). First, we propose the novel inception region proposal network (Inception-RPN), which slides an inception network with multi-scale windows over the top of convolutional feature maps and associates a set of text characteristic...

chapter

Scale selective extended local binary pattern for texture classification

Yuting Hu, Zhiling Long, Ghassan AlRegib

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1413 - 1417

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, we propose a new texture descriptor, scale selective extended local binary pattern (SSELBP), to characterize texture images with scale variations. We first utilize multi-scale extended local binary patterns (ELBP) with rotation-invariant and uniform mappings to capture robust local microand macro-features. Then, we build a scale space using Gaussian filters and calculate the histogram...

1 ...
5
6
7
8
9
10
11

Keywords:
TRAINING
ROBUSTNESS

Publication date

Set your own date range

Content availability

Available (1,201)
None (5)

Keywords

FEATURE EXTRACTION (372)
SUPPORT VECTOR MACHINES (179)
DATABASES (157)
ACCURACY (156)
FACE (127)
FACE RECOGNITION (119)
ARTIFICIAL NEURAL NETWORKS (114)
TESTING (114)
ESTIMATION (103)
NOISE (103)
SHAPE (95)
DETECTORS (94)
COMPUTATIONAL MODELING (92)
PATTERN RECOGNITION (92)
CLASSIFICATION ALGORITHMS (91)
DATA MINING (89)
SIGNAL PROCESSING (86)
VISUALIZATION (86)
OPTIMIZATION (85)
TRAINING DATA (84)
KERNEL (77)
PRINCIPAL COMPONENT ANALYSIS (77)
ALGORITHM DESIGN AND ANALYSIS (76)
DATA MODELS (75)
MATHEMATICAL MODEL (75)
NEURAL NETWORKS (74)
IMAGE COLOR ANALYSIS (73)
SPEECH (72)
CAMERAS (71)
VECTORS (71)
LIGHTING (70)
IMAGE SEGMENTATION (69)
OBJECT DETECTION (69)
COMPUTER VISION (68)
IMAGE RECOGNITION (68)
HIDDEN MARKOV MODELS (66)
MACHINE LEARNING (66)
EQUATIONS (62)
IMAGE CLASSIFICATION (62)
SIGNAL PROCESSING ALGORITHMS (61)
HISTOGRAMS (57)
LEARNING (ARTIFICIAL INTELLIGENCE) (56)
NOISE MEASUREMENT (54)
CONFERENCES (53)
IMAGE EDGE DETECTION (51)
CORRELATION (50)
DICTIONARIES (49)
SPEECH RECOGNITION (49)
TARGET TRACKING (49)
EDUCATIONAL INSTITUTIONS (44)
TRANSFORMS (44)
OBJECT RECOGNITION (43)
COMPUTERS (42)
PATTERN CLASSIFICATION (42)
SUPPORT VECTOR MACHINE CLASSIFICATION (42)
PIXEL (39)
SIGNAL TO NOISE RATIO (39)
BOOSTING (38)
IMAGE PROCESSING (38)
IMAGE RECONSTRUCTION (38)
COMPLEXITY THEORY (37)
MEASUREMENT (35)
TRACKING (35)
IMAGE RESOLUTION (34)
CLASSIFICATION (33)
ELECTRONIC MAIL (32)
STANDARDS (32)
ADAPTATION MODELS (31)
SOFTWARE (31)
ACOUSTICS (30)
CLUSTERING ALGORITHMS (30)
IMAGE SEQUENCES (30)
INDEXES (29)
REAL TIME SYSTEMS (29)
SUPPORT VECTOR MACHINE (29)
CONVOLUTION (27)
ENCODING (27)
IMAGE CODING (27)
NEURAL NETS (27)
ADAPTATION MODEL (26)
PREDICTIVE MODELS (26)
SPARSE REPRESENTATION (26)
BIOLOGICAL SYSTEM MODELING (25)
THREE-DIMENSIONAL DISPLAYS (25)
ANALYTICAL MODELS (24)
APPROXIMATION METHODS (23)
CONVERGENCE (23)
LABELING (23)
LEARNING SYSTEMS (23)
NEURAL NETWORK (23)
NEURONS (23)
PREDICTION ALGORITHMS (23)
PRESSES (23)
UNCERTAINTY (23)
WATERMARKING (23)
WAVELET TRANSFORMS (23)
BAYES METHODS (22)
COMPUTER ARCHITECTURE (22)
more

INFONA - science communication portal

Search results

Performance evaluation of mixtures of PLDA and conventional PLDA for a small-set speaker verification system

Handcrafted features vs ConvNets in 2D echocardiographic images

Learning size adaptive local maxima selection for robust nuclei detection in histopathology images

LTD-RBM: Robust and Fast Latent Truth Discovery Using Restricted Boltzmann Machines

Weighted non-negative sparse low-rank representation classification

Local similarity and community paradigm: The robust methods toward link prediction

Rotation invariance through structured sparsity for robust hyperspectral image classification

Robust and compact video descriptor learned by deep neural network

Learning concepts through conversations in spoken dialogue systems

Encoder-decoder with focus-mechanism for sequence labelling based spoken language understanding

Robust linear discriminant analysis with a Laplacian assumption on projection distribution

Ensemble feature selection for domain adaptation in speech emotion recognition

Facial attractiveness prediction using psychologically inspired convolutional neural network (PI-CNN)

On time-frequency mask estimation for MVDR beamforming with application in robust speech recognition

Deep multi-view robust representation learning

An investigation into learning effective speaker subspaces for robust unsupervised DNN adaptation

Human action recognition using Adaptive Hierarchical Depth Motion Maps and Gabor filter

Solving Occlusion Problem in Pedestrian Detection by Constructing Discriminative Part Layers

DeepText: A new approach for text proposal generation and text detection in natural images

Scale selective extended local binary pattern for texture classification

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options