Search results

chapter

A Sparse Coding Approach to RUL Prediction in Rolling Bearing

Huaxin Li, Yanxue Wang

2017 International Conference on Sensing, Diagnostics, Prognostics, and Control (SDPC) > 174 - 179

Rolling element bearings are among the most frequently encountered components in the majority of rotating machines. Thus, prognostics and health management (PHM) of rolling bearings plays an important role in the working status of the machine system. Remaining useful life (RUL) prediction is the core of PHM. It is well known that the original auto-regression (AR) model is suitable for the prediction of linear...

chapter

Zero-Shot Classification with Discriminative Semantic Representation Learning

Meng Ye, Yuhong Guo

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5103 - 5111

Zero-shot learning, a special case of unsupervised domain adaptation where the source and target domains have disjoint label spaces, has become increasingly popular in the computer vision community. In this paper, we propose a novel zero-shot learning method based on discriminative sparse non-negative matrix factorization. The proposed approach aims to identify a set of common high-level semantic...

chapter

Straight to Shapes: Real-Time Detection of Encoded Shapes

Saumya Jetley, Michael Sapienza, Stuart Golodetz, Philip H. S. Torr

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4207 - 4216

Current object detection approaches predict bounding boxes that provide little instance-specific information beyond location, scale and aspect ratio. In this work, we propose to regress directly to objects' shapes in addition to their bounding boxes and categories. It is crucial to find an appropriate shape representation that is compact and decodable, and in which objects can be compared for higher-order...

chapter

Semantic Image Inpainting with Deep Generative Models

Raymond A. Yeh, Chen Chen, Teck Yian Lim, Alexander G. Schwing, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6882 - 6890

Semantic image inpainting is a challenging task where large missing regions have to be filled based on the available visual data. Existing methods which extract information from only a single image generally produce unsatisfactory results due to the lack of high level context. In this paper, we propose a novel method for semantic image inpainting, which generates the missing content by conditioning...

chapter

Local Binary Convolutional Neural Networks

Felix Juefei-Xu, Vishnu Naresh Boddeti, Marios Savvides

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4284 - 4293

We propose local binary convolution (LBC), an efficient alternative to convolutional layers in standard convolutional neural networks (CNN). The design principles of LBC are motivated by local binary patterns (LBP). The LBC layer comprises a set of fixed sparse pre-defined binary convolutional filters that are not updated during the training process, a non-linear activation function and a set of...

chapter

Deep Quantization: Encoding Convolutional Activations with Deep Generative Model

Zhaofan Qiu, Ting Yao, Tao Mei

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4085 - 4094

Deep convolutional neural networks (CNNs) have proven highly effective for visual recognition, where learning a universal representation from the activations of a convolutional layer is a fundamental problem. In this paper, we present Fisher Vector encoding with Variational Auto-Encoder (FV-VAE), a novel deep architecture that quantizes the local activations of a convolutional layer in a deep generative...

chapter

Self-Supervised Video Representation Learning with Odd-One-Out Networks

Basura Fernando, Hakan Bilen, Efstratios Gavves, Stephen Gould

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5729 - 5738

We propose a new self-supervised CNN pre-training technique based on a novel auxiliary task called odd-one-out learning. In this task, the machine is asked to identify the unrelated or odd element from a set of otherwise related elements. We apply this technique to self-supervised video representation learning where we sample subsequences from videos and ask the network to learn to predict the odd...

chapter

Convolutional sparse coding for face recognition

Junwei Jin, C. L. Philip Chen

2017 4th International Conference on Information, Cybernetics and Computational Social Systems (ICCSS) > 137 - 141

Face recognition has been an important task in pattern recognition and computer vision. Recently, sparse representation has become a popular data representation method in the face recognition field. Convolutional sparse coding, which replaces the linear combination of a set of dictionary atoms with the sum of a series of mapping terms convolved with the dictionary filters, was proposed to improve the...

chapter

Spectral-spatial online dictionary learning for hyperspectral image classification

Wei Fu, Shutao Li, Leyuan Fang, Jon Atli Benediktsson

2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS) > 3724 - 3727

Sparse representation (SR) based hyperspectral image (HSI) classification is a rapidly evolving research topic. How to construct an optimized dictionary to better characterize spectral-spatial features of HSI is an important problem. In this paper, a novel spectral-spatial online dictionary learning (SSODL) method for HSI classification is proposed. The main idea is to learn a complete and discriminative...

chapter

Classification of fusing SAR and multispectral image via deep bimodal autoencoders

Jie Geng, Hongyu Wang, Jianchao Fan, Xiaorui Ma

2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS) > 823 - 826

Classification of multisensor data provides potential advantages over a single sensor in accuracy. In this paper, deep bimodal autoencoders are proposed for classification of fusing synthetic aperture radar (SAR) and multispectral images. The proposed deep network based on autoencoders is trained to discover both independencies of each modality and correlations across the modalities. Specifically,...

chapter

Gabor feature based support vector guided dictionary learning for hyperspectral image classification

Sen Jia, Huimin Xie, Lin Deng, Qiang Huang, more

2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS) > 2211 - 2214

Discriminative dictionary learning aims to learn a dictionary from training samples in order to improve the discriminative ability of their coding vectors. Gabor wavelets have recently been successfully applied for hyperspectral image (HSI) classification due to their ability to extract joint spatial and spectrum information. Due to the high discriminative power of Gabor features, an efficient method,...

chapter

Decoder-side HEVC quality enhancement with scalable convolutional neural network

Ren Yang, Mai Xu, Zulin Wang

2017 IEEE International Conference on Multimedia and Expo (ICME) > 817 - 822

The latest High Efficiency Video Coding (HEVC) standard has been increasingly used to generate video streams over the Internet. However, the decoded HEVC video streams may incur severe quality degradation, especially at low bit-rates. Thus, it is necessary to enhance the visual quality of HEVC videos at the decoder side. To this end, we propose in this paper a Decoder-side Scalable Convolutional Neural Network (DS-CNN)...

chapter

Continuous Video to Simple Signals for Swimming Stroke Detection with Convolutional Neural Networks

Brandon Victor, Zhen He, Stuart Morgan, Dino Miniutti

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 122 - 131

In many sports, it is useful to analyse video of an athlete in competition for training purposes. In swimming, stroke rate is a common metric used by coaches, but measuring it requires laborious labelling of each individual stroke. We show that using a Convolutional Neural Network (CNN) we can automatically detect discrete events in continuous video (in this case, swimming strokes). We create a CNN that learns...

chapter

Keyword-driven image captioning via Context-dependent Bilateral LSTM

Xiaodan Zhang, Shengfeng He, Xinhang Song, Pengxu Wei, more

2017 IEEE International Conference on Multimedia and Expo (ICME) > 781 - 786

Image captioning has recently received much attention. Existing approaches, however, are limited to describing images with simple contextual information, typically generating one sentence to describe each image with only a single contextual emphasis. In this paper, we address this limitation from a user perspective with a novel approach. Given some keywords as additional inputs, the proposed method...

chapter

VIDEOWHISPER: Towards unsupervised learning of discriminative features of videos with RNN

Na Zhao, Hanwang Zhang, Mingxing Zhang, Richang Hong, more

2017 IEEE International Conference on Multimedia and Expo (ICME) > 277 - 282

We present VideoWhisper, a novel approach for unsupervised video representation learning, in which a video sequence is treated as a self-supervision entity based on the observation that the sequence encodes video temporal dynamics (e.g., object movement and event evolution). Specifically, for each video sequence, we use a pre-learned visual dictionary to generate a sequence of high-level semantics,...

chapter

Curiosity-Driven Exploration by Self-Supervised Prediction

Deepak Pathak, Pulkit Agrawal, Alexei A. Efros, Trevor Darrell

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 488 - 489

In many real-world scenarios, rewards extrinsic to the agent are extremely sparse, or absent altogether. In such cases, curiosity can serve as an intrinsic reward signal to enable the agent to explore its environment and learn skills that might be useful later in its life. We formulate curiosity as the error in an agent's ability to predict the consequence of its own actions in a visual feature space...

chapter

Recognition and retrieval of sound events using sparse coding convolutional neural network

Chien-Yao Wang, Andri Santoso, Seksan Mathulaprangsan, Chin-Chin Chiang, more

2017 IEEE International Conference on Multimedia and Expo (ICME) > 589 - 594

This paper proposes a novel deep convolutional neural network (CNN), called sparse coding convolutional neural network (SC-CNN), to address the task of sound event recognition and retrieval. Unlike the general framework of a CNN, in which the feature learning process is performed hierarchically, the proposed framework models the whole memorizing procedure in the human brain, including encoding,...

chapter

Leveraging geometric correlation for input-adaptive facial landmark regression

Yuyao Feng, Risheng Liu, Xin Fan, Kang Huyan, more

2017 IEEE International Conference on Multimedia and Expo (ICME) > 385 - 390

Facial analysis plays a very important role in many vision applications, such as authentication and entertainment. The very early works in the 1990s mostly focused on estimating geometric deformations of facial landmarks to address this task. In the past several years, however, more and more efforts have been made to directly learn an appearance regression for facial analysis. Though training regressions...

chapter

Online Bayesian Learning for Remote-Sensing Imagery Compression

Zizhuo Zhang, Shaoyang Li, Xiaoming Tao, Linhao Dong, more

2017 IEEE 85th Vehicular Technology Conference (VTC Spring) > 1 - 5

This work investigates a statistical technique for high performance remote-sensing imagery compression. By exploiting existing remote-sensing data sets, useful structural and texture prior information can be learned. The main methodologies are Bayesian dictionary learning and stochastic approximation. A Bayesian network simulating the generation mechanism of remote-sensing images is modelled. The...

chapter

A Study on Teacher Training Mechanism Supported by Blended Learning from the Perspectives of Communication and Knowledge Management

Zhiming Liu, Shan Jia

2017 International Symposium on Educational Technology (ISET) > 62 - 66

This study first points out that, from the perspective of knowledge learning, the essence of knowledge acquisition can be accounted for by learners' process of decoding and encoding knowledge. This process includes the encoding and decoding involved in knowledge transfer, from the perspective of communication, and knowledge information in different storage forms, from the perspective of knowledge management...

INFONA - science communication portal
