Search results

chapter

Zero-Shot Classification with Discriminative Semantic Representation Learning

Meng Ye, Yuhong Guo

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5103 - 5111

Zero-shot learning, a special case of unsupervised domain adaptation where the source and target domains have disjoint label spaces, has become increasingly popular in the computer vision community. In this paper, we propose a novel zero-shot learning method based on discriminative sparse non-negative matrix factorization. The proposed approach aims to identify a set of common high-level semantic...

chapter

Fried Binary Embedding for High-Dimensional Visual Features

Weixiang Hong, Junsong Yuan, Sreyasee Das Bhattacharjee

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6221 - 6229

Most existing binary embedding methods prefer compact binary codes (b-dimensional) to avoid high computational and memory cost of projecting high-dimensional visual features (d-dimensional, b...

chapter

HOPE: Hierarchical Object Prototype Encoding for Efficient Object Instance Search in Videos

Tan Yu, Yuwei Wu, Junsong Yuan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3195 - 3204

This paper tackles the problem of efficient and effective object instance search in videos. To effectively capture the relevance between a query and video frames and precisely localize the particular object, we leverage the object proposals to improve the quality of object instance search in videos. However, hundreds of object proposals obtained from each frame could result in unaffordable memory...

chapter

Deep Quantization: Encoding Convolutional Activations with Deep Generative Model

Zhaofan Qiu, Ting Yao, Tao Mei

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4085 - 4094

Deep convolutional neural networks (CNNs) have proven highly effective for visual recognition, where learning a universal representation from activations of convolutional layer is a fundamental problem. In this paper, we present Fisher Vector encoding with Variational Auto-Encoder (FV-VAE), a novel deep architecture that quantizes the local activations of convolutional layer in a deep generative...

chapter

Spatio-Temporal Vector of Locally Max Pooled Features for Action Recognition in Videos

Ionut Cosmin Duta, Bogdan Ionescu, Kiyoharu Aizawa, Nicu Sebe

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3205 - 3214

We introduce Spatio-Temporal Vector of Locally Max Pooled Features (ST-VLMPF), a super vector-based encoding method specifically designed for local deep features encoding. The proposed method addresses an important problem of video understanding: how to build a video representation that incorporates the CNN features over the entire video. Feature assignment is carried out at two levels, by using the...

chapter

Traffic scene recognition based on deep CNN and VLAD spatial pyramids

Fang-Yu Wu, Shi-Yang Yan, Jeremy S. Smith, Bai-Ling Zhang

2017 International Conference on Machine Learning and Cybernetics (ICMLC) > 1 > 156 - 161

Traffic scene recognition is an important and challenging issue in Intelligent Transportation Systems (ITS). Recently, Convolutional Neural Network (CNN) models have achieved great success in many applications, including scene classification. The remarkable representational learning capability of CNN remains to be further explored for solving real-world problems. Vector of Locally Aggregated Descriptors...

chapter

Visualization Techniques for the Comparative Analysis of Weighted Free Trees

Yuichi Naragino, Kazuo Misue

2017 21st International Conference Information Visualisation (IV) > 45 - 51

A weighted free tree is an undirected connected graph with no cycles whose edges and nodes have weights (positive real numbers). The purpose of our research is to support the comparative analysis of weighted free trees. The fundamental categories of visualization techniques that support comparison are juxtaposition, superposition, and explicit encoding. Juxtaposition is often used for comparison....

chapter

Decoder-side HEVC quality enhancement with scalable convolutional neural network

Ren Yang, Mai Xu, Zulin Wang

2017 IEEE International Conference on Multimedia and Expo (ICME) > 817 - 822

The latest High Efficiency Video Coding (HEVC) standard has been increasingly used to generate video streams over the Internet. However, the decoded HEVC video streams may incur severe quality degradation, especially at low bit-rates. Thus, it is necessary to enhance the visual quality of HEVC videos at the decoder side. To this end, we propose in this paper a Decoder-side Scalable Convolutional Neural Network (DS-CNN)...

chapter

Optimized video coding for omnidirectional videos

Minhao Tang, Yu Zhang, Jiangtao Wen, Shiqiang Yang

2017 IEEE International Conference on Multimedia and Expo (ICME) > 799 - 804

The ever widening application of virtual reality requires the ultra high resolution omnidirectional videos (OVs) to be transmitted over the wired and wireless Internet at low cost (i.e. bitrate). Various solutions have been proposed to intelligently reduce the bitrate, e.g. adapting the spatial resolution of the video for different directions of the panorama with regard to current direction that the...

chapter

Spatial weighted fisher vector for image retrieval

Chengzuo Qi, Cunzhao Shi, Jian Xu, Chunheng Wang, et al.

2017 IEEE International Conference on Multimedia and Expo (ICME) > 463 - 468

Several recent works interpret convolutional features produced by deep convolutional neural networks as local descriptors. Existing high-dimensional aggregation-based methods, e.g., Fisher Vector (FV), perform worse than pooling-based methods in most situations, and we observe that this is mainly caused by ignoring spatial weights. In this paper, we propose a novel method named spatial...

chapter

A new combined PSNR for objective video quality assessment

Xiwu Shang, Guozhong Wang, Haiwu Zhao, Jie Liang, et al.

2017 IEEE International Conference on Multimedia and Expo (ICME) > 811 - 816

In video coding, quality evaluation is important for improving the coding efficiency. Usually Peak Signal-to-Noise Ratio (PSNR) is utilized to measure the performance of different coding techniques. During the video coding process in YCbCr color space, there are three PSNRs, one for each color component. Sometimes they may contradict each other, which poses a problem for evaluating the coding performance...

chapter

Online visual tracking with high-order pooling

Xiyu Yan, Bo Ma

2017 IEEE International Conference on Multimedia and Expo (ICME) > 289 - 294

Most local sparse representation models in visual tracking contain three components: 1) extracting local descriptors from the target region, 2) encoding the extracted local descriptors as mid-level features, 3) aggregating statistics of mid-level features into a signature. Since the last step aggregates only first-order statistics of mid-level features, it is named First-order Pooling (FP)...

chapter

Real-Time Visual Feedback: A Study in Coding Analytics

Jeremie Seanosky, Isabelle Guillot, David Boulanger, Rebecca Guillot, et al.

2017 IEEE 17th International Conference on Advanced Learning Technologies (ICALT) > 264 - 266

Higher dropout and failure rates among computer science students in introductory programming courses tend to be a norm for many institutions. Years of evidence indicate that dropouts and failures persist in spite of advancements in pedagogy, technology, and teacher training. Most advancements have relied on summative assessments and of late formative assessments. This research explores assessments...

chapter

The Role of Synchronic Causal Conditions in Visual Knowledge Learning

Seng-Beng Ho

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 9 - 16

We propose a principled approach for the learning of causal conditions from actions and activities taking place in the physical environment through visual input. Causal conditions are the preconditions that must exist before a certain effect can ensue. We propose to consider diachronic and synchronic causal conditions separately for the learning of causal knowledge. Diachronic condition captures the...

chapter

VIDEOWHISPER: Towards unsupervised learning of discriminative features of videos with RNN

Na Zhao, Hanwang Zhang, Mingxing Zhang, Richang Hong, et al.

2017 IEEE International Conference on Multimedia and Expo (ICME) > 277 - 282

We present VideoWhisper, a novel approach for unsupervised video representation learning, in which video sequence is treated as a self-supervision entity based on the observation that the sequence encodes video temporal dynamics (e.g., object movement and event evolution). Specifically, for each video sequence, we use a pre-learned visual dictionary to generate a sequence of high-level semantics,...

chapter

Multiscale dictionary learning for hierarchical sparse representation

Yangmei Shen, Hongkai Xiong, Wenrui Dai

2017 IEEE International Conference on Multimedia and Expo (ICME) > 1332 - 1337

In this paper, we propose a multiscale dictionary learning framework for hierarchical sparse representation of natural images. The proposed framework leverages an adaptive quadtree decomposition to represent structured sparsity in different scales. In dictionary learning, a tree-structured regularized optimization is formulated to distinguish and represent high-frequency details based on varying local...

chapter

Pattern classification reveals developmental differences in how memories influence new learning

Margaret L. Schlichting, Katharine F. Guarino, Hannah E. Roome, Alison R. Preston

2017 International Workshop on Pattern Recognition in Neuroimaging (PRNI) > 1 - 4

Recent studies suggest that the ability to use memories flexibly emerges gradually with development; however, the mechanistic changes that underlie this shift remain unknown. Participants aged 7-30 years encoded a series of related associations during functional magnetic resonance imaging (fMRI) scanning. We hypothesized that the comparatively more rigid memory behaviors characteristic of children...

chapter

Research of red tide visualization system based on OpenGL

Chai Jianfei, Hu Xiaomei

2017 2nd International Conference on Image, Vision and Computing (ICIVC) > 729 - 733

Red tide harms the ecological environment. To reduce the occurrence of red tides, it is necessary to study and analyze the reasons for their formation. Establishing a red tide visualization system is therefore of considerable value. The paper improves the inverse distance weighted interpolation algorithm based on research on the spatial interpolation...

chapter

LLC encoded BoW features and softmax regression for microscopic image classification

Dongyun Lin, Zhiping Lin, Lei Sun, Kar-Ann Toh, et al.

2017 IEEE International Symposium on Circuits and Systems (ISCAS) > 1 - 4

This paper proposes a method based on the bag-of-words (BoW) and the softmax regression for microscopic image classification. Essentially, the locality-constrained linear coding (LLC) is adopted for local feature encoding. Compared with the traditionally adopted vector quantization (VQ) in the BoW framework, the LLC encodes local structures of microscopic images with lower quantization errors and...

chapter

A stimulation platform for optogenetic and bionic vision restoration

Francesco Galluppi, Didier Pruneau, Joel Chavas, Xavier Lagorce, et al.

2017 IEEE International Symposium on Circuits and Systems (ISCAS) > 1 - 4

Optogenetic therapy holds the promise to restore visual function in patients affected by retinal degenerative diseases. However, the light-sensitivity of the molecule mediating light responses is much less than the one of healthy retinal cells so that no photo-stimulation is expected under natural environmental conditions. In this work, we present a platform set up to stimulate optogenetically-engineered...

INFONA - science communication portal
