Search results

chapter

Genetic CNN

Lingxi Xie, Alan Yuille

2017 IEEE International Conference on Computer Vision (ICCV) > 1388 - 1397

2017 IEEE International Conference on Computer Vision (ICCV)

The deep convolutional neural network (CNN) is the state-of-the-art solution for large-scale visual recognition. Following some basic principles such as increasing network depth and constructing highway connections, researchers have manually designed a lot of fixed network architectures and verified their effectiveness.,,In this paper, we discuss the possibility of learning deep network structures...

chapter

SuBiC: A Supervised, Structured Binary Code for Image Search

Himalaya Jain, Joaquin Zepeda, Patrick Perez, Remi Gribonval

2017 IEEE International Conference on Computer Vision (ICCV) > 833 - 842

2017 IEEE International Conference on Computer Vision (ICCV)

For large-scale visual search, highly compressed yet meaningful representations of images are essential. Structured vector quantizers based on product quantization and its variants are usually employed to achieve such compression while minimizing the loss of accuracy. Yet, unlike binary hashing schemes, these unsupervised methods have not yet benefited from the supervision, end-to-end learning and...

chapter

Generalized Orderless Pooling Performs Implicit Salient Matching

Marcel Simon, Yang Gao, Trevor Darrell, Joachim Denzler, more

2017 IEEE International Conference on Computer Vision (ICCV) > 4970 - 4979

2017 IEEE International Conference on Computer Vision (ICCV)

Most recent CNN architectures use average pooling as a final feature encoding step. In the field of fine-grained recognition, however, recent global representations like bilinear pooling offer improved performance. In this paper, we generalize average and bilinear pooling to “α-pooling”, allowing for learning the pooling strategy during training. In addition, we present a novel way to visualize decisions...

chapter

Dictionary learning for spontaneous neural activity modeling

Birini Troullinou, Grigorios Tsagkatakis, Ganna Palagina, Maria Papadopouli, more

2017 25th European Signal Processing Conference (EUSIPCO) > 1579 - 1583

2017 25th European Signal Processing Conference (EUSIPCO)

Modeling the activity of an ensemble of neurons can provide critical insights into the workings of the brain. In this work we examine if learning based signal modeling can contribute to a high quality modeling of neuronal signal data. To that end, we employ the sparse coding and dictionary learning schemes for capturing the behavior of neuronal responses into a small number of representative prototypical...

chapter

Zero-Shot Classification with Discriminative Semantic Representation Learning

Meng Ye, Yuhong Guo

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5103 - 5111

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Zero-shot learning, a special case of unsupervised domain adaptation where the source and target domains have disjoint label spaces, has become increasingly popular in the computer vision community. In this paper, we propose a novel zero-shot learning method based on discriminative sparse non-negative matrix factorization. The proposed approach aims to identify a set of common high-level semantic...

chapter

Deep Quantization: Encoding Convolutional Activations with Deep Generative Model

Zhaofan Qiu, Ting Yao, Tao Mei

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4085 - 4094

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Deep convolutional neural networks (CNNs) have proven highly effective for visual recognition, where learning a universal representation from activations of convolutional layer plays a fundamental problem. In this paper, we present Fisher Vector encoding with Variational Auto-Encoder (FV-VAE), a novel deep architecture that quantizes the local activations of convolutional layer in a deep generative...

chapter

Decoder-side HEVC quality enhancement with scalable convolutional neural network

Ren Yang, Mai Xu, Zulin Wang

2017 IEEE International Conference on Multimedia and Expo (ICME) > 817 - 822

2017 IEEE International Conference on Multimedia and Expo (ICME)

The latest High Efficiency Video Coding (HEVC) has been increasingly used to generate video streams over Internet. However, the decoded HEVC video streams may incur severe quality degradation, especially at low bit-rates. Thus, it is necessary to enhance visual quality of HEVC videos at the decoder side. To this end, we propose in this paper a Decoder-side Scalable Convolutional Neural Network (DS-CNN)...

chapter

VIDEOWHISPER: Towards unsupervised learning of discriminative features of videos with RNN

Na Zhao, Hanwang Zhang, Mingxing Zhang, Richang Hong, more

2017 IEEE International Conference on Multimedia and Expo (ICME) > 277 - 282

2017 IEEE International Conference on Multimedia and Expo (ICME)

We present VidedWhisfer, a novel approach for unsupervised video representation learning, in which video sequence is treated as a self-supervision entity based on the observation that the sequence encodes video temporal dynamics (e.g., object movement and event evolution). Specifically, for each video sequence, we use a pre-learned visual dictionary to generate a sequence of high-level semantics,...

chapter

LLC encoded BoW features and softmax regression for microscopic image classification

Dongyun Lin, Zhiping Lin, Lei Sun, Kar-Ann Toh, more

2017 IEEE International Symposium on Circuits and Systems (ISCAS) > 1 - 4

2017 IEEE International Symposium on Circuits and Systems (ISCAS)

This paper proposes a method based on the bag-of-words (BoW) and the softmax regression for microscopic image classification. Essentially, the locality-constrained linear coding (LLC) is adopted for local feature encoding. Compared with the traditionally adopted vector quantization (VQ) in the BoW framework, the LLC encodes local structures of microscopic images with lower quantization errors and...

chapter

Does frequency resolution affect the classification performance of steady-state visual evoked potentials?

Masaki Nakanishi, Yijun Wang, Yu-Te Wang, Tzyy-Ping Jung

2017 8th International IEEE/EMBS Conference on Neural Engineering (NER) > 341 - 344

2017 8th International IEEE/EMBS Conference on Neural Engineering (NER)

Multi-target stimulus coding plays an important role in a steady-state visual evoked potential (SSVEP)-based brain-computer interface (BCI). In conventional SSVEP-based BCIs, a large interval between two neighboring stimulus frequencies is often used to improve classification accuracy. Although recent progresses in stimulus coding and target identification methods that have significantly improved...

chapter

Indexing Mayan hieroglyphs with neural codes

Edgar Roman-Rangel, Stephane Marchand-Maillet

2016 23rd International Conference on Pattern Recognition (ICPR) > 253 - 258

2016 23rd International Conference on Pattern Recognition (ICPR)

We present an approach for unsupervised computation of local shape descriptors, which relies on the use of linear autoencoders for characterizing local regions of complex shapes. The proposed approach responds to the need for a robust scheme to index binary images using local descriptors, which arises when only few examples of the complete images are available for training, thus making inaccurate...

chapter

Can Contextual Information Improve Scene Classification Performance?

Mana Shahriari, Robert Bergevin

2016 International Conference on Digital Image Computing: Techniques and Applications (DICTA) > 1 - 7

2016 International Conference on Digital Image Computing: Techniques and Applications (DICTA)

Bag of visual words (BoVW) remains a very competitive representation in the domain of scene classification. In this framework, extracting SIFT descriptors on a dense grid of pixels has shown to lead to a better performance. However, due to the nature of SIFT as an edge-based descriptor, computing SIFT on homogeneous regions might result in non-stable region descriptors. The suggested solution in the...

chapter

Studying how digital logic instructors solve canonical problems

Geoffrey L. Herman

2016 IEEE Frontiers in Education Conference (FIE) > 1 - 5

2016 IEEE Frontiers in Education Conference (FIE)

Sketches and other forms of graphical communication are central to both the practice and learning of engineering. Visual representations play a critical role in helping students learn engineering concepts, socialize them into the engineering discipline, and facilitate or hinder the design process. Despite the importance of graphical communication and visual representations, our understanding of how...

chapter

Image classification based on hash codes and space pyramid

Peng Tian-qiang, Li Fang

2016 IEEE Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC) > 114 - 118

2016 IEEE Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC)

Sparse Coding is a widely used method to represent an image. However, sparse coding and its improved algorithms have the problem of complex computation and long running time and so on. For these problems, we propose an image classification method based on hash codes and space pyramid, which encodes local feature points with hash codes instead of sparse coding. Firstly, extract the local feature points...

chapter

Spatial collaborative representation for image categorization

Mouna Dammak, Chokri Ben Amar

2016 IEEE International Conference on Systems, Man, and Cybernetics (SMC) > 3776 - 3781

2016 IEEE International Conference on Systems, Man, and Cybernetics (SMC)

A novel proposed approach, collaborative representation-based classification, has been developed for face recognition and recently used in image classification task owing to its simplicity and effectiveness. The major drawback of this method is the neglect of the spatial structure among the image representations. Inspired by the success of this technique and motivated by the power of spatial information...

chapter

Bag of Genres for Video Retrieval

Leonardo A. Duarte, Otavio A. B. Penatti, Jurandy Almeida

2016 29th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI) > 257 - 264

2016 29th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI)

Often, videos are composed of multiple concepts or even genres. For instance, news videos may contain sports, action, nature, etc. Therefore, encoding the distribution of such concepts/genres in a compact and effective representation is a challenging task. In this sense, we propose the Bag of Genres representation, which is based on a visual dictionary defined by a genre classifier. Each visual word...

chapter

Simple and effective visual question answering in a single modality

Yuetan Lin, Zhangyang Pang, Yanan Li, Donghui Wang

2016 IEEE International Conference on Image Processing (ICIP) > 2276 - 2280

2016 IEEE International Conference on Image Processing (ICIP)

Visual question answering (VQA) comes as a result of great development in computer vision and natural language processing, which requires deep understanding of images and questions and effective integration of them. Current works on VQA simply concatenated visual and textual features or compared them via dot product, which were unable to eliminate the semantic difference between them. We argue to...

chapter

Position dependent prediction combination for intra-frame video coding

Amir Said, Xin Zhao, Marta Karczewicz, Jianle Chen, more

2016 IEEE International Conference on Image Processing (ICIP) > 534 - 538

2016 IEEE International Conference on Image Processing (ICIP)

Intra-frame prediction in the High Efficiency Video Coding (HEVC) standard can be empirically improved by applying sets of recursive two-dimensional filters to the predicted values. However, this approach does not allow (or complicates significantly) the parallel computation of pixel predictions. In this work we analyze why the recursive filters are effective, and use the results to derive sets of...

chapter

Decoding of responses to mixed frequency and phase coded visual stimuli using multiset canonical correlation analysis

Kaori Suefusa, Toshihisa Tanaka

2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC) > 1492 - 1495

2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC)

Brain-computer interfacing (BCI) based on steady-state visual evoked potentials (SSVEPs) is one of the most practical BCIs because of its high recognition accuracies and little training of a user. Mixed frequency and phase coding which can implement a number of commands and achieve a high information transfer rate (ITR) has recently been gaining much attention. In order to implement mixed-coded SSVEP-BCI...

chapter

A deep sparse coding method for fine-grained visual categorization

Lihua Guo, Chenggang Guo

2016 International Joint Conference on Neural Networks (IJCNN) > 632 - 639

2016 International Joint Conference on Neural Networks (IJCNN)

In the fine-grained categories, images have lager diversity in their intra categories. Meanwhile, they have more similarity in their inter categories. Therefore, images are difficultly distinguish during fine-grained visual classification(FGVC). This paper proposes a deep sparse coding framework to implement the fine-grained visual categorization. In our framework, deep layer structures with sparse...

INFONA - science communication portal

Search results

Genetic CNN

SuBiC: A Supervised, Structured Binary Code for Image Search

Generalized Orderless Pooling Performs Implicit Salient Matching

Dictionary learning for spontaneous neural activity modeling

Zero-Shot Classification with Discriminative Semantic Representation Learning

Deep Quantization: Encoding Convolutional Activations with Deep Generative Model

Decoder-side HEVC quality enhancement with scalable convolutional neural network

VIDEOWHISPER: Towards unsupervised learning of discriminative features of videos with RNN

LLC encoded BoW features and softmax regression for microscopic image classification

Does frequency resolution affect the classification performance of steady-state visual evoked potentials?

Indexing Mayan hieroglyphs with neural codes

Can Contextual Information Improve Scene Classification Performance?

Studying how digital logic instructors solve canonical problems

Image classification based on hash codes and space pyramid

Spatial collaborative representation for image categorization

Bag of Genres for Video Retrieval

Simple and effective visual question answering in a single modality

Position dependent prediction combination for intra-frame video coding

Decoding of responses to mixed frequency and phase coded visual stimuli using multiset canonical correlation analysis

A deep sparse coding method for fine-grained visual categorization

Filter options

Publication date

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options