Items from 61 to 80 out of 894 results

chapter

Soft decoding of JPEG 2000 compressed images using bit-rate-driven deep convolutional neural networks

Xiaohai He, Honggang Chen, Jingxu Chen, Linbo Qing

2017 IEEE International Conference on Information and Automation (ICIA) > 843 - 847

Lossy image compression methods always introduce various unpleasant artifacts into the compressed results, especially at low bit-rates. In recent years, many effective soft decoding methods for JPEG compressed images have been proposed. However, to the best of our knowledge, very little work has been done on soft decoding of JPEG 2000 compressed images. Inspired by the outstanding performance of Convolution...

chapter

A review of object detection based on convolutional neural network

Wang Zhiqiang, Liu Jun

2017 36th Chinese Control Conference (CCC) > 11104 - 11109

With the development of intelligent devices and social media, the volume of data on the Internet has grown rapidly. As an important aspect of image processing, object detection has become one of the most popular international research fields. In recent years, the powerful feature learning and transfer learning abilities of Convolutional Neural Networks (CNNs) have received growing interest within the computer...

chapter

Learning Cross-Modal Embeddings for Cooking Recipes and Food Images

Amaia Salvador, Nicholas Hynes, Yusuf Aytar, Javier Marin, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3068 - 3076

In this paper, we introduce Recipe1M, a new large-scale, structured corpus of over 1m cooking recipes and 800k food images. As the largest publicly available collection of recipe data, Recipe1M affords the ability to train high-capacity models on aligned, multi-modal data. Using these data, we train a neural network to find a joint embedding of recipes and images that yields impressive results on...

chapter

Self-Supervised Learning of Visual Features through Embedding Images into Text Topic Spaces

Lluis Gomez, Yash Patel, Marcal Rusinol, Dimosthenis Karatzas, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2017 - 2026

End-to-end training from scratch of current deep architectures for new computer vision problems would require Imagenet-scale datasets, and this is not always possible. In this paper we present a method that is able to take advantage of freely available multi-modal content to train computer vision algorithms without human supervision. We put forward the idea of performing self-supervised learning of...

chapter

Deep Unsupervised Similarity Learning Using Partially Ordered Sets

Miguel A. Bautista, Artsiom Sanakoyeu, Bjorn Ommer

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1923 - 1932

Unsupervised learning of visual similarities is of paramount importance to computer vision, particularly due to the lack of training data for fine-grained similarities. Deep learning of similarities is often based on relationships between pairs or triplets of samples. Many of these relations are unreliable and mutually contradicting, implying inconsistencies when trained without supervision information...

chapter

Weakly Supervised Dense Video Captioning

Zhiqiang Shen, Jianguo Li, Zhou Su, Minjun Li, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5159 - 5167

This paper focuses on a novel and challenging vision task, dense video captioning, which aims to automatically describe a video clip with multiple informative and diverse caption sentences. The proposed method is trained without explicit annotation of fine-grained sentence to video region-sequence correspondence, but is only based on weak video-level sentence annotations. It differs from existing...

chapter

All You Need is Beyond a Good Init: Exploring Better Solution for Training Extremely Deep Convolutional Neural Networks with Orthonormality and Modulation

Di Xie, Jiang Xiong, Shiliang Pu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5075 - 5084

Deep neural networks are difficult to train, and this predicament becomes worse as the depth increases. The essence of this problem lies in the magnitude of backpropagated errors, which results in vanishing or exploding gradients. We show that a variant of regularizer which utilizes orthonormality among different filter banks can alleviate this problem. Moreover, we design a backward error...

chapter

Learning to Predict Stereo Reliability Enforcing Local Consistency of Confidence Maps

Matteo Poggi, Stefano Mattoccia

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4541 - 4550

Confidence measures estimate unreliable disparity assignments performed by a stereo matching algorithm and, as recently proved, can be used for several purposes. This paper aims at increasing, by means of a deep network, the effectiveness of state-of-the-art confidence measures exploiting the local consistency assumption. We exhaustively evaluated our proposal on 23 confidence measures, including...

chapter

Lean Crowdsourcing: Combining Humans and Machines in an Online System

Steve Branson, Grant Van Horn, Pietro Perona

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6109 - 6118

We introduce a method to greatly reduce the amount of redundant annotations required when crowdsourcing annotations such as bounding boxes, parts, and class labels. For example, if two Mechanical Turkers happen to click on the same pixel location when annotating a part in a given image (an event that is very unlikely to occur by random chance), it is a strong indication that the...

chapter

Making Deep Neural Networks Robust to Label Noise: A Loss Correction Approach

Giorgio Patrini, Alessandro Rozza, Aditya Krishna Menon, Richard Nock, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2233 - 2241

We present a theoretically grounded approach to train deep neural networks, including recurrent networks, subject to class-dependent label noise. We propose two procedures for loss correction that are agnostic to both application domain and network architecture. They simply amount to at most a matrix inversion and multiplication, provided that we know the probability of each class being corrupted...

chapter

Learning Spatial Regularization with Image-Level Supervisions for Multi-label Image Classification

Feng Zhu, Hongsheng Li, Wanli Ouyang, Nenghai Yu, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2027 - 2036

Multi-label image classification is a fundamental but challenging task in computer vision. Great progress has been achieved by exploiting semantic relations between labels in recent years. However, conventional approaches are unable to model the underlying spatial relations between labels in multi-label images, because spatial annotations of the labels are generally not provided. In this paper, we...

chapter

Improving RANSAC-Based Segmentation through CNN Encapsulation

Dustin Morley, Hassan Foroosh

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2661 - 2670

In this work, we present a method for improving a random sample consensus (RANSAC) based image segmentation algorithm by encapsulating it within a convolutional neural network (CNN). The improvements are gained by gradient descent training on the set of pre-RANSAC filtering and thresholding operations using a novel RANSAC-based loss function, which is geared toward optimizing the strength of the correct...

chapter

Adaptive Class Preserving Representation for Image Classification

Jian-Xun Mi, Qiankun Fu, Weisheng Li

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2624 - 2632

In linear representation-based image classification, an unlabeled sample is represented by the entire training set. To obtain a stable and discriminative solution, regularization on the vector of representation coefficients is necessary. For example, the representation in sparse representation-based classification (SRC) uses L1 norm penalty as regularization, which is equal to lasso. However, lasso...

chapter

PoseAgent: Budget-Constrained 6D Object Pose Estimation via Reinforcement Learning

Alexander Krull, Eric Brachmann, Sebastian Nowozin, Frank Michel, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2566 - 2574

State-of-the-art computer vision algorithms often achieve efficiency by making discrete choices about which hypotheses to explore next. This allows allocation of computational resources to promising candidates; however, such decisions are non-differentiable. As a result, these algorithms are hard to train in an end-to-end fashion. In this work we propose to learn an efficient algorithm for the task...

chapter

From Red Wine to Red Tomato: Composition with Context

Ishan Misra, Abhinav Gupta, Martial Hebert

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1160 - 1169

Compositionality and contextuality are key building blocks of intelligence. They allow us to compose known concepts to generate new and complex ones. However, traditional learning methods do not model both these properties and require copious amounts of labeled data to learn new concepts. A large fraction of existing techniques, e.g., using late fusion, compose concepts but fail to model contextuality...

chapter

Unsupervised Adaptive Re-identification in Open World Dynamic Camera Networks

Rameswar Panda, Amran Bhuiyan, Vittorio Murino, Amit K. Roy-Chowdhury

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1377 - 1386

Person re-identification is an open and challenging problem in computer vision. Existing approaches have concentrated on either designing the best feature representation or learning optimal matching metrics in a static setting where the number of cameras is fixed in a network. Most approaches have neglected the dynamic and open world nature of the re-identification problem, where a new camera may...

chapter

Deep Hashing Network for Unsupervised Domain Adaptation

Hemanth Venkateswara, Jose Eusebio, Shayok Chakraborty, Sethuraman Panchanathan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5385 - 5394

In recent years, deep neural networks have emerged as a dominant machine learning tool for a wide variety of application domains. However, training a deep neural network requires a large amount of labeled data, which is an expensive process in terms of time, labor and human expertise. Domain adaptation or transfer learning algorithms address this challenge by leveraging labeled data in a different,...

chapter

Are You Smarter Than a Sixth Grader? Textbook Question Answering for Multimodal Machine Comprehension

Aniruddha Kembhavi, Minjoon Seo, Dustin Schwenk, Jonghyun Choi, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5376 - 5384

We introduce the task of Multi-Modal Machine Comprehension (M3C), which aims at answering multimodal questions given a context of text, diagrams and images. We present the Textbook Question Answering (TQA) dataset that includes 1,076 lessons and 26,260 multi-modal questions, taken from middle school science curricula. Our analysis shows that a significant portion of questions require complex parsing...

chapter

SPFTN: A Self-Paced Fine-Tuning Network for Segmenting Objects in Weakly Labelled Videos

Dingwen Zhang, Le Yang, Deyu Meng, Dong Xu, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5340 - 5348

Object segmentation in weakly labelled videos is an interesting yet challenging task, which aims at learning to perform category-specific video object segmentation by only using video-level tags. Existing works in this research area might still have some limitations, e.g., lack of effective DNN-based learning frameworks, under-exploring the context information, and requiring to leverage the unstable...

chapter

Teaching Compositionality to CNNs

Austin Stone, Huayan Wang, Michael Stark, Yi Liu, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 732 - 741

Convolutional neural networks (CNNs) have shown great success in computer vision, approaching human-level performance when trained for specific tasks via application-specific loss functions. In this paper, we propose a method for augmenting and training CNNs so that their learned features are compositional. It encourages networks to form representations that disentangle objects from their surroundings...

Keywords:
TRAINING
COMPUTER VISION

Content availability

Available (884)
None (10)

Publication type

book (805)
article (89)

Keywords

FEATURE EXTRACTION (374)
COMPUTATIONAL MODELING (135)
ACCURACY (130)
OBJECT DETECTION (129)
PATTERN RECOGNITION (126)
VISUALIZATION (124)
SUPPORT VECTOR MACHINES (120)
IMAGE CLASSIFICATION (115)
IMAGE SEGMENTATION (110)
IMAGE RECOGNITION (104)
CLASSIFICATION ALGORITHMS (101)
DATABASES (96)
DETECTORS (95)
FACE RECOGNITION (95)
SHAPE (95)
LEARNING (ARTIFICIAL INTELLIGENCE) (88)
FACE (87)
IMAGE COLOR ANALYSIS (87)
CAMERAS (84)
MACHINE LEARNING (80)
OBJECT RECOGNITION (80)
TESTING (79)
CONFERENCES (78)
PIXEL (78)
ROBUSTNESS (75)
HISTOGRAMS (72)
DATA MINING (71)
ARTIFICIAL NEURAL NETWORKS (70)
NEURAL NETWORKS (70)
KERNEL (68)
PRINCIPAL COMPONENT ANALYSIS (67)
ESTIMATION (66)
HUMANS (64)
ALGORITHM DESIGN AND ANALYSIS (60)
IMAGE EDGE DETECTION (60)
IMAGE PROCESSING (55)
IMAGE MOTION ANALYSIS (52)
LIGHTING (48)
HIDDEN MARKOV MODELS (46)
MATHEMATICAL MODEL (46)
SIGNAL PROCESSING (44)
VECTORS (42)
OPTIMIZATION (41)
TRAINING DATA (41)
COMPUTER ARCHITECTURE (40)
BOOSTING (37)
EQUATIONS (37)
REAL TIME SYSTEMS (36)
SEMANTICS (36)
TRANSFORMS (36)
CONVOLUTION (35)
IMAGE SEQUENCES (35)
IMAGE REPRESENTATION (34)
MACHINE VISION (34)
CORRELATION (33)
NEURAL NETS (33)
IMAGE RESOLUTION (32)
IMAGE RECONSTRUCTION (31)
OPTICAL IMAGING (31)
TRACKING (31)
SUPPORT VECTOR MACHINE (29)
VEHICLES (29)
VIDEO SIGNAL PROCESSING (29)
VIDEOS (29)
COMPUTERS (28)
STANDARDS (28)
DATA MODELS (27)
EIGENVALUES AND EIGENFUNCTIONS (27)
SIGNAL PROCESSING ALGORITHMS (27)
NOISE (26)
SUPPORT VECTOR MACHINE CLASSIFICATION (26)
THREE DIMENSIONAL DISPLAYS (26)
DEEP LEARNING (25)
NEURONS (25)
POSE ESTIMATION (24)
TARGET TRACKING (24)
CLUSTERING ALGORITHMS (23)
FACE DETECTION (23)
IMAGE COLOUR ANALYSIS (23)
ARTIFICIAL INTELLIGENCE (22)
DICTIONARIES (22)
IMAGE MATCHING (22)
MEASUREMENT (22)
PEDESTRIAN DETECTION (22)
BAYES METHODS (21)
GESTURE RECOGNITION (21)
IMAGE RETRIEVAL (21)
PATTERN CLASSIFICATION (21)
VIDEO SEQUENCES (21)
ADABOOST (20)
SOLID MODELING (20)
COMPLEXITY THEORY (18)
CONTEXT (18)
DECISION TREES (18)
EDUCATIONAL INSTITUTIONS (18)
ELECTRONIC MAIL (18)
GEOMETRY (18)
SVM (18)

Data set

ieee (893)
Wiley (1)

INFONA - science communication portal
