This paper introduces a deep-learning approach to photographic style transfer that handles a large variety of image content while faithfully transferring the reference style. Our approach builds upon the recent work on painterly transfer that separates style from the content of an image by considering different layers of a neural network. However, as is, this approach is not suitable for photorealistic...
We propose a technique that propagates information forward through video data. The method is conceptually simple and can be applied to tasks that require the propagation of structured information, such as semantic labels, based on video content. We propose a Video Propagation Network that processes video frames in an adaptive manner. The model is applied online: it propagates information forward without...
We propose a general framework called Network Dissection for quantifying the interpretability of latent representations of CNNs by evaluating the alignment between individual hidden units and a set of semantic concepts. Given any CNN model, the proposed method draws on a data set of concepts to score the semantics of hidden units at each intermediate convolutional layer. The units with semantics are...
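As a rough illustration of the alignment scoring described above, the match between a hidden unit and a semantic concept can be measured by comparing the unit's thresholded activation map against the concept's segmentation mask with an intersection-over-union score. The sketch below is illustrative only; the threshold, array shapes, and function name are assumptions, not the paper's exact procedure.

```python
import numpy as np

def unit_concept_iou(activation_map, concept_mask, threshold):
    """Score how well one hidden unit aligns with one semantic concept.

    activation_map : 2D float array of the unit's (upsampled) activations
    concept_mask   : 2D bool array marking pixels labeled with the concept
    threshold      : activation level above which the unit counts as 'on'
    """
    unit_mask = activation_map > threshold
    intersection = np.logical_and(unit_mask, concept_mask).sum()
    union = np.logical_or(unit_mask, concept_mask).sum()
    return intersection / union if union > 0 else 0.0
```

Units whose best IoU against any concept exceeds a chosen cutoff would then be reported as interpretable detectors for that concept.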
Boundary and edge cues are highly beneficial in improving a wide variety of vision tasks such as semantic segmentation, object recognition, stereo, and object proposal generation. Recently, the problem of edge detection has been revisited and significant progress has been made with deep learning. While classical edge detection is a challenging binary problem in itself, the category-aware semantic...
When building artificial intelligence systems that can reason and answer questions about visual data, we need diagnostic tests to analyze our progress and discover shortcomings. Existing benchmarks for visual question answering can help, but have strong biases that models can exploit to correctly answer questions without reasoning. They also conflate multiple sources of error, making it hard to pinpoint...
Shadow removal is a challenging task as it requires the detection/annotation of shadows as well as semantic understanding of the scene. In this paper, we propose an automatic and end-to-end deep neural network (DeshadowNet) to tackle these problems in a unified manner. DeshadowNet is designed with a multi-context architecture, where the output shadow matte is predicted by embedding information from...
We investigate and improve self-supervision as a drop-in replacement for ImageNet pretraining, focusing on automatic colorization as the proxy task. Self-supervised training has been shown to be more promising for utilizing unlabeled data than other, traditional unsupervised learning methods. We build on this success and evaluate the ability of our self-supervised network in several contexts. On VOC...
In this paper, we present a novel approach to estimate the relative depth of regions in monocular images. There are several contributions. First, the task of monocular depth estimation is considered as a learning-to-rank problem, which offers several advantages compared to regression approaches. Second, monocular depth cues of human perception are modeled in a systematic manner. Third, we show that...
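To make the learning-to-rank framing concrete, one common formulation (not necessarily this paper's exact loss) supervises predicted depths on pairs of points with an ordinal label rather than with absolute depth values. In the sketch below, the label convention and function name are illustrative assumptions: +1 means point i should be farther than point j, -1 closer, and 0 roughly equal.

```python
import numpy as np

def pairwise_ranking_loss(depth_i, depth_j, label):
    """Loss for a pair of predicted depths with an ordinal label.

    label = +1 if point i should be farther than point j,
            -1 if point i should be closer,
             0 if the two points are at roughly equal depth.
    """
    diff = depth_i - depth_j
    if label == 0:
        return diff ** 2                         # equal pairs: penalize any gap
    return np.log(1.0 + np.exp(-label * diff))   # ordered pairs: logistic loss
```

The loss shrinks when the predicted ordering agrees with the label, so a network trained on many such pairs learns a globally consistent relative-depth ranking.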
This paper proposes a deep architecture for saliency detection by fusing pixel-level and superpixel-level predictions. Different from previous methods that either make dense pixel-level predictions with complex networks or region-level predictions for each region with fully-connected layers, this paper investigates an elegant route to make two-level predictions based on the same simple fully convolutional...
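The two-level fusion idea can be sketched as broadcasting each region's score back to its pixels and blending the result with the dense map. The linear blend and parameter names below are illustrative assumptions, not the paper's learned fusion scheme.

```python
import numpy as np

def fuse_saliency(pixel_pred, superpixel_labels, superpixel_pred, alpha=0.5):
    """Blend a dense pixel-level saliency map with a region-level one.

    pixel_pred        : HxW float saliency from the pixel branch
    superpixel_labels : HxW int map assigning each pixel a region id
    superpixel_pred   : 1D float saliency score per region
    alpha             : mixing weight for the pixel branch
    """
    region_map = superpixel_pred[superpixel_labels]  # broadcast region scores to pixels
    return alpha * pixel_pred + (1.0 - alpha) * region_map
```

The pixel branch preserves fine boundaries while the region branch suppresses isolated noisy responses, which is the usual motivation for combining the two.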
We describe an object replacement approach whereby privacy-sensitive objects in videos are replaced by abstract cartoons taken from clip art. Our approach uses a combination of computer vision, deep learning, and image processing techniques to detect objects, abstract details, and replace them with cartoon clip art. We conducted a user study (N=85) to discern the utility and effectiveness of our cartoon...
In this paper we introduce a novel multimodal boosting based solution for semantic segmentation of traffic scenarios. Local structure and context are captured from both monocular color and depth modalities in the form of image channels. We define multiple channel types at three different levels: low, intermediate and high order channels. The low order channels are computed using a multimodal multiresolution...
The perceptual image of a product plays a significant role in decision making when users choose among products whose basic functions are now homogeneous. Designers try to design products that meet the diverse demands of users. However, a large gap between designers and users exists owing to the subjectivity of designers' experience. An objective model to recognize the perceptual image of products is proposed...
We propose a stereo vision based obstacle detection and scene segmentation algorithm appropriate for autonomous vehicles. Our algorithm is based on an innovative extension of the Stixel world that avoids computing a disparity map. Ground plane and stixel distance estimation is improved by exploiting an online learned color model. Furthermore, stixel height estimation is improved by an innovative...
Nowadays, the “semantic gap” problem has greatly limited the development of image classification. The key to this problem is obtaining semantic information from the images. A semantic image feature extraction method that integrates eye movement information is proposed in this paper. Firstly, the low-level visual features of images are extracted. Secondly, weighted feature vectors of images are constructed...
Key frame extraction helps us produce a usable summary of a video. After studying a variety of diverse key frame extraction methods, we present a comparative analysis of the methods based on their important features and results. If we want to present an entire video within a short interval of time, a video summary is the best alternative. This has become a very essential work...
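One classic family of methods covered by such comparative studies selects a new key frame whenever the frame-to-frame histogram difference exceeds a threshold. The sketch below assumes grayscale frames; the bin count, threshold, and function name are illustrative choices, not taken from any specific surveyed method.

```python
import numpy as np

def select_key_frames(frames, threshold):
    """Pick key frames where the gray-level histogram changes sharply.

    frames    : list of 2D uint8 arrays (grayscale video frames)
    threshold : minimum histogram L1 distance that triggers a new key frame
    """
    keys = [0]  # the first frame is always a key frame
    prev_hist, _ = np.histogram(frames[0], bins=16, range=(0, 256))
    for i, frame in enumerate(frames[1:], start=1):
        hist, _ = np.histogram(frame, bins=16, range=(0, 256))
        if np.abs(hist - prev_hist).sum() > threshold:
            keys.append(i)
        prev_hist = hist
    return keys
```

Histogram-based selection is cheap and insensitive to small object motion, but it can miss gradual transitions, which is exactly the kind of trade-off a comparative analysis would examine.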
We investigate the task of recognizing objects of daily use in human environments purely based on object descriptions given in natural language. In particular, we present an approach to transform phrases stated in natural language that describe such objects by their visual appearance into formal, semantic representations of their perceptual characteristics, which in turn can be used in a robot perception...
We investigate the effect of word typicality — the degree of membership of a word to its superordinate category — on the N400 event-related potential (ERP) using a single-trial detection approach based on spatiotemporal beamforming. Unlike most previous studies, which use mostly concrete categories (imaginable objects), we considered a total of 6 basic categories: three abstract and unimaginable...
The movie domain is one of the most common scenarios to test and evaluate recommender systems. These systems are often implemented through a collaborative filtering model, which relies exclusively on the user's feedback on items, ignoring content features. Content-based filtering models are nevertheless a potentially good strategy for recommendation, even though identifying relevant semantic representation...
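A minimal content-based baseline of the kind the abstract contrasts with collaborative filtering ranks items by cosine similarity between each item's feature vector and a user profile built from the user's liked items. The feature representation and names below are illustrative assumptions, not the semantic representation the work itself investigates.

```python
import numpy as np

def recommend(user_profile, item_features, top_k=3):
    """Rank items by cosine similarity to a user's content profile.

    user_profile  : 1D float vector (e.g. averaged features of liked movies)
    item_features : 2D array, one feature vector per candidate item
    top_k         : number of item indices to return, best first
    """
    norms = np.linalg.norm(item_features, axis=1) * np.linalg.norm(user_profile)
    scores = item_features @ user_profile / np.maximum(norms, 1e-12)
    return np.argsort(-scores)[:top_k]
```

Because the ranking uses only item content, such a model can score items no user has rated yet, which is the usual argument for content-based filtering despite the difficulty of finding good semantic representations.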
Image caption generation has become a rising topic in computer vision and artificial intelligence. In order to avoid stiff, formulaic descriptions, we intend to extract richer features using a convolutional neural network (CNN). A neural, probabilistic framework is consequently proposed which combines a CNN with a special form of recurrent neural network (RNN) to produce an end-to-end image...
Video segmentation is the basis of video analysis and mining, and its results directly affect the accuracy of follow-up processing. Predecessors have done a lot of research on video segmentation: very good results have been achieved for abrupt shot transitions (cuts), but there is still no good method for gradual transitions. Because football is a popular sport, it has very practical significance for the process...