Search results

Items from 41 to 60 out of 2,693 results

chapter

Fast Multi-image Matching via Density-Based Clustering

Roberto Tron, Xiaowei Zhou, Carlos Esteves, Kostas Daniilidis

2017 IEEE International Conference on Computer Vision (ICCV) > 4077 - 4086

2017 IEEE International Conference on Computer Vision (ICCV)

We consider the problem of finding consistent matches across multiple images. Current state-of-the-art solutions use constraints on cycles of matches together with convex optimization, leading to computationally intensive iterative algorithms. In this paper, we instead propose a clustering-based formulation: we first rigorously show its equivalence with traditional approaches, and then propose QuickMatch,...

chapter

Deep Direct Regression for Multi-oriented Scene Text Detection

Wenhao He, Xu-Yao Zhang, Fei Yin, Cheng-Lin Liu

2017 IEEE International Conference on Computer Vision (ICCV) > 745 - 753

2017 IEEE International Conference on Computer Vision (ICCV)

In this paper, we first provide a new perspective to divide existing high performance object detection methods into direct and indirect regressions. Direct regression performs boundary regression by predicting the offsets from a given point, while indirect regression predicts the offsets from some bounding box proposals. In the context of multioriented scene text detection, we analyze the drawbacks...

chapter

Stereo matching algorithm by hill-climbing segmentation

Tin Tin San, Nu War

2017 IEEE 6th Global Conference on Consumer Electronics (GCCE) > 1 - 2

2017 IEEE 6th Global Conference on Consumer Electronics (GCCE)

Stereo matching is important in the area of computer vision and photogrammetry. We present a stereo matching algorithm to refine depth map by using stereo image pair. The reference image is segmented by using hill-climbing algorithm and Scale Invariant Feature Transform (SIFT) feature descriptor with Sum of Absolute Difference (SAD) local stereo matching is performed. Next, we extract a set of disparity...

chapter

A Mobile Application for Plant Recognition through Deep Learning

Min Gao, Yang Lin, Richard O. Sinnott

2017 IEEE 13th International Conference on e-Science (e-Science) > 29 - 38

2017 IEEE 13th International Conference on e-Science (e-Science)

It is a simple task for humans to visually identify objects. However, computer-based image recognition remains challenging. In this paper we describe an approach for image recognition with specific focus on automated recognition of plants and flowers. The approach taken utilizes deep learning capabilities and unlike other approaches that focus on static images for feature classification, we utilize...

chapter

Person re-identification from CCTV silhouettes using generic fourier descriptor

Rawabi Alsedais, Richard Guest

2017 International Carnahan Conference on Security Technology (ICCST) > 1 - 6

2017 International Carnahan Conference on Security Technology (ICCST)

Person re-identification in public areas (such as airports, train stations and shopping malls) has recently received increased attention within computer vision research due, in part, to the demand for enhanced levels of security. Re-identifying subjects within non-overlapped camera networks can be considered as a challenging task. Illumination changes in different scenes, variations in camera resolutions,...

article

Extended Locality-Constrained Linear Self-Coding for Saliency Detection

Chunlei Yang, Jiexin Pu, Guo-Sen Xie, Yongsheng Dong, more

IEEE Signal Processing Letters > 2017 > 24 > 10 > 1458 - 1462

In complex scenes, foreground saliency can hardly be detected completely, which may further result in the ambiguous cues of objects for other computer vision tasks. In this letter, an extended locality-constrained linear self-coding (eLLsC) scheme is proposed to assist to solve the saliency detection problem under the complex scenes. The locality of both spatial relation and feature distance is preserved...

chapter

HHA-based CNN image features for indoor loop closure detection

Wei Zhang, Guoliang Liu, Guohui Tian

2017 Chinese Automation Congress (CAC) > 4634 - 4639

2017 Chinese Automation Congress (CAC)

Loop closure detection is an important part of visual simultaneous location and mapping (SLAM) system. Most of traditional loop closure detection approaches using hand-crafted features often lack robustness with respect to object occlusions and illumination changes, especially for the complicated indoor environment. Recently, convolutional neural network (CNN) makes a huge impact on many computer...

chapter

Generating High-Quality Crowd Density Maps Using Contextual Pyramid CNNs

Vishwanath A. Sindagi, Vishal M. Patel

2017 IEEE International Conference on Computer Vision (ICCV) > 1879 - 1888

2017 IEEE International Conference on Computer Vision (ICCV)

We present a novel method called Contextual Pyramid CNN (CP-CNN) for generating high-quality crowd density and count estimation by explicitly incorporating global and local contextual information of crowd images. The proposed CP-CNN consists of four modules: Global Context Estimator (GCE), Local Context Estimator (LCE), Density Map Estimator (DME) and a Fusion-CNN (F-CNN). GCE is a VGG-16 based CNN...

chapter

Infinite Latent Feature Selection: A Probabilistic Latent Graph-Based Ranking Approach

Giorgio Roffo, Simone Melzi, Umberto Castellani, Alessandro Vinciarelli

2017 IEEE International Conference on Computer Vision (ICCV) > 1407 - 1415

2017 IEEE International Conference on Computer Vision (ICCV)

Feature selection is playing an increasingly significant role with respect to many computer vision applications spanning from object recognition to visual object tracking. However, most of the recent solutions in feature selection are not robust across different and heterogeneous set of data. In this paper, we address this issue proposing a robust probabilistic latent graph-based feature selection...

chapter

Unsupervised Action Discovery and Localization in Videos

Khurram Soomro, Mubarak Shah

2017 IEEE International Conference on Computer Vision (ICCV) > 696 - 705

2017 IEEE International Conference on Computer Vision (ICCV)

This paper is the first to address the problem of unsupervised action localization in videos. Given unlabeled data without bounding box annotations, we propose a novel approach that: 1) Discovers action class labels and 2) Spatio-temporally localizes actions in videos. It begins by computing local video features to apply spectral clustering on a set of unlabeled training videos. For each cluster of...

chapter

The application of deep learning in computer vision

Qing Wu, Yungang Liu, Qiang Li, Shaoli Jin, more

2017 Chinese Automation Congress (CAC) > 6522 - 6527

2017 Chinese Automation Congress (CAC)

As the deep learning exhibits strong advantages in the feature extraction, it has been widely used in the field of computer vision and among others, and gradually replaced traditional machine learning algorithms. This paper first reviews the main ideas of deep learning, and displays several related frequently-used algorithms for computer vision. Afterwards, the current research status of computer...

chapter

Study of object detection based on Faster R-CNN

Bin Liu, Wencang Zhao, Qiaoqiao Sun

2017 Chinese Automation Congress (CAC) > 6233 - 6236

2017 Chinese Automation Congress (CAC)

Faster R-CNN (R corresponds to “Region”) which combined the RPN network and the Fast R-CNN network is one of the best ways to object detection of R-CNN series based on deep learning. The proposal obtained by RPN is directly connected to the ROI Pooling layer, which is a framework for CNN to achieve end-to-end object detection. The feasibility of Faster R-CNN implementation of ResNet101 network and...

chapter

Self-Organized Text Detection with Minimal Post-processing via Border Learning

Yue Wu, Prem Natarajan

2017 IEEE International Conference on Computer Vision (ICCV) > 5010 - 5019

2017 IEEE International Conference on Computer Vision (ICCV)

In this paper we propose a new solution to the text detection problem via border learning. Specifically, we make four major contributions: 1) We analyze the insufficiencies of the classic non-text and text settings for text detection. 2) We introduce the border class to the text detection problem for the first time, and validate that the decoding process is largely simplified with the help of text...

chapter

Deep Scene Image Classification with the MFAFVNet

Yunsheng Li, Mandar Dixit, Nuno Vasconcelos

2017 IEEE International Conference on Computer Vision (ICCV) > 5757 - 5765

2017 IEEE International Conference on Computer Vision (ICCV)

The problem of transferring a deep convolutional network trained for object recognition to the task of scene image classification is considered. An embedded implementation of the recently proposed mixture of factor analyzers Fisher vector (MFA-FV) is proposed. This enables the design of a network architecture, the MFAFVNet, that can be trained in an end to end manner. The new architecture involves...

chapter

DeepFuse: A Deep Unsupervised Approach for Exposure Fusion with Extreme Exposure Image Pairs

K. Ram Prabhakar, V Sai Srikar, R. Venkatesh Babu

2017 IEEE International Conference on Computer Vision (ICCV) > 4724 - 4732

2017 IEEE International Conference on Computer Vision (ICCV)

We present a novel deep learning architecture for fusing static multi-exposure images. Current multi-exposure fusion (MEF) approaches use hand-crafted features to fuse input sequence. However, the weak hand-crafted representations are not robust to varying input conditions. Moreover, they perform poorly for extreme exposure image pairs. Thus, it is highly desirable to have a method that is robust...

chapter

Towards End-to-End Text Spotting with Convolutional Recurrent Neural Networks

Hui Li, Peng Wang, Chunhua Shen

2017 IEEE International Conference on Computer Vision (ICCV) > 5248 - 5256

2017 IEEE International Conference on Computer Vision (ICCV)

In this work, we jointly address the problem of text detection and recognition in natural scene images based on convolutional recurrent neural networks. We propose a unified network that simultaneously localizes and recognizes text with a single forward pass, avoiding intermediate processes, such as image cropping, feature re-calculation, word separation, and character grouping. In contrast to existing...

chapter

Visual Transformation Aided Contrastive Learning for Video-Based Kinship Verification

Hamdi Dibeklioglu

2017 IEEE International Conference on Computer Vision (ICCV) > 2478 - 2487

2017 IEEE International Conference on Computer Vision (ICCV)

Automatic kinship verification from facial information is a relatively new and open research problem in computer vision. This paper explores the possibility of learning an efficient facial representation for video-based kinship verification by exploiting the visual transformation between facial appearance of kin pairs. To this end, a Siamese-like coupled convolutional encoder-decoder network is proposed...

chapter

HydraPlus-Net: Attentive Deep Features for Pedestrian Analysis

Xihui Liu, Haiyu Zhao, Maoqing Tian, Lu Sheng, more

2017 IEEE International Conference on Computer Vision (ICCV) > 350 - 359

2017 IEEE International Conference on Computer Vision (ICCV)

Pedestrian analysis plays a vital role in intelligent video surveillance and is a key component for security-centric computer vision systems. Despite that the convolutional neural networks are remarkable in learning discriminative features from images, the learning of comprehensive features of pedestrians for fine-grained tasks remains an open problem. In this study, we propose a new attentionbased...

chapter

RankIQA: Learning from Rankings for No-Reference Image Quality Assessment

Xialei Liu, Joost van de Weijer, Andrew D. Bagdanov

2017 IEEE International Conference on Computer Vision (ICCV) > 1040 - 1049

2017 IEEE International Conference on Computer Vision (ICCV)

We propose a no-reference image quality assessment (NR-IQA) approach that learns from rankings (RankIQA). To address the problem of limited IQA dataset size, we train a Siamese Network to rank images in terms of image quality by using synthetically generated distortions for which relative image quality is known. These ranked image sets can be automatically generated without laborious human labeling...

chapter

DualNet: Learn Complementary Features for Image Recognition

Saihui Hou, Xu Liu, Zilei Wang

2017 IEEE International Conference on Computer Vision (ICCV) > 502 - 510

2017 IEEE International Conference on Computer Vision (ICCV)

In this work we propose a novel framework named Dual-Net aiming at learning more accurate representation for image recognition. Here two parallel neural networks are coordinated to learn complementary features and thus a wider network is constructed. Specifically, we logically divide an end-to-end deep convolutional neural network into two functional parts, i.e., feature extractor and image classifier...

Data set:
ieee
Keywords:
FEATURE EXTRACTION
COMPUTER VISION

Publication date

Set your own date range

Content availability

Available (2,660)
None (33)

Publication type

book (2,386)
article (307)

Keywords

CAMERAS (445)
IMAGE SEGMENTATION (419)
TRAINING (374)
OBJECT DETECTION (353)
IMAGE COLOR ANALYSIS (348)
PIXEL (328)
VISUALIZATION (312)
DATA MINING (294)
SHAPE (286)
PATTERN RECOGNITION (281)
IMAGE EDGE DETECTION (280)
IMAGE MOTION ANALYSIS (270)
ROBUSTNESS (270)
IMAGE CLASSIFICATION (266)
HISTOGRAMS (258)
IMAGE MATCHING (255)
COMPUTATIONAL MODELING (238)
SUPPORT VECTOR MACHINES (237)
OBJECT RECOGNITION (235)
IMAGE RECOGNITION (234)
DETECTORS (231)
IMAGE PROCESSING (221)
FACE RECOGNITION (205)
IMAGE SEQUENCES (197)
ACCURACY (190)
CONFERENCES (187)
MACHINE VISION (184)
HUMANS (173)
TRANSFORMS (161)
ALGORITHM DESIGN AND ANALYSIS (160)
CLASSIFICATION ALGORITHMS (158)
DATABASES (158)
TRACKING (151)
FACE (149)
ESTIMATION (148)
OPTICAL IMAGING (141)
IMAGE COLOUR ANALYSIS (134)
IMAGE RECONSTRUCTION (133)
EDGE DETECTION (131)
IMAGE REPRESENTATION (131)
LIGHTING (129)
THREE DIMENSIONAL DISPLAYS (124)
VIDEO SIGNAL PROCESSING (124)
ARTIFICIAL NEURAL NETWORKS (120)
NOISE (119)
IMAGE RESOLUTION (116)
PRINCIPAL COMPONENT ANALYSIS (116)
MATHEMATICAL MODEL (112)
KERNEL (108)
LEARNING (ARTIFICIAL INTELLIGENCE) (108)
MACHINE LEARNING (107)
IMAGE TEXTURE (106)
VEHICLES (106)
SIGNAL PROCESSING (105)
TARGET TRACKING (105)
VECTORS (102)
HIDDEN MARKOV MODELS (100)
IMAGE RETRIEVAL (100)
EQUATIONS (99)
REAL TIME SYSTEMS (85)
TESTING (84)
COMPUTERS (83)
CORRELATION (82)
SIGNAL PROCESSING ALGORITHMS (82)
STEREO IMAGE PROCESSING (82)
GESTURE RECOGNITION (77)
SIFT (74)
IMAGE ANALYSIS (73)
IMAGE REGISTRATION (72)
ROADS (72)
SURVEILLANCE (72)
WAVELET TRANSFORMS (72)
EDUCATIONAL INSTITUTIONS (70)
TRAJECTORY (70)
GEOMETRY (69)
VIDEO SEQUENCES (69)
OPTICAL FLOW (66)
POSE ESTIMATION (66)
CALIBRATION (65)
INSPECTION (65)
SUPPORT VECTOR MACHINE (65)
VIDEOS (65)
FACE DETECTION (64)
SOLID MODELING (64)
IMAGE CODING (63)
NEURAL NETS (62)
NEURAL NETWORKS (62)
TRAFFIC ENGINEERING COMPUTING (62)
MOTION ESTIMATION (61)
REAL-TIME SYSTEMS (61)
VIDEO SURVEILLANCE (61)
OBJECT TRACKING (59)
THREE-DIMENSIONAL DISPLAYS (58)
MONITORING (57)
STREAMING MEDIA (57)
CLUSTERING ALGORITHMS (56)
GABOR FILTERS (55)
OPTIMIZATION (55)
more

INFONA - science communication portal

Search results

Fast Multi-image Matching via Density-Based Clustering

Deep Direct Regression for Multi-oriented Scene Text Detection

Stereo matching algorithm by hill-climbing segmentation

A Mobile Application for Plant Recognition through Deep Learning

Person re-identification from CCTV silhouettes using generic fourier descriptor

Extended Locality-Constrained Linear Self-Coding for Saliency Detection

HHA-based CNN image features for indoor loop closure detection

Generating High-Quality Crowd Density Maps Using Contextual Pyramid CNNs

Infinite Latent Feature Selection: A Probabilistic Latent Graph-Based Ranking Approach

Unsupervised Action Discovery and Localization in Videos

The application of deep learning in computer vision

Study of object detection based on Faster R-CNN

Self-Organized Text Detection with Minimal Post-processing via Border Learning

Deep Scene Image Classification with the MFAFVNet

DeepFuse: A Deep Unsupervised Approach for Exposure Fusion with Extreme Exposure Image Pairs

Towards End-to-End Text Spotting with Convolutional Recurrent Neural Networks

Visual Transformation Aided Contrastive Learning for Video-Based Kinship Verification

HydraPlus-Net: Attentive Deep Features for Pedestrian Analysis

RankIQA: Learning from Rankings for No-Reference Image Quality Assessment

DualNet: Learn Complementary Features for Image Recognition

Filter options

Publication date

Content availability

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options