Search results

Items from 1 to 20 out of 235 results

chapter

Patched-based deep Boltzmann shape priors for visual tracking

Sanghoon Lee, Ilhong Shin, Eunjun Rhee, Sunghee Lee, more

2017 International Conference on Information and Communication Technology Convergence (ICTC) > 1109 - 1111

2017 International Conference on Information and Communication Technology Convergence (ICTC)

In this paper, we propose a patched-based deep Boltzmann shape priors for visual tracking. The shape priors are generated from deep Boltzmann machine network. The network consists of three layers of hidden and visible units. The generated shapes not only maintain general shapes from a variety of poses, but also entail local modifications with high probability.

chapter

Research on target detection and tracking system of rescue robot

Xiaoyan Lu, Dan Li

2017 Chinese Automation Congress (CAC) > 6623 - 6627

2017 Chinese Automation Congress (CAC)

This target detection and tracking system is the basis for rescue robots to achieve their independent search and rescue operations. In order to improve their mobile performance and sensing capability, the Kinect camera is employed by rescue robots to obtain environmental visual. The AKAZE(Accelerated-KAZE) feature matching algorithm is adopted to achieve target detection in video frames, combining...

chapter

Robust real-time visual tracking by using particle filter with sampling multiple importance resampling

Cheng-Ming Huang, Bo-Wei Jiang

2017 56th Annual Conference of the Society of Instrument and Control Engineers of Japan (SICE) > 1050 - 1051

2017 56th Annual Conference of the Society of Instrument and Control Engineers of Japan (SICE)

In this paper, a robust visual tracking system by utilizing the images acquired from a color camera and a thermal camera is proposed to track the target with real-time performance. The thermal camera, which can observe the heat originated from the target such as the human body or vehicle, can collaborate with the color camera to track the target in the cluttered environment or under occlusion. Unlike...

chapter

Depth image super resolution via multi-hypothesis estimation

Muzaffer Aslan, Abdulkadir Sengur

2017 International Artificial Intelligence and Data Processing Symposium (IDAP) > 1 - 4

2017 International Artificial Intelligence and Data Processing Symposium (IDAP)

The rapid development of three-dimensional (3D) imaging techniques has significantly increased the demand for high resolution (HR) depth video and images. Significant pixel deficiencies and too much noise can be seen in depth images especially taken from Kinect cameras. For this reason, usability in several computer vision applications is restricted. In the acquisition of HR depth images, in traditional...

chapter

Saliency detection for RGBD image using optimization

Zhengchao Lei, Weiyan Chai, Sanyuan Zhao, Hongmei Song, more

2017 12th International Conference on Computer Science and Education (ICCSE) > 440 - 443

2017 12th International Conference on Computer Science and Education (ICCSE)

Saliency detection in images attracts much research attention for its usage in numerous multimedia applications. In this paper, we propose a saliency detection method based on optimization for RGBD images. With RGBD images, our method utilizes the depth channel to enhance the identification of background and foreground regions. We firstly generate new depth image by using non-linear transformation...

chapter

Predicting Salient Face in Multiple-Face Videos

Yufan Liu, Songyang Zhang, Mai Xu, Xuming He

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3224 - 3232

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Although the recent success of convolutional neural network (CNN) advances state-of-the-art saliency prediction in static images, few work has addressed the problem of predicting attention in videos. On the other hand, we find that the attention of different subjects consistently focuses on a single face in each frame of videos involving multiple faces. Therefore, we propose in this paper a novel...

chapter

The More You Know: Using Knowledge Graphs for Image Classification

Kenneth Marino, Ruslan Salakhutdinov, Abhinav Gupta

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 20 - 28

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

One characteristic that sets humans apart from modern learning-based computer vision algorithms is the ability to acquire knowledge about the world and use that knowledge to reason about the visual world. Humans can learn about the characteristics of objects and the relationships that occur between them to learn a large variety of visual concepts, often with few examples. This paper investigates the...

chapter

Online Asymmetric Similarity Learning for Cross-Modal Retrieval

Yiling Wu, Shuhui Wang, Qingming Huang

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3984 - 3993

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Cross-modal retrieval has attracted intensive attention in recent years. Measuring the semantic similarity between heterogeneous data objects is an essential yet challenging problem in cross-modal retrieval. In this paper, we propose an online learning method to learn the similarity function between heterogeneous modalities by preserving the relative similarity in the training data, which is modeled...

chapter

Attend to You: Personalized Image Captioning with Context Sequence Memory Networks

Cesc Chunseong Park, Byeongchang Kim, Gunhee Kim

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6432 - 6440

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We address personalization issues of image captioning, which have not been discussed yet in previous research. For a query image, we aim to generate a descriptive sentence, accounting for prior knowledge such as the users active vocabularies in previous documents. As applications of personalized image captioning, we tackle two post automation tasks: hashtag prediction and post generation, on our newly...

chapter

Adaptive and Move Making Auxiliary Cuts for Binary Pairwise Energies

Lena Gorelick, Yuri Boykov, Olga Veksler

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6062 - 6070

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Many computer vision problems require optimization of binary non-submodular energies. In this context, iterative submodularization techniques based on trust region (LSA-TR) and auxiliary functions (LSA-AUX) have been recently proposed [9]. They achieve state-of-the-art-results on a number of computer vision applications. In this paper we extend the LSA-AUX framework in two directions. First, unlike...

chapter

Exclusivity-Consistency Regularized Multi-view Subspace Clustering

Xiaobo Wang, Xiaojie Guo, Zhen Lei, Changqing Zhang, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1 - 9

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Multi-view subspace clustering aims to partition a set of multi-source data into their underlying groups. To boost the performance of multi-view clustering, numerous subspace learning algorithms have been developed in recent years, but with rare exploitation of the representation complementarity between different views as well as the indicator consistency among the representations, let alone considering...

chapter

A change detection method based on cosegmentation

Zhenlei Xie, Ruoming Shi, Ling Zhu, Shu Peng, more

2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS) > 1954 - 1957

IGARSS 2017 - 2017 IEEE International Geoscience and Remote Sensing Symposium

A method based on cosegmentation is applied to change detection to segment image patches belonging to each image. The image patches have the characteristics of spatial correspondence in multi-temporal images and precise boundary in its own image. By construction and optimization of energy function that consists of change feature item and image feature item, both of spectrum and shape change can successfully...

chapter

Scribbler: Controlling Deep Image Synthesis with Sketch and Color

Patsorn Sangkloy, Jingwan Lu, Chen Fang, Fisher Yu, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6836 - 6845

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Several recent works have used deep convolutional networks to generate realistic imagery. These methods sidestep the traditional computer graphics rendering pipeline and instead generate imagery at the pixel level by learning from large collections of photos (e.g. faces or bedrooms). However, these methods are of limited utility because it is difficult for a user to control what the network produces...

chapter

ResNet-Based Vehicle Classification and Localization in Traffic Surveillance Systems

Heechul Jung, Min-Kook Choi, Jihun Jung, Jin-Hee Lee, more

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 934 - 940

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

In this paper, we present ResNet-based vehicle classification and localization methods using real traffic surveillance recordings. We utilize a MIOvision traffic dataset, which comprises 11 categories including a variety of vehicles, such as bicycle, bus, car, motorcycle, and so on. To improve the classification performance, we exploit a technique called joint fine-tuning (JF). In addition, we propose...

chapter

Signal Classification in Quotient Spaces via Globally Optimal Variational Calculus

Gregory S. Chirikjian

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 735 - 743

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

A ubiquitous problem in pattern recognition is that of matching an observed time-evolving pattern (or signal) to a gold standard in order to recognize or characterize the meaning of a dynamic phenomenon. Examples include matching sequences of images in two videos, matching audio signals in speech recognition, or matching framed trajectories in robot action recognition. This paper shows that all of...

chapter

Deceiving Google’s Cloud Video Intelligence API Built for Summarizing Videos

Hossein Hosseini, Baicen Xiao, Radha Poovendran

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 1305 - 1309

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Despite the rapid progress of the techniques for image classification, video annotation has remained a challenging task. Automated video annotation would be a breakthrough technology, enabling users to search within the videos. Recently, Google introduced the Cloud Video Intelligence API for video analysis. As per the website, the system can be used to "separate signal from noise, by retrieving...

chapter

Learning Dynamic GMM for Attention Distribution on Single-Face Videos

Yun Ren, Zulin Wang, Mai Xu, Haoyu Dong, more

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 1632 - 1641

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

The past decade has witnessed the popularity of video conferencing, such as FaceTime and Skype. In video conferencing, almost every frame has a human face. Hence, it is necessary to predict attention on face videos by saliency detection, as saliency can be used as a guidance of regionof- interest (ROI) for the content-based applications. To this end, this paper proposes a novel approach for saliency...

chapter

Object-Specific Style Transfer Based on Feature Map Selection Using CNNs

Ayumu Shinya, Nguyen Duc Tung, Tomohiro Harada, Ruck Thawonmas

2017 Nicograph International (NicoInt) > 88

2017 Nicograph International (NicoInt)

We propose a method for transferring an arbitrary style to only a specific object in an image. Style transfer is the process of combining the content of an image and the style of another image into a new image. Our results show that the proposed method can realize style transfer to specific object.

chapter

Integrating a Priori Probabilistic Knowledge into Classification for Image Description

Andrea Apicella, Anna Corazza, Francesco Isgro, Giuseppe Vettigli

2017 IEEE 26th International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (WETICE) > 197 - 199

2017 IEEE 26th International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (WETICE)

This paper discusses a possible implementation of the integration of knowledge from a probabilistic ontology in the automatic description of images. This combination not only provides the relations existing between the different segments, but also improve the classification accuracy, as the context often gives cues suggesting the correct class of the segment.

chapter

Phasic maximal and local maximal occurrence representation for video-based person re-identification

Gang Liu, Chang Tian, Ze-Min Wu

2017 IEEE 9th International Conference on Communication Software and Networks (ICCSN) > 1187 - 1190

2017 IEEE 9th International Conference on Communication Software and Networks (ICCSN)

This paper proposes a new spatio-temporal appearance feature named Phasic Maximal and Local Maximal Occurrence (PM-LOMO) representation for video-based person re-identification. To perform temporal alignment of the sequence, we selected the optimal period of walking cycle and divide frames into several phases based on the extreme points of the sequence's Flow Energy Profile (FEP). To describe the...

Content availability:
Available
Data set:
ieee
Keywords:
CONFERENCES
COMPUTER VISION
PATTERN RECOGNITION

Publication date

Set your own date range

Publication type

book (234)
article (1)

Keywords

FEATURE EXTRACTION (82)
SIGNAL PROCESSING (69)
COMPUTATIONAL MODELING (64)
IMAGE COLOR ANALYSIS (49)
CAMERAS (43)
IMAGE SEGMENTATION (43)
COMPUTERS (42)
IMAGE PROCESSING (41)
ROBUSTNESS (41)
EQUATIONS (40)
SIGNAL PROCESSING ALGORITHMS (39)
MATHEMATICAL MODEL (37)
TRAINING (37)
ALGORITHM DESIGN AND ANALYSIS (36)
IMAGE EDGE DETECTION (36)
ACCURACY (35)
SHAPE (35)
ESTIMATION (33)
IMAGE RECOGNITION (33)
NOISE (33)
EDUCATIONAL INSTITUTIONS (32)
LIGHTING (30)
TRANSFORMS (30)
COMPLEXITY THEORY (29)
DATA MINING (28)
DATABASES (27)
ANALYTICAL MODELS (26)
ARTIFICIAL NEURAL NETWORKS (26)
OBJECT RECOGNITION (26)
VISUALIZATION (26)
IMAGE RESOLUTION (25)
FACE RECOGNITION (23)
OBJECT DETECTION (23)
REAL TIME SYSTEMS (23)
INDEXES (21)
SUPPORT VECTOR MACHINES (21)
CLASSIFICATION ALGORITHMS (20)
GEOMETRY (20)
OPTIMIZATION (20)
TESTING (20)
PRESSES (19)
TRACKING (19)
GRAPHICS (18)
PATTERN ANALYSIS (18)
CORRELATION (17)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (17)
MULTIMEDIA COMMUNICATION (17)
OPTICAL IMAGING (17)
PRINCIPAL COMPONENT ANALYSIS (17)
SURVEILLANCE (17)
AUTOMATION (16)
HISTOGRAMS (16)
IMAGE RECONSTRUCTION (16)
IMAGING (16)
LABORATORIES (16)
MACHINE INTELLIGENCE (16)
MACHINE LEARNING (16)
SOLID MODELING (16)
BRIGHTNESS (15)
IMAGE CLASSIFICATION (15)
IMAGE CODING (15)
SOFTWARE (15)
STREAMING MEDIA (15)
WAVELET TRANSFORMS (15)
ARTIFICIAL INTELLIGENCE (14)
COMPUTER SCIENCE (14)
FILTERING (14)
IMAGE ANALYSIS (14)
TARGET TRACKING (14)
USA COUNCILS (14)
VIDEO SEQUENCES (14)
ADAPTATION MODEL (13)
APPROXIMATION ALGORITHMS (13)
CLUSTERING ALGORITHMS (13)
DETECTORS (13)
ELECTRONIC MAIL (13)
HELIUM (13)
IMAGE MOTION ANALYSIS (13)
IMAGE SEQUENCES (13)
STEREO VISION (13)
VECTORS (13)
BIOLOGICAL SYSTEM MODELING (12)
IMAGE RETRIEVAL (12)
PROCEEDINGS OF THE IEEE (12)
SUPPORT VECTOR MACHINE CLASSIFICATION (12)
BIOMEDICAL IMAGING (11)
COMPUTER SOCIETY (11)
CYBERNETICS (11)
DATA MODELS (11)
ENTROPY (11)
HIDDEN MARKOV MODELS (11)
MANGANESE (11)
MATERIALS (11)
REMOTE SENSING (11)
ROBOTS (11)
SURFACE TREATMENT (11)
VEHICLES (11)
more

INFONA - science communication portal

Search results

Patched-based deep Boltzmann shape priors for visual tracking

Research on target detection and tracking system of rescue robot

Robust real-time visual tracking by using particle filter with sampling multiple importance resampling

Depth image super resolution via multi-hypothesis estimation

Saliency detection for RGBD image using optimization

Predicting Salient Face in Multiple-Face Videos

The More You Know: Using Knowledge Graphs for Image Classification

Online Asymmetric Similarity Learning for Cross-Modal Retrieval

Attend to You: Personalized Image Captioning with Context Sequence Memory Networks

Adaptive and Move Making Auxiliary Cuts for Binary Pairwise Energies

Exclusivity-Consistency Regularized Multi-view Subspace Clustering

A change detection method based on cosegmentation

Scribbler: Controlling Deep Image Synthesis with Sketch and Color

ResNet-Based Vehicle Classification and Localization in Traffic Surveillance Systems

Signal Classification in Quotient Spaces via Globally Optimal Variational Calculus

Deceiving Google’s Cloud Video Intelligence API Built for Summarizing Videos

Learning Dynamic GMM for Attention Distribution on Single-Face Videos

Object-Specific Style Transfer Based on Feature Map Selection Using CNNs

Integrating a Priori Probabilistic Knowledge into Classification for Image Description

Phasic maximal and local maximal occurrence representation for video-based person re-identification

Filter options

Publication date

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options