Search results

chapter

Summarization of News Videos Considering the Consistency of Auditory and Visual Contents

Ichiro Ide, Ye Zhang, Ryunosuke Tanishige, Keisuke Doman, et al.

2017 IEEE International Symposium on Multimedia (ISM) > 193 - 199

Since news videos are valuable sources of multimedia information on real-world events, there is a demand for viewing them efficiently. However, summarization methods based on auditory contents do not take the visual contents into account. In the case of news videos, due to their presentation style where audio contents and visual contents do not necessarily come from the same...

chapter

Heterogeneous Image Stylization Using Neural Networks

Tong Bai, Gary Overett

2017 International Conference on Digital Image Computing: Techniques and Applications (DICTA) > 1 - 7

Automated image stylization to create artistically pleasing images from ordinary photographs is an interesting and useful task in computer vision. Therefore, several automated styling methods have been developed using powerful Deep Neural Network (DNN) features. They typically use a carefully constructed joint loss function to separately consider the similarities between a proposed output and the...

chapter

Line-based monocular graph SLAM

Dong Ruifang, Vincent Fremont, Simon Lacroix, Isabelle Fantoni, et al.

2017 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems (MFI) > 494 - 500

This paper presents a new line-based 6-DOF monocular algorithm that uses iSAM2, a point-based Graph SLAM approach. We extend iSAM2 to minimize the reprojection error of the line features to solve the line-based SLAM problem. A specific line representation is exploited that combines the Plücker Coordinates and the Cayley representation. The Plücker Coordinates are used for the 3D line projection...

chapter

A novel method for underwater image segmentation based on M-band wavelet transform and human psychovisual phenomenon (HVS)

Soumyadip Dhar, Hiranmoy Roy, Madhurima Majumder, Chitrita Biswas, et al.

2017 Third International Conference on Research in Computational Intelligence and Communication Networks (ICRCICN) > 21 - 25

Underwater image segmentation becomes a difficult and challenging task due to various perturbations present in the water. In this paper we propose a novel method for underwater image segmentation based on M-band wavelet transform and human psychovisual phenomenon (HVS). The M-band wavelet transform captures the texture of the underwater image by decomposing the image into sub bands with different scales...

chapter

Chromosome banding for karyotype based on fractional order derivative

K. B. Jayanthi, Nirmala Madian, P. Kiruthika

TENCON 2017 - 2017 IEEE Region 10 Conference > 2466 - 2471

Karyotyping helps to evaluate the size, shape and number of chromosomes. It is a screening and diagnostic process for finding various abnormalities related to chromosomes. The banding pattern is unique for each pair of chromosomes. In the present research, fractional derivatives find an important place in the field of signal processing and digital image processing. This paper proposes a method for segmenting...

chapter

[POSTER] Composite Realism: Effects of Object Knowledge and Mismatched Feature Type on Observer Gaze and Subjective Quality

Alan Dolhasz, Maite Frutos-Pascual, Ian Williams

2017 IEEE International Symposium on Mixed and Augmented Reality (ISMAR-Adjunct) > 9 - 14

We report on the results of the first visual search and rating study (N = 60) evaluating human gaze when assessing the realism of image composites. The effects of object identity knowledge and mismatched feature type on observers' gaze and subjective realism scores are studied. Gaze metrics used include: fixation count, fixation duration, time and duration of first fixation on target object, as well...

chapter

Satellite imagery features for the image similarity estimation

Y. I. Shedlovska, V. V. Hnatushenko, V. J. Kashtan

2017 IEEE International Young Scientists Forum on Applied Physics and Engineering (YSF) > 359 - 362

This paper investigates the features most appropriate for describing high-resolution satellite imagery. We developed an image description model based on the distribution of image object classes. The proposed model could be used for image similarity estimation.

chapter

A non-reference image area division based on deep learning

Yan Fu, Dong Yue

2017 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC) > 1 - 4

Because the contrast sensitivity of the human eye differs across image regions, it is particularly important to segment image regions accurately in image quality evaluation. Based on this, this paper presents a non-reference image region division method based on deep learning. Firstly, the Canny operator performs image edge detection at a low threshold to obtain the strong edge...

chapter

The Mapillary Vistas Dataset for Semantic Understanding of Street Scenes

Gerhard Neuhold, Tobias Ollmann, Samuel Rota Bulo, Peter Kontschieder

2017 IEEE International Conference on Computer Vision (ICCV) > 5000 - 5009

The Mapillary Vistas Dataset is a novel, large-scale street-level image dataset containing 25000 high-resolution images annotated into 66 object categories with additional, instance-specific labels for 37 classes. Annotation is performed in a dense and fine-grained style by using polygons for delineating individual objects. Our dataset is 5× larger than the total amount of fine annotations for Cityscapes...

chapter

A Stagewise Refinement Model for Detecting Salient Objects in Images

Tiantian Wang, Ali Borji, Lihe Zhang, Pingping Zhang, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 4039 - 4048

Deep convolutional neural networks (CNNs) have been successfully applied to a wide variety of problems in computer vision, including salient object detection. To detect and segment salient objects accurately, it is necessary to extract and combine high-level semantic features with low-level fine details simultaneously. This happens to be a challenge for CNNs as repeated subsampling operations such...

chapter

Recurrent Multimodal Interaction for Referring Image Segmentation

Chenxi Liu, Zhe Lin, Xiaohui Shen, Jimei Yang, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 1280 - 1289

In this paper we are interested in the problem of image segmentation given natural language descriptions, i.e. referring expressions. Existing works tackle this problem by first modeling images and sentences independently and then segmenting images by combining these two types of representations. We argue that learning word-to-image interaction is more native in the sense of jointly modeling two modalities...

chapter

HHA-based CNN image features for indoor loop closure detection

Wei Zhang, Guoliang Liu, Guohui Tian

2017 Chinese Automation Congress (CAC) > 4634 - 4639

Loop closure detection is an important part of a visual simultaneous localization and mapping (SLAM) system. Most traditional loop closure detection approaches, which use hand-crafted features, often lack robustness with respect to object occlusions and illumination changes, especially in complicated indoor environments. Recently, convolutional neural networks (CNNs) have made a huge impact on many computer...

chapter

Semi Supervised Semantic Segmentation Using Generative Adversarial Network

Nasim Souly, Concetto Spampinato, Mubarak Shah

2017 IEEE International Conference on Computer Vision (ICCV) > 5689 - 5697

Semantic segmentation has been a long standing challenging task in computer vision. It aims at assigning a label to each image pixel and needs a significant number of pixel-level annotated data, which is often unavailable. To address this lack of annotations, in this paper, we leverage, on one hand, a massive amount of available unlabeled or weakly labeled data, and on the other hand, non-real images...

chapter

Cascaded Feature Network for Semantic Segmentation of RGB-D Images

Di Lin, Guangyong Chen, Daniel Cohen-Or, Pheng-Ann Heng, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 1320 - 1328

Fully convolutional network (FCN) has been successfully applied in semantic segmentation of scenes represented with RGB images. Images augmented with depth channel provide more understanding of the geometric information of the scene in the image. The question is how to best exploit this additional information to improve the segmentation performance. In this paper, we present a neural network with...

chapter

Computer aided motile sperm counting

Hamza Osman Ilhan, Nizamettin Aydin

2017 Medical Technologies National Congress (TIPTEKNO) > 1 - 5

The rapid and irregular motion of semen cells makes the counting process of semen difficult in the visual assessment. Therefore, computer based techniques are necessary to evaluate the tests more accurately. In this paper, an alternative to the visual assessment technique in spermiogram tests is presented. Analyses are performed on the recorded microscope video images by computer, automatically...

chapter

Exploiting Spatial Structure for Localizing Manipulated Image Regions

Jawadul H. Bappy, Amit K. Roy-Chowdhury, Jason Bunk, Lakshmanan Nataraj, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 4980 - 4989

The advent of high-tech journaling tools facilitates an image to be manipulated in a way that can easily evade state-of-the-art image tampering detection approaches. The recent success of the deep learning approaches in different recognition tasks inspires us to develop a high confidence detection framework which can localize manipulated regions in an image. Unlike semantic object segmentation where...

chapter

Unsupervised Learning of Important Objects from First-Person Videos

Gedas Bertasius, Hyun Soo Park, Stella X. Yu, Jianbo Shi

2017 IEEE International Conference on Computer Vision (ICCV) > 1974 - 1982

A first-person camera, placed at a person's head, captures which objects are important to the camera wearer. Most prior methods for this task learn to detect such important objects from the manually labeled first-person data in a supervised fashion. However, important objects are strongly related to the camera wearer's internal state such as his intentions and attention, and thus, only the person...

chapter

VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation

Chuang Gan, Yandong Li, Haoxiang Li, Chen Sun, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 1829 - 1838

Rich and dense human labeled datasets are among the main enabling factors for the recent advance on vision-language understanding. Many seemingly distant annotations (e.g., semantic segmentation and visual question answering (VQA)) are inherently connected in that they reveal different levels and perspectives of human understandings about the same visual scenes, and even the same set of images (e...

chapter

Unsupervised Learning from Video to Detect Foreground Objects in Single Images

Ioana Croitoru, Simion-Vlad Bogolin, Marius Leordeanu

2017 IEEE International Conference on Computer Vision (ICCV) > 4345 - 4353

Unsupervised learning from visual data is one of the most difficult challenges in computer vision. It is essential for understanding how visual recognition works. Learning from unsupervised input has an immense practical value, as huge quantities of unlabeled videos can be collected at low cost. Here we address the task of unsupervised learning to detect and segment foreground objects in single images...

chapter

A system for online quality analysis for cherry harvest process inside the orchard

Yetzabel Gonzalez, Roberto Ahumada-Garcia, Patricia Moller-Acuna, Jose Antonio Reyes-Suarez

2017 CHILEAN Conference on Electrical, Electronics Engineering, Information and Communication Technologies (CHILECON) > 1 - 4

The quality control of cherries harvested in the orchard is a process of great relevance for the Chilean export industry. Nowadays companies carry out this process manually, obtaining a high error rate in the measurements of color and caliber of the fruits. This article seeks to develop a system to automate this process and thus reduce measurement failures. For this, an information system was implemented...

INFONA - science communication portal
