Wyniki wyszukiwania

rozdział

Summarization of News Videos Considering the Consistency of Auditory and Visual Contents

Ichiro Ide, Ye Zhang, Ryunosuke Tanishige, Keisuke Doman, więcej

2017 IEEE International Symposium on Multimedia (ISM) > 193 - 199

2017 IEEE International Symposium on Multimedia (ISM)

Since news videos are valuable sources of multimedia information on real-world events, there is a demand for viewing them efficiently. However, there is a problem that summarization methods based on auditory contents do not take into account the visual contents. In the case of news videos, due to its presentation style where audio contents and visual contents do not necessarily come from the same...

rozdział

Weakly Supervised Object Localization Using Things and Stuff Transfer

Miaojing Shi, Holger Caesar, Vittorio Ferrari

2017 IEEE International Conference on Computer Vision (ICCV) > 3401 - 3410

2017 IEEE International Conference on Computer Vision (ICCV)

We propose to help weakly supervised object localization for classes where location annotations are not available, by transferring things and stuff knowledge from a source set with available annotations. The source and target classes might share similar appearance (e.g. bear fur is similar to cat fur) or appear against similar background (e.g. horse and sheep appear against grass). To exploit this,...

rozdział

Extreme Clicking for Efficient Object Annotation

Dim P. Papadopoulos, Jasper R. R. Uijlings, Frank Keller, Vittorio Ferrari

2017 IEEE International Conference on Computer Vision (ICCV) > 4940 - 4949

2017 IEEE International Conference on Computer Vision (ICCV)

Manually annotating object bounding boxes is central to building computer vision datasets, and it is very time consuming (annotating ILSVRC [53] took 35s for one high-quality box [62]). It involves clicking on imaginary comers of a tight box around the object. This is difficult as these comers are often outside the actual object and several adjustments are required to obtain a tight box. We propose...

rozdział

A fast circle detection method based on threshold segmentation and validity check for FPC images

Jiaxiang Luo, Xuchao Chen, Yueming Hu

2017 Chinese Automation Congress (CAC) > 3214 - 3217

2017 Chinese Automation Congress (CAC)

Flexible printed circuit board (FPC) is a popular substrate for packaging integrated circuits (ICs). Detecting the circles rapidly on FPCs by using computer vision is very important to assess the quality of FPCs during its manufacturing. In this paper, a fast circle detection approach based on a threshold segmentation method and a validation check is proposed. In the algorithm, the image is firstly...

rozdział

Adaptive thresholding to robust image binarization for degraded document images

Prashant Devidas Ingle, Parminder Kaur

2017 1st International Conference on Intelligent Systems and Information Management (ICISIM) > 189 - 193

2017 1st International Conference on Intelligent Systems and Information Management (ICISIM)

Owing to the elevated intra/inter variation among the foreground and background text of various document images, the text segmentation from the poorly degraded document images is the difficult job. This paper presents the document image binarization method by adaptive image contrast which is the integration of the local image gradient and the local image contrast which is lenient to background and...

rozdział

Automatic parking slot detection based on around view monitor (AVM) systems

Lei Li, Changle Li, Qieshi Zhang, Tao Guo, więcej

2017 9th International Conference on Wireless Communications and Signal Processing (WCSP) > 1 - 6

2017 9th International Conference on Wireless Communications and Signal Processing (WCSP)

Automatic parking systems have significant effects on intelligent transport systems (ITS) and have been extensively researched. However, most existing vision-based automatic parking slot detection methods cannot obtain the desired results due to variation in light intensity or complex obstacle conditions. Besides, most previous parking slot detection methods only consider the target position occupied...

rozdział

Leveraging weak segmentation for multi-object tracking system

JiaXin Wang, CuiXia Ma, Hao Wang, HongAn Wang

2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC) > 63 - 68

2017 IEEE International Conference on Systems, Man and Cybernetics (SMC)

Object Tracking is an important task in Computer Vision, which has gained increasing attention from academia to industry. In this paper, we propose a real-time tracking system based on weak segmentation. Different from general tracking by detection systems, we do not classify objects into car, cat or bike, instead we just classify the image into object area and non-object area. Many tracking systems...

rozdział

Occlusion localization based on convolutional neural networks

Ya-Li Hou, Jinzhang Peng, Xiaoli Hao, Yan Shen, więcej

2017 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC) > 1 - 5

2017 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC)

In most convolutional neural networks (CNNs), the output is a single classification result by combining all the neuron activations in the last layer. As we know, local connectivity is an important characteristic of CNNs. Each neuron in the network corresponds to a local region in the original image. Hence, it is possible to simultaneously obtain local visibility of a target object by analyzing neuron...

rozdział

Semantic Line Detection and Its Applications

Jun-Tae Lee, Han-Ul Kim, Chul Lee, Chang-Su Kim

2017 IEEE International Conference on Computer Vision (ICCV) > 3249 - 3257

2017 IEEE International Conference on Computer Vision (ICCV)

Semantic lines characterize the layout of an image. Despite their importance in image analysis and scene understanding, there is no reliable research for semantic line detection. In this paper, we propose a semantic line detector using a convolutional neural network with multi-task learning, by regarding the line detection as a combination of classification and regression tasks. We use convolution...

rozdział

Learned Watershed: End-to-End Learning of Seeded Segmentation

Steffen Wolf, Lukas Schott, Ullrich Kothe, Fred Hamprecht

2017 IEEE International Conference on Computer Vision (ICCV) > 2030 - 2038

2017 IEEE International Conference on Computer Vision (ICCV)

Learned boundary maps are known to outperform handcrafted ones as a basis for the watershed algorithm. We show, for the first time, how to train watershed computation jointly with boundary map prediction. The estimator for the merging priorities is cast as a neural network that is convolutional (over space) and recurrent (over iterations). The latter allows learning of complex shape priors. The method...

rozdział

Deep Detection of People and their Mobility Aids for a Hospital Robot

Andres Vasquez, Marina Kollmitz, Andreas Eitel, Wolfram Burgard

2017 European Conference on Mobile Robots (ECMR) > 1 - 7

2017 European Conference on Mobile Robots (ECMR)

Robots operating in populated environments encounter many different types of people, some of whom might have an advanced need for cautious interaction, because of physical impairments or their advanced age. Robots therefore need to recognize such advanced demands to provide appropriate assistance, guidance or other forms of support. In this paper, we propose a depth-based perception pipeline that...

rozdział

Background subtraction using encoder-decoder structured convolutional neural network

Kyungsun Lim, Won-Dong Jang, Chang-Su Kim

2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS) > 1 - 6

2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS)

A background subtraction algorithm using an encoderdecoder structured convolutional neural network is proposed in this work, in order to segment out moving objects from the background. A target frame, its previous frame, and a background model are concatenated and fed into the network as the input. Then, the encoder generates a highlevel feature vector, and the decoder converts the feature vector...

rozdział

Novel method for joining missing line fragments for medical image analysis

Patryk Najgebauer, Leszek Rutkowski, Rafal Scherer

2017 22nd International Conference on Methods and Models in Automation and Robotics (MMAR) > 861 - 866

2017 22nd International Conference on Methods and Models in Automation and Robotics (MMAR)

We present a method of predictive reconstructing connections between parts of object outlines in images. The method was developed mainly to analyze microscopic medical images but is applicable to other types of images. Examined objects in such images are highly transparent, moreover close objects can overlap each other. Thus, segmentation and separation of such objects can be difficult. Another frequently...

rozdział

Enhanced pedestrian detection using deep learning based semantic image segmentation

Tianrui Liu, Tania Stathaki

2017 22nd International Conference on Digital Signal Processing (DSP) > 1 - 5

2017 22nd International Conference on Digital Signal Processing (DSP)

Pedestrian detection and semantic segmentation are highly correlated tasks which can be jointly used for better performance. In this paper, we propose a pedestrian detection method making use of semantic labeling to improve pedestrian detection results. A deep learning based semantic segmentation method is used to pixel-wise label images into 11 common classes. Semantic segmentation results which...

rozdział

Retinal image segmentation based on weighted local detectors and confusion matrix

Loretta Ichim, Dan Popescu

2017 40th International Conference on Telecommunications and Signal Processing (TSP) > 654 - 658

2017 40th International Conference on Telecommunications and Signal Processing (TSP)

The paper presents a method for accurate detection and localization of important regions from retinal images like: optic disc, macula, exudates and hemorrhages. To this end, the image is locally decomposed in sub-images (patches) and then it is processed based on the fusion of different information types: first order statistics, textural, fractal and spectral. Two multilayer processing networks are...

rozdział

CityPersons: A Diverse Dataset for Pedestrian Detection

Shanshan Zhang, Rodrigo Benenson, Bernt Schiele

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4457 - 4465

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Convnets have enabled significant progress in pedestrian detection recently, but there are still open questions regarding suitable architectures and training data. We revisit CNN design and point out key adaptations, enabling plain FasterRCNN to obtain state-of-the-art results on the Caltech dataset. To achieve further improvement from more and better data, we introduce CityPersons, a new set of person...

rozdział

Detangling People: Individuating Multiple Close People and Their Body Parts via Region Assembly

Hao Jiang, Kristen Grauman

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3435 - 3443

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Todays person detection methods work best when people are in common upright poses and appear reasonably well spaced out in the image. However, in many real images, thats not what people do. People often appear quite close to each other, e.g., with limbs linked or heads touching, and their poses are often not pedestrian-like. We propose an approach to detangle people in multi-person images. We formulate...

rozdział

Network Dissection: Quantifying Interpretability of Deep Visual Representations

David Bau, Bolei Zhou, Aditya Khosla, Aude Oliva, więcej

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3319 - 3327

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose a general framework called Network Dissection for quantifying the interpretability of latent representations of CNNs by evaluating the alignment between individual hidden units and a set of semantic concepts. Given any CNN model, the proposed method draws on a data set of concepts to score the semantics of hidden units at each intermediate convolutional layer. The units with semantics are...

rozdział

What Can Help Pedestrian Detection?

Jiayuan Mao, Tete Xiao, Yuning Jiang, Zhimin Cao

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6034 - 6043

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Aggregating extra features has been considered as an effective approach to boost traditional pedestrian detection methods. However, there is still a lack of studies on whether and how CNN-based pedestrian detectors can benefit from these extra features. The first contribution of this paper is exploring this issue by aggregating extra features into CNN-based pedestrian detection framework. Through...

rozdział

Pixelwise Instance Segmentation with a Dynamically Instantiated Network

Anurag Arnab, Philip H. S. Torr

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 879 - 888

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Semantic segmentation and object detection research have recently achieved rapid progress. However, the former task has no notion of different instances of the same object, and the latter operates at a coarse, bounding-box level. We propose an Instance Segmentation system that produces a segmentation map where each pixel is assigned an object class and instance identity label. Most approaches adapt...

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania

Summarization of News Videos Considering the Consistency of Auditory and Visual Contents

Weakly Supervised Object Localization Using Things and Stuff Transfer

Extreme Clicking for Efficient Object Annotation

A fast circle detection method based on threshold segmentation and validity check for FPC images

Adaptive thresholding to robust image binarization for degraded document images

Automatic parking slot detection based on around view monitor (AVM) systems

Leveraging weak segmentation for multi-object tracking system

Occlusion localization based on convolutional neural networks

Semantic Line Detection and Its Applications

Learned Watershed: End-to-End Learning of Seeded Segmentation

Deep Detection of People and their Mobility Aids for a Hospital Robot

Background subtraction using encoder-decoder structured convolutional neural network

Novel method for joining missing line fragments for medical image analysis

Enhanced pedestrian detection using deep learning based semantic image segmentation

Retinal image segmentation based on weighted local detectors and confusion matrix

CityPersons: A Diverse Dataset for Pedestrian Detection

Detangling People: Individuating Multiple Close People and Their Body Parts via Region Assembly

Network Dissection: Quantifying Interpretability of Deep Visual Representations

What Can Help Pedestrian Detection?

Pixelwise Instance Segmentation with a Dynamically Instantiated Network

Opcje filtrowania

Data publikacji

Dostępność treści

Słowa kluczowe

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania

Dodaj adresata

Anulowanie wysłania wiadomości

Czy na pewno chcesz anulować wysłanie wiadomości?

Wyślij wiadomość

Opcje filtrowania

Data publikacji

Ustawianie zakresu dat

Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.

Dostępność treści

Słowa kluczowe

Zgłaszanie błędu / nadużycia

Nieudane wysłanie zgłoszenia

Ułatwienia dostępu