Serwis Infona wykorzystuje pliki cookies (ciasteczka). Są to wartości tekstowe, zapamiętywane przez przeglądarkę na urządzeniu użytkownika. Nasz serwis ma dostęp do tych wartości oraz wykorzystuje je do zapamiętania danych dotyczących użytkownika, takich jak np. ustawienia (typu widok ekranu, wybór języka interfejsu), zapamiętanie zalogowania. Korzystanie z serwisu Infona oznacza zgodę na zapis informacji i ich wykorzystanie dla celów korzytania z serwisu. Więcej informacji można znaleźć w Polityce prywatności oraz Regulaminie serwisu. Zamknięcie tego okienka potwierdza zapoznanie się z informacją o plikach cookies, akceptację polityki prywatności i regulaminu oraz sposobu wykorzystywania plików cookies w serwisie. Możesz zmienić ustawienia obsługi cookies w swojej przeglądarce.
We propose a new pretreatment for pedestrian detection with convolutional networks. It is widely known that the phenomenon of overlapping feature distribution is common, which leads to overfitting problem. We present a method that divide one category that have overlapping distributed features into multi-subcategories. By this means smooth boundaries can be easily found to separate different subcategories,...
Object detection is a challenging task in the field of pattern recognition. The objective of object detection is to locate the target objects in the testing images. In this paper, we use SVM trained active basis model as a sparse coding model for representing objects. The sparse coding model represents each image as the linear superposition of a small number of Gabor wavelets selected from an over-complete...
Deformable Part Model (DPM) is one of the best algorithms for image-based object detection. However, the high computation intensity leads to relatively long detecting time. Even with the powerful CPU or GPU computing system, it is still too slow for practical applications. To solve this problem, this paper proposes a high performance DPM accelerator based on FPGA, where a dedicated JPEG decoder is...
Partially visibility frequently happens in airplane detection tasks for remote sensing community, yet rarely considered. The commonly-used Deformable Part Model (DPM), which is specialized for object detection, has long been used for pedestrian detection in ground shooting images and shown its unbelievable power to handle with structure deformation and appearance variation for object modeling. However...
Vehicle detection applications play an important role in the reduction of the number of road accidents. In the same vein, this paper tends to summarize the recent advances in vehicle detection approaches. Both the approaches based on motion and those based on appearance are dealt with. Also, the challenges and limitations of using handcraft features are discussed. Moreover, we compare different approaches...
Deep Learning (DL), especially Convolutional Neural Networks (CNN), has become the state-of-the-art for a variety of pattern recognition issues. Technological developments have allowed the use of high-end General Purpose Graphic Processor Units (GPGPU) for accelerating numerical problem solving. They resort no only to lower computational time, but also allow considering much larger networks. Hence,...
Dictionary Learning Functions of Multiple Instances (DL-FUMI) is proposed to address target detection problems with inaccurate training labels. DL-FUMI is a multiple instance dictionary learning method that estimates target atoms that describe distinctive and representative features of the target class and background atoms that account for the shared features found across both target and non-target...
We aim to study the modeling limitations of the commonly employed boosted decision trees classifier. Inspired by the success of large, data-hungry visual recognition models (e.g. deep convolutional neural networks), this paper focuses on the relationship between modeling capacity of the weak learners, dataset size, and dataset properties. A set of novel experiments on the Caltech Pedestrian Detection...
Efficient detection of three dimensional (3D) objects in point clouds is a challenging problem. Performing 3D descriptor matching or 3D scanning-window search with detector are both time-consuming due to the 3-dimensional complexity. One solution is to project 3D point cloud into 2D images and thus transform the 3D detection problem into 2D space, but projection at multiple viewpoints and rotations...
Determining visual saliency is one of the fundamental problems in computer vision as the saliency not only identifies the most informative parts of a visual scene but may also reduce computational complexity by filtering out irrelevant segments of the scene. In this paper, we propose a novel saliency object detection method that combines a shape-preserving saliency prediction driven by a convolutional...
Domain adaption tends to transfer knowledge across domains following dissimilar distribution and where target domain has inadequate labelled samples. When knowledge is transferred from abundantly irrelevant sources negative transfer may occur resulting in poor classification of test samples. Deep learning research illustrates the semantic clustering as well as transferability of deep convolutional...
There is a need for automatic processing and extracting of meaningful metadata from multimedia information, especially in the audiovisual industry. This higher level information is used in a variety of practices, such as enriching multimedia content with external links, clickable objects and useful related information in general. This paper presents a system for efficient multimedia content analysis...
In this paper, we propose a mutual framework that combines two state-of-the-art visual object tracking algorithms. Both trackers benefit from each other's advantage leading to an efficient visual tracking approach. Many state-of-the-art trackers have poor performance due to rain, fog or occlusion in real-world scenarios. Often, after several frames, objects are getting lost, only leading to a short-term...
Object detection in high resolution remote sensing images is a crucial yet challenging problem for many applications. With the development of satellite and sensor technologies, remote sensing images attain very high spatial resolution, giving rise to the employment of many computer vision algorithms. Therefore, the object detection is usually formalized as a supervised classification task. In this...
Hough Forest is a framework combining Hough Transform and Random Forest for object detection. The purpose of the present paper is to improve the efficiency and reliability of the original framework by the mean of two contributions. First, instead of generating the image samples by drawing patches randomly from the training set, we bias this step toward the most relevant image content by selecting...
Object detection and recognition are typically stages that form part of the perception module of Autonomous Underwater Vehicles, used with different sensors such as Sonar and Optical imaging, but their design is usually separate and they are only combined at test time. In this work we present a convolutional neural network that does both object detection (through detection proposals) and recognition...
Agent system is strategy to enhance the underwater manipulation. The conventional manipulation is generally robot arm-based configuration which has singular points. On the other hand, the agent system is an armless manipulation that the agent vehicle works as the end-effector. If the location of the agent can be measured, the end effector is able to be place to any position. To implement this system,...
Automatic Target Recognition (ATR) technology is of great significance in security inspection, while traditional object detection methods are proved not efficient in human body millimeter-wave images. In this paper, we propose a synthetic objection detection method for millimeter-wave images. We choose saliency, SIFT and HOG features to form image descriptors. According to sparse representation, the...
Hand detection is an important issue in the analysis of drivers activities, assessment of drivers alertness, and subsequent development of driver safety monitoring system. In this work, the hand detection problem is addressed in the deep Convolutional Neural Network (CNN) framework. Hypothesis of hand regions are first generated with high recall rate by AdaBoost detector associated with Aggregated...
The detection of people in crowded scenes is a challenging task owing to both the severe occlusion among people and various changes in posture. We propose a regional-based convolutional network to address the tasks of people detection in the crowded scenes. Unlike the traditional methods, we propose an end-to-end framework that uses the convolutional network for feature representation, which generates...
Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.