The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Persistent detection and tracking of moving vehicles in airborne imagery provide indispensable information for many traffic surveillance applications including traffic monitoring and management, navigation systems, activity recognition and event detection. This paper presents a collaborative Spatial Pyramid Context-aware detection and Tracking system (SPCT) for moving vehicles in dense urban aerial...
This paper presents a method for estimating the number, as well as the activity periods of spatially distributed sound sources using an uncalibrated microphone array. This methodology is applied for the purposes of speaker diarization. In general, speaker diarization has difficulty with: 1) estimating the number of sound sources (speakers), and 2) activity detection of multiple sound sources including...
Vehicle classification plays an important part in Intelligent Transport System. Recently, deep learning has showed outstanding performance in image classification. However, numerous parameters of the deep network need to be optimized which is time-consuming. PCANet is a light-weight deep learning network that is easy to train. In this paper, a new robust vehicle classification method is proposed,...
With the expansion of digital world, the scope of copyright has started facing a great threat. Thus, there is a need to protect the original work from being copied or distributed illegally. Image watermarking plays a very important role in solving this issue. Image watermarking is the procedure to embed a particular piece of information (watermark) in an image such that it is not distorted. Thus on...
We propose a direct monocular SLAM algorithm based on the Normalised Information Distance (NID) metric. In contrast to current state-of-the-art direct methods based on photometric error minimisation, our information-theoretic NID metric provides robustness to appearance variation due to lighting, weather and structural changes in the scene. We demonstrate successful localisation and mapping across...
We present a novel method to track 3D models in color and depth data. To this end, we introduce approximations that accelerate the state-of-the-art in region-based tracking by an order of magnitude while retaining similar accuracy. Furthermore, we show how the method can be made more robust in the presence of depth data and consequently formulate a new joint contour and ICP tracking energy. We present...
In this article, we compare three change detection methods for hyperspectral imagery and establish their sensitivity to registration mismatch. We further present metrics that enable this comparison and seek to rank the methods.
This paper describes a joint intensity metric learning method to improve the robustness of gait recognition with silhouette-based descriptors such as gait energy images. Because existing methods often use the difference of image intensities between a matching pair (e.g., the absolute difference of gait energies for the l1-norm) to measure a dissimilarity, large intrasubject differences derived from...
For the pedestrian tracking failure problems of the multiple concurrent particle filter with partial occlusion in active safety, this paper proposed a fuzzy decision algorithm to decrease the error probability when the multiple deformable parts are used to tracking an up-right person in the video frames. This algorithm applies the multiple adaptive color-based particle filter trackers to trace the...
Human action classification, which is vital for content-based video retrieval and human-machine interaction, finds problem in distinguishing similar actions. Previous works typically detect spatial-temporal interest points (STIPs) from action sequences and then adopt bag-of-visual words (BoVW) model to describe actions as numerical statistics of STIPs. Despite the robustness of BoVW, this model ignores...
Despite many proposed solutions, multi-object tracking remains a challenging problem in complex situations involving partial occlusions and non-uniform and abrupt illumination changes. Considering modular systems, the tracking performance strongly depends on the consistency of the different blocks relatively to error features. In this work, using the Belief Function framework, we take into account...
Text embedded in natural scene images provide rich semantic information about the scene, which is of great value for content-based image applications. Due to the variety of text appearance and the complexity of scene context, however, text detection in natural images remains a challenging task. In this paper, we propose a robust text detection method that hierarchically and progressively localizes...
Fragment reconstruction aims to restore broken images and documents via matching spatial adjacent fragments. As the existing solutions in the literature still remain problematic, we present a novel feature descriptor, Normal Direction Local Binary Pattern (termed as ND-LBP), for document/image fragment matching. ND-LBP is based on the conventional LBP descriptor, however, it outstands LBP by introducing...
Person re-identification (re-id) aims to match a specific person across non-overlapping views of different cameras, which is currently one of the hot topics in computer vision. Compared with image-based person re-id, video-based techniques could achieve better performance by fully utilizing the space-time information. This paper presents a novel video-based person re-id method named Deep Feature Guided...
Stereo matching is an active research area in computer vision for decades. Most of the existing stereo matching algorithms assume that the corresponding pixels have the same intensity or color in both images. But in real world situations, image color values are often affected by various radiometric factors such as exposure and lighting variations. This paper introduces a robust stereo matching algorithm...
This paper studies an image descriptor that mimics the retina's photo receiving cell pattern. Various pattern differencing combinations and second order differencing techniques were explored. This new method shows higher precision and recall performance than the classical BRIEF and steered BRIEF descriptors.
Offline signature identification and verification systems encounter several challenges such as the diversity of signatories and the limited number of references. To address these problems we propose a new writer-independent system for signature identification and verification. Besides, a new feature generation scheme is proposed by using the Histogram Of Templates (HOT). The identification and verification...
This paper addresses a difficulty in large-scale long term laser localisation — how to deal with scene change. We pose this as a distraction suppression problem. Urban driving environments are frequently subject to large dynamic outliers, such as buses, trucks etc. These objects can mask the static elements of the prior map that we rely on for localisation. At the same time some objects change shape...
This paper presents a novel appearance and shape feature, RISAS, which is robust to viewpoint, illumination, scale and rotation variations. RISAS consists of a keypoint detector and a feature descriptor both of which utilise texture and geometric information present in the appearance and shape channels. A novel response function based on the surface normals is used in combination with the Harris corner...
Object representation is a major component in object tracking, however, most conventional patch-based methods just simply decompose the object into patches with grid or stochastic rectangles. This kind of decomposition ignores the intrinsic structure of object, leading to low discriminative power and weak representation effectiveness when similar objects appear or under background clutters. In this...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.