The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Despite of the ultra-wideband (UWB) system's robustness against multipath in cluttered environments, a number of challenges remain before UWB localization can be implemented. In particular, non-line-of-sight (NLOS) propagation is especially critical for high-resolution localization systems because non-negligibly positive biases will be introduced in distance measurements, thus degrading the localization...
Detecting potential aerial threats like drones with computer vision is at the paramount of interest for the protection of critical locations. This type of a system should prevent efficiently the false alarms caused by non-malign objects such as birds, which intrude the image plane. In this paper, we propose an improved version of a previously presented Speeded-up Robust Feature Transform (SURF) based...
Visual words of Bag-of-Visual-Words (BoVW) framework are independent each other, which results in not only discarding spatial orders between visual words but also lacking semantic information. This study is inspired by word embeddings that a similar embedding procedure is applied to a large number of visual words. By this way, the corresponding embedding vectors of the visual words can be formulated...
This paper describes a component of an Augmented Reality (AR) based system focused on supporting workers in manufacturing and maintenance industry. Particularly, it describes a component responsible for verification of performed steps. Correct handling is crucial in both manufacturing and maintenance industries and deviations may cause problems in later stages of the production and assembly. The primary...
In today world the necessity for the autonomous mobile robots and vehicles is increasing. The safety autonomous moving demands the reliable and fast detection algorithms. The Histogram of Oriented Gradients (HOG) descriptors show significantly outperforms the existing feature sets for a human detection. Though the given method has a lot of type I errors. The amount of these errors can be decreased...
Planetary rovers face mobility hazards associated with various classes of terrains they traverse: sand, bedrock, and rock-strewn terrain. This work develops visual classifiers for these 3 terrain types for single monochrome navigation images from the NASA Mars Exploration Rover missions. The classifiers are based primarily on visual texture, captured in histograms of edges filter responses at various...
In this work, we propose a framework to deal with cross-modal visuo-tactile object recognition. By cross-modal visuo-tactile object recognition, we mean that the object recognition algorithm is trained only with visual data and is able to recognize objects leveraging only tactile perception. The proposed cross-modal framework is constituted by three main elements. The first is a unified representation...
In human action classification task, a video must be classified into a pre-determined class. To cope with this problem, we propose a mid-level representation, in which information about quantization errors is embedded together with the aggregated data on low level features. The main contributions of this article are twofold: (i) assembly of low-level features (dense trajectories) by a mid-level representation...
Distributed surveillance systems have become popular in recent years due to security concerns. However, transmitting high dimensional data in bandwidth-limited distributed systems becomes a major challenge. In this paper, we address this issue by proposing a novel probabilistic algorithm based on the divergence between the probability distributions of the visual features in order to reduce their dimensionality...
Abstract—In this paper, we analyse the use of Convolutional Neural Networks (CNNs or ConvNets) to discriminate vegetation species with few labelled samples. To the best of our knowledge, this is the first work dedicated to the investigation of the use of deep features in such task. The experimental evaluation demonstrate that deep features significantly outperform wellknown feature extraction techniques...
In case when higher-order statistic is used for local feature aggregation, final descriptor can have very high dimensionality. In this paper different methods for descriptor dimensionality reduction are evaluated for land-use classification. Concretely, aerial image classification accuracy is compared for the cases when dimensionality reduction is made per band with fixed and variable sizes. For both...
Sparse Coding is a widely used method to represent an image. However, sparse coding and its improved algorithms have the problem of complex computation and long running time and so on. For these problems, we propose an image classification method based on hash codes and space pyramid, which encodes local feature points with hash codes instead of sparse coding. Firstly, extract the local feature points...
In recent years, deep convolutional neural networks have achieved state of the art performance in various computer vision tasks such as classification, detection or segmentation. Due to their outstanding performance, CNNs are more and more used in the field of document image analysis as well. In this work, we present a CNN architecture that is trained with the recently proposed PHOC representation...
The H-KWS 2016, organized in the context of the ICFHR 2016 conference aims at setting up an evaluation framework for benchmarking handwritten keyword spotting (KWS) examining both the Query by Example (QbE) and the Query by String (QbS) approaches. Both KWS approaches were hosted into two different tracks, which in turn were split into two distinct challenges, namely, a segmentation-based and a segmentation-free...
Often, videos are composed of multiple concepts or even genres. For instance, news videos may contain sports, action, nature, etc. Therefore, encoding the distribution of such concepts/genres in a compact and effective representation is a challenging task. In this sense, we propose the Bag of Genres representation, which is based on a visual dictionary defined by a genre classifier. Each visual word...
Distributed object recognition is a significantly fast-growing research area, mainly motivated by the emergence of high performance cameras and their integration with modern wireless sensor network technologies. In wireless distributed object recognition, the bandwidth is limited and it is desirable to avoid transmitting redundant visual features from multiple cameras to the base station. In this...
Network analysis based on anatomical covariance (cortical thickness) has been gaining increasing popularity in the last decade. However, there has not been a systematic study of the impact of nodal sizes and edge definitions on predictive performance among various network studies. In order to obtain a clear understanding of relative performance, there is a need for systematic comparison. In this study,...
In this paper we propose an automatic marine life monitoring system. First task in the monitoring process is to detect underwater moving objects as fishes. Second Task is to identify the species of the detected fish. Third task is to track the detected fish to avoid multiple counting and record their activities. Detection is performed using GMM based background subtraction method, classification is...
Object tracking is a challenging problem in computer vision as many performance affecting factors need to be considered in a robust algorithm. We propose a framework to consolidate Integral Channel Features (ICF) to represent targets' appearance by embedding global and patch based approaches which offer feature strength and accuracy to the target template. The use of ICF expedites the extraction of...
Over the years, computer vision researchers have spent an immense amount of effort on designing image features for the visual object recognition task. We propose to incorporate this valuable experience to guide the task of training deep neural networks. Our idea is to pretrain the network through the task of replicating the process of hand-designed feature extraction. By learning to replicate the...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.