The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Persistent detection and tracking of moving vehicles in airborne imagery provide indispensable information for many traffic surveillance applications including traffic monitoring and management, navigation systems, activity recognition and event detection. This paper presents a collaborative Spatial Pyramid Context-aware detection and Tracking system (SPCT) for moving vehicles in dense urban aerial...
This paper presents a framework which allows urban planners to navigate and interact with large datasets fused with social feeds in real-time, enhanced by a virtual reality (VR) capability, which further promotes the knowledge discovery process and allows to interact with urban data in natural yet immersive way. A challenge in urban planning is making decisions based on datasets which are many times...
Zero-shot image classification using auxiliary information, such as attributes describing discriminative object properties, requires time-consuming annotation by domain experts. We instead propose a method that relies on human gaze as auxiliary information, exploiting that even non-expert users have a natural ability to judge class membership. We present a data collection paradigm that involves a...
Visual words of Bag-of-Visual-Words (BoVW) framework are independent each other, which results in not only discarding spatial orders between visual words but also lacking semantic information. This study is inspired by word embeddings that a similar embedding procedure is applied to a large number of visual words. By this way, the corresponding embedding vectors of the visual words can be formulated...
A novel and effective framework for the enhancement of low lighting images is proposed in this paper. The novel framework presents an optimized de-haze algorithm on inverted images to enhance the low-dynamic-range images which optimizes the complicated process of computing the parameters A and t(x). The improved gamma correction is used to enhance the image contrast for providing better visual performance...
Recent advances in technology and rapid growth of consumer electronics have made tremendous amount of multimedia information available to the general population. Browsing through large collections of consumer videos and manually creating summaries can be tedious. Automatic summarization techniques will give the user an easy way to look up important content of a collection of media and to browse media...
This paper describes a component of an Augmented Reality (AR) based system focused on supporting workers in manufacturing and maintenance industry. Particularly, it describes a component responsible for verification of performed steps. Correct handling is crucial in both manufacturing and maintenance industries and deviations may cause problems in later stages of the production and assembly. The primary...
In today world the necessity for the autonomous mobile robots and vehicles is increasing. The safety autonomous moving demands the reliable and fast detection algorithms. The Histogram of Oriented Gradients (HOG) descriptors show significantly outperforms the existing feature sets for a human detection. Though the given method has a lot of type I errors. The amount of these errors can be decreased...
This article gives a more robust justification for the use of the Bhattacharyya distance in the algorithm used by our Automated Sport Analysis System named ACE in the first of its three perception modules. Such first module consists in the temporal segmentation of television video broadcasts, aiming to break down the video into shots, delimited by scene boundaries. An evaluation of other seven histogram...
We propose a novel method for developing static storyboard for video clips included with biomedical research literature. The technique uses both visual and audio content in the video to select candidate key frames for the storyboard. From the visual channel, the Intra-frames are extracted using FFmpeg tool. IBM Watson speech-to-text service is used to extract words from the audio channel, from which...
Thermal image has many applications on image processing such as human detection, face recognition and physiological signal evaluation, etc. The respiratory rate is an important physiological signal, and it is highly related to emotion and some diseases. Therefore, we propose a non-contact method to estimate the respiratory rate from thermal image in this paper. Thermal image can provide the information...
This paper presents a new approach towards the selection of color image features to be used in the classification of burn wounds. The features are selected such that they generate similarity matrices and multidimensional scaling (MDS) plots that match the similarity matrix and the MDS-plot resulting from a subjective visual burn area similarity test performed by trained surgeons. We show that standard...
In this paper, a color image enhancement method is presented by using intensity histogram equalization (HE) approach without changing hue and saturation in HSI color space. The proposed method has better visual colorfulness than the conventional HE method because hue and saturation are preserved in the enhancement process. The back-lighting image and night-time image are used to demonstrate the effectiveness...
This paper proposes a modified spatially-constrained similarity measure (mSCSM) method for endosomal structure detection and localization under the bag-of-words (BoW) framework. To our best knowledge, the proposed mSCSM is the first method for fully automatic detection and localization of complex subcellular compartments like endosomes. Essentially, a new similarity score and a novel two-stage output...
The key frame extraction helps us to make obtainable summary of a video. After studying a variety of diverse methods of Key frame extraction, we will have comparative analysis of the methods depending on their important features and result. If we want to present the entire video within a squat interval of time, video summary becomes the best alternative for this. This has become a very essential work...
Planetary rovers face mobility hazards associated with various classes of terrains they traverse: sand, bedrock, and rock-strewn terrain. This work develops visual classifiers for these 3 terrain types for single monochrome navigation images from the NASA Mars Exploration Rover missions. The classifiers are based primarily on visual texture, captured in histograms of edges filter responses at various...
In this work, we propose a framework to deal with cross-modal visuo-tactile object recognition. By cross-modal visuo-tactile object recognition, we mean that the object recognition algorithm is trained only with visual data and is able to recognize objects leveraging only tactile perception. The proposed cross-modal framework is constituted by three main elements. The first is a unified representation...
Brain-computer interface (BCI) systems can translate the human mind into control commands, which makes it feasible to improve the life quality of physically challenged people. However, in real-life situations, it is still difficult for users to utilize robots to provide basic services with BCI systems. We aimed to propose a BCI-based system with a visual servo module to operate a service robot. We...
Object representation is a major component in object tracking, however, most conventional patch-based methods just simply decompose the object into patches with grid or stochastic rectangles. This kind of decomposition ignores the intrinsic structure of object, leading to low discriminative power and weak representation effectiveness when similar objects appear or under background clutters. In this...
Different kinds of features hold some distinct merits, making them complementary to each other. Inspired by this idea an index level multiple feature fusion scheme via similarity matrix pooling is proposed in this paper. We first compute the similarity matrix of each index, and then a novel scheme is used to pool on these similarity matrices for updating the original indices. Compared with the existing...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.