The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
We present a novel approach to scene classification using combined audio signal and video image features and compare this methodology to scene classification results using each modality in isolation. Each modality is represented using summary features, namely Mel-frequency Cepstral Coefficients (audio) and Scale Invariant Feature Transform (SIFT) (video) within a multi-resolution bag-of-features model...
This paper presents a bed-leaving detection method using Elman-type Counter Propagation Networks (ECPNs), a novel machine-learning-based method used for time-series signals. In our earlier study, we used CPNs, a form of supervised model of Self-Organizing Maps (SOMs), to produce category maps to learn relations among input and teaching signals. For this study, we inserted a feedback loop as the second...
In light of their rapid growth, there is a pressing need to develop analysis and decision solutions whether or not. However, most of protections are limited understanding of these mobile malware and sophisticated analyzing. In this paper, we propose a method of analyzing and deciding malware on the basis of similarity with existing malware families on the popular platform, Android. We focus on the...
As an alternative to vector-based descriptors, such as SIFT and SURF, more computationally efficient binary descriptors, such as BRISK and ORB, have recently been proposed. These binary descriptors are usually used in combination with a novel scale-space FAST-based detector to be suitable for real-time applications, but it consumes more time than creating binary descriptors. Therefore, if accuracy...
Local features have been widely used in visual object tracking for their robustness in illumination, deformation, rotation and partial occlusion. Traditional feature selection algorithms based on accumulated knowledge of previous frames usually adopt the perspective of continuity of changes, which could lead to degradation. Exploiting discrimination and uniqueness of local sub-blocks, we build an...
This paper introduces a novel dynamic neural network model which can recognize dynamic visual image patterns of human actions based on learning. The proposed model is characterized by its capability of extracting the spatio-temporal feature hierarchy latent in the training visual image streams. The model achieves this property by integrating two essential ideas: (1) multiple spatial-scales processing...
Real-time visual identification and tracking of objects is a computationally intensive task, particularly in cluttered environments which contain many visual distracters. In this paper we describe a real-time bio-inspired system for object tracking and identification which combines an event-based vision sensor with a convolutional neural network running on FPGA for recognition. The event-based vision...
Techniques that enable user interaction with a mixed environment in natural way and low-cost may provide a great potential to increase the degree of virtual presence. In this paper, we present a low-cost 3D interaction technique based on ARToolKit planar marker in order to interact with mixed reality environment, and manipulate 3D virtual object. This proposed method that we called “2 in 1 Marker”...
Visual objects decoding with functional magnetic resonance imaging (fMRI) often merely depends on the brain activity, using pattern classification to decode information about visual stimuli from patterns of activity. However, the spatial resolution of fMRI is still limited to the > mm range. Limited by its spatial resolution, fMRI voxels lack high-spatial-frequency information of visual stimuli...
With shorter calibration times and higher information transfer rates, steady-state visual evoked potential (SSVEP)-based brain-computer interfaces (BCIs) have been studied most activity in recent years. Target identification is the ongoing core task in BCI researches, and plays a significant role in practical applications. In order to improve the performance of SSVEP-based BCI system, we proposed...
Extracting data from web pages is an important task for several applications, such as comparison shopping and data mining. Much of that data is provided by search result pages, in which each result, called search result record, represents a record from a database. One of the most important steps for extracting such records is identifying, among different data regions from a page, one that contains...
Effective user training could help us to improve the discrimination performance of our intention in brain computer interface (BCI). This paper aims to differentiate users left or right hand motor imagery (MI) tasks with different scenarios in 3D virtual environment, as non-object-directed (NOD) scenario, static-object-directed (SOD) scenario and dynamic-object-directed (DOD) scenario respectively...
There are various precise spatial manipulators that have been employed in the field micro engineering, medical surgery and biology. For this reason, a lot of researches have been done, and a variety of control methods and mechanisms have been developed. However it is very small cases to give the wider working ranges in XYZ as well as large angular motion to such a scalpel or a syringe under the microscope...
Wide-Angle Fovea Vision Sensor (WAFVS) system was designed and developed being inspired from advantages of the human eye's functions. This system is characterized by its space-variant data acquisition property, i.e., the WAFVS captures a 120-degree wide-angle input image in which its resolution (or magnification) changes like the human visual acuity. As well-known, the human visual acuity is the highest...
Many office workers today sit and work at computers for extended periods of time, which can result in a group of symptoms called “Office Workers Syndrome”. To help prevent these symptoms, we propose a novel system to monitor computer users by using a Kinect camera. Firstly, data mining classification is applied for detection of prolonged sitting, while mathematics that include a spherical coordinate...
Due to the increasing popularity of location-based services, the need for reliable and cost-effective indoor positioning methods is rising. As an alternative to radio-based localization methods, in 2011, we introduced MoVIPS (Mobile Visual Indoor Positioning System), which is based on the idea to extract visual feature points from a query image and compare them to those of previously collected geo-referenced...
The Bag of Words (BOW) method with spatio-temporal interest points has achieved great performance in human action recognition. However the traditional BOW methods based on vector quantization (VQ) suffer serious quantization error and lose masses of information. There are two main reasons leading these: the first is the codebook obtained by k-means has no obvious visual interpretation and second,...
The function of depth perception may be related to the ability to perform vergence eye movements during the viewing of three-dimensional stereoscopic movies.
In this paper, we propose a stand-alone mobile visual search system based on binary features and bag of visual words framework. The contribution of this paper is two-fold: (1) a visual word-dependent substring extraction method is proposed; (2) a modified version of the local NBNN scoring method is proposed in the context of image retrieval. The proposed system improves retrieval accuracy by 11% compared...
The GrabCut, which uses the graph-cut iteratively, is popularly used as an interactive image segmentation method since it can produce the globally optimal result. However, since the initialization of the GrabCut is roughly performed by the manual interaction, the accuracy of the segmentation result is not guaranteed when the user defines an inaccurate guide. To solve this problem, in this paper, we...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.