The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
A large number of studies have been reported on top-down influences of visual attention. However, less progress have been made in understanding and modeling its mechanisms in real-world tasks. In this paper, we propose an approach for learning spatial attention taking into account influences of physical actions on top-down attention. For this purpose, we focus on interactive visual environments (video...
Many methods on extracting 3D information from 2D images have been studied since 1990s, especially the depth information extraction. A novel approach for depth extraction is proposed and implemented in this paper. About 30 images are taken especially for this case study and experiments are conducted on the Stanford Range Image Data. Results show that this approach is generally suitable for most of...
The ability to automatically discover error conditions with little human input is a feature lacking in most modern computer systems and networks. However, with the ever increasing size and complexity of modern systems, such a feature will become a necessity in the not too distant future. Our work proposes a hybrid framework that allows High Performance Clusters (HPC) to detect error conditions in...
We describe a study that aims towards enhancing our understanding of the perception of H.264/AVC compressed stereoscopic 3D videos, in particular spatial video quality, depth quality, visual comfort and overall 3D video quality. The results of this study indicate that the human subjects have diverse opinions on depth quality scores but a high agreement on spatial video quality. Their agreement on...
Increasing air-traffic demand implies that new air-traffic management (ATM) concepts lowering controller loads, maintaining safety and increasing efficiency need to be designed and implemented. Many of such ideas are prepared within NextGEN. Before they are deployed to real daily usage in National Airspace System (NAS), they must be rigorously evaluated under realistic conditions. The paper presents...
Decolorization - the process to transform a color image to a grayscale one - is a basic tool in digital printing, stylized black-and-white photography, and in many single channel image processing applications. In this paper, we propose an optimization approach aiming at maximally preserving the original color contrast. Our main contribution is to alleviate a strict order constraint for color mapping...
This paper presents a new generic framework for human visual system inspired object detection and recognition and introduces the idea of feature extraction based on the human visual sensitivity. These methods can greatly enhance robotic vision applications. Additionally a new computationally effective object detection algorithm is presented based on image morphology and visual sensitivity. This new...
There are lots of regions in human DNA (deoxy-ribo-nucleic acid) sequences which contain repetitive patterns. In this paper, the visualization of repetitive regions of DNA sequences via Short Time Fourier Transform is investigated.
Brain computer interface (BCI) is a developing research area which aims communication between the human user and the computers by way of user's brainwaves only. This can be accomplished by processing electroencephalography (EEG) data. In this paper the aim is to build a simple BCI in terms of both collection and classification of EEG data using event related potentials (ERP). Proposed approach helps...
Color image segmentation is a critical pre-process in image processing. Also it's important in the field of computer vision and pattern recognition. In this paper, we first state some evidence in the human vision research. Not all the intensity from 0 to 255 in RGB spaces can be distinguished by human vision. So we reduce the level of the intensity in RGB space to 26,28,26 respectively, while maintaining...
With the advent of image and video representation of visual scenes in digital computer, subsequent necessity of vision-substitution representation of a given image is felt. The medium for non-visual representation of an image is chosen to be sound due to well developed auditory sensing ability of human beings and wide availability of cheap audio hardware. Visionary information of an image can be conveyed...
Communication between cortices mediated by deep brain structures such as the amygdala and fusiform gyrus has been suggested to explain the enhanced perception of stimuli bearing emotional content or having facial features. In this paper, we analyze the dependence structure of the relevant brain regions to assess their connectivity in response to a facial stimulus, and to discriminate it from a mock...
Based on “ground truth” eye-tracking data, earlier research [1] shows that adding natural scene saliency (NSS) can improve an objective metric's performance in predicting perceived image quality. To include NSS in a real-world implementation of an objective metric, a computational model instead of eye-tracking data is needed. Existing models of visual saliency are generally designed for a specific...
We propose a saliency-maximized audio spectrogram as a representation that lets human analysts quickly search for and detect events in audio recordings. By rendering target events as visually salient patterns, this representation minimizes the time and effort needed to examine a recording. In particular, we propose a transformation of a conventional spectrogram that maximizes the mutual information...
Computer lip-reading is one of the great signal processing challenges. Not only is the signal noisy, it is variable. However it is almost unknown to compare the performance with human lip-readers. Partly this is because of the paucity of human lip-readers and partly because most automatic systems only handle data that are trivial and therefore not representative of human speech. Here we generate a...
Since CMMB is an important application in wireless communication field -- its video quality plays a critical role for widely use. In assessment of video sequence, classic method often applies algebra method in making a compute model, such as PSNR, which often result in difficult for alignment of video sequence numbers and leads to complex computation problems. In this paper, it presents a novel method...
Motion saliency detection has an important impact on further video processing tasks, such as video segmentation, object recognition and adaptive compression. Different to image saliency, in videos, moving regions (objects) catch human beings' attention much easier than static ones. Based on this observation, we propose a novel method of motion saliency detection, which makes use of the low-rank and...
Using lessons learned from error control coding, and multiple areas of life science, we propose a general purpose representation and association machine (GPRAM). In this part of paper, we illustrate our methodology, four principles, and our understanding of intelligence. We then introduce hierarchical structure and reasons to be vagueness, overcompleteness, and deliberate variation. After that, we...
Point to point navigation is a critical and demanding task for dismounted operators, especially while traversing hostile terrains. Visual displays such as a compass, maps, and global positioning systems have been the ubiquitous means of navigation and have proven to be effective; however, these tools require visual attention in an already visually demanding environment. Multiple resource theory proposes...
Narrow field of view of common Head Mount Displays, coupled with lack of adaptive camera accommodation and vergence make it impossible to view virtual scenes using familiar eye-head-body coordination patterns and reflexes. This impediment of natural habits is most noticeable in applications where users are facing multiple tasks, which require frequent switching between viewing modes, from wide range...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.