The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
We present a video summarization approach for egocentric or “wearable” camera data. Given hours of video, the proposed method produces a compact storyboard summary of the camera wearer's day. In contrast to traditional keyframe selection techniques, the resulting summary focuses on the most important objects and people with which the camera wearer interacts. To accomplish this, we develop region cues...
We present a machine learning framework that automatically generates a model set of landmarks for some class of registered 3D objects: here we use human faces. The aim is to replace heuristically-designed landmark models by something that is learned from training data. The value of this automatically generated model is an expected improvement in robustness and precision of learning-based 3D landmarking...
In this paper, we propose an effective method to recognize human actions from 3D positions of body joints. With the release of RGBD sensors and associated SDK, human body joints can be extracted in real time with reasonable accuracy. In our method, we propose a new type of features based on position differences of joints, EigenJoints, which combine action information including static posture, motion,...
In this paper, we present a gamesourcing method for automatically and rapidly acquiring labeled images of human poses to obtain ground truth data as input for human pose estimation from 2D images. Typically, these datasets are constructed manually through a tedious process of clicking on joint locations in images. By using a low-cost RGBD sensor, we capture synchronized, registered images, depth maps,...
The launch of Xbox Kinect has built a very successful computer vision product and made a big impact to the gaming industry; this sheds lights onto a wide variety of potential applications related to action recognition. The accurate estimation of human poses from the depth image is universally a critical step. However, existing pose estimation systems exhibit failures when faced severe occlusion. In...
While bottom-up and top-down processes have shown effectiveness during predicting attention and eye fixation maps on images, in this paper, inspired by the perceptual organization mechanism before attention selection, we propose to utilize figure-ground maps for the purpose. So as to take both pixel-wise and region-wise interactions into consideration when predicting label probabilities for each pixel,...
In this paper we present the first large-scale scene attribute database. First, we perform crowd-sourced human studies to find a taxonomy of 102 discriminative attributes. Next, we build the “SUN attribute database” on top of the diverse SUN categorical database. Our attribute database spans more than 700 categories and 14,000 images and has potential for use in high-level scene understanding and...
In recent years, the rise of digital image and video data available has led to an increasing demand for image annotation. In this paper, we propose an interactive object annotation method that incrementally trains an object detector while the user provides annotations. In the design of the system, we have focused on minimizing human annotation time rather than pure algorithm learning performance....
The human vision tends to recognize more variants of a distinctive exemplar. This observation suggests that discriminative power of training exemplars could be utilized for shaping a desirable global classifier that generalizes maximally from a few exemplars. We propose to derive classification uncertainty for each exemplar, using a local classification task to separate the exemplar from those in...
Understanding natural human activity involves not only identifying the action being performed, but also locating the semantic elements of the scene and describing the person's interaction with them. We present a system that is able to recognize complex, fine-grained human actions involving the manipulation of objects in realistic action sequences. Our method takes advantage of recent advances in sensors...
This study was conducted to measure the readiness of PT Pertamina in implementing e-learning, where the study refers to the theories and methods of measurement have been developed by Aydin & Tasci. The results showed that PT Pertamina as a whole is ready for e-learning, but still need some improvement especially in the field of human resources. In addition, this study confirms that personal characteristics...
Reaching and grasping of objects in an everyday-life environment seems so simple for humans, though so complicated from an engineering point of view. Humans use a variety of strategies for reaching and grasping anything from the simplest to the most complicated objects, achieving high dexterity and efficiency. This seemingly simple process of reach-to-grasp relies on the complex coordination of the...
The studies on mirror neurons observed in monkeys indicate that recognition of other's actions activates neural circuits that are also responsible for generating the very same actions in the animal. The mirror neuron hypothesis argues that such an overlap between action generation and recognition can provide a shared worldview among individuals and be a key pillar for communication. Inspired by these...
In this paper, a novel sparse feature representation method for object tracking is proposed. The method is on the observation that a tracked object can be dynamically and compactly represented by a few features (sparse representation) from a large feature set (the improved histogram of oriented gradient and color, HOGC). Based on the HOGC features, the sparse representation can be learned online from...
Short Utterance Speaker Recognition (SUSR) is an important area of speaker recognition when only small amount of speech data is available for testing and training. We list the most commonly used state-of-the-art methods of speaker recognition and the significance of prosodic speaker recognition. A short survey of SUSR is hereby conducted, highlighting various methodologies when using short utterances...
Rehabilitation robotic devices have been actively explored for training patients with impaired neural functions or assisting those with weak limbs due to aging or diseases. In recent years, the authors have proposed light-weight exoskeleton designs for the upper arm, in which rigid links of the exoskeleton are replaced by lightweight cuffs attached to the moving limb segments of the human arm. Cables,...
This paper presents an adaptive neuro/fuzzy system which can be trained to detect the current human emotions from a set of measured responses. Six models are built using different types of input/output membership functions and trained by different kinds of input arrays. The models are compared based on their ability to train with lowest error values. Many factors impact the error values such as input/output...
Extracting and labeling sulcal curves on the human cerebral cortex is important for many neuroscience studies, however manually annotating the sulcal curves is a time-consuming task. In this paper, we present an automatic sulcal curve extraction method by registering a set of dense landmark points representing the sulcal curves to the subject cortical surface. A Markov random field is used to model...
This paper is an attempt to explore a human element not easily solved in the image processing communities. The problem statement is vague but important to address. What is a good image? More specifically, if a low contrast image is presented, at what level of enhancement is good enough for a human observer? This of course depends on diverse elements, e.g., personal preference, emotional state, physical...
Preventing a traffic accident is a good way to solve many problems in the world for the current generation surrounding with many automotive technologies causing many people's death from the accident. The prevention makes an important impact to every society for making many people more safety and improving their lives' quality. In the fact, the primary cause is mostly drivers' carelessness and lacking...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.