The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Advertising on the Internet is a key factor for the success of several businesses nowadays. The Internet has evolved to a point where it has become possible to develop a business model completely based on Web advertising, which is important for the consolidation of such a model and the continuity of the Internet itself. However, it is often observed that some content publishers are dishonest and employ...
While activity recognition is a current focus of research the challenging problem of fine-grained activity recognition is largely overlooked. We thus propose a novel database of 65 cooking activities, continuously recorded in a realistic setting. Activities are distinguished by fine-grained body motions that have low inter-class variability and high intra-class variability due to diverse subjects...
This paper focus on understanding human visual system when it decodes or recognizes facial expressions. Results presented can be exploited by the computer vision research community for the development of robust descriptor based on human visual system for facial expressions recognition. We have conducted psycho-visual experimental study to find which facial region is perceptually more attractive or...
In this paper, we present a gamesourcing method for automatically and rapidly acquiring labeled images of human poses to obtain ground truth data as input for human pose estimation from 2D images. Typically, these datasets are constructed manually through a tedious process of clicking on joint locations in images. By using a low-cost RGBD sensor, we capture synchronized, registered images, depth maps,...
Many works in computer vision attempt to solve different tasks such as object detection, scene recognition or attribute detection, either separately or as a joint problem. In recent years, there has been a growing interest in combining the results from these different tasks in order to provide a textual description of the scene. However, when describing a scene, there are many items that can be mentioned...
In this paper we present the first large-scale scene attribute database. First, we perform crowd-sourced human studies to find a taxonomy of 102 discriminative attributes. Next, we build the “SUN attribute database” on top of the diverse SUN categorical database. Our attribute database spans more than 700 categories and 14,000 images and has potential for use in high-level scene understanding and...
Curve fragments, as opposed to unorganized edge elements, are of interest and use in a large number of applications such as multiview reconstructions, tracking, motion-based segmentation, and object recognition. A large number of contour grouping algorithms have been developed, but progress in this area has been hampered by the fact that current evaluation methodologies are mainly edge-based, thus...
This paper introduces Avatar CAPTCHA, an image based approach to distinguish human users from computer programs (bots). The proposed CAPTCHA asks users to identify avatar faces from a set of 12 grayscale images comprised of a mix of human and avatar faces. Experimental results indicate that it can be solved 62% of the time by human users with an average success time of 24 seconds and a positive user...
Human commonsense is required to improve quality of robotic application. However, to acquire the necessary knowledge, robot needs to evaluate the appropriateness of the data it has collected. This paper presents an evaluation method, by combining the weighting mechanism in commonsense databases with a set of weighting factors. The method was verified on our Basic-level Knowledge Network. We conducted...
This paper presents an approach for computing power grasps for hands with kinematic structure similar to the human hand, which allows the implementation of strategies inspired in human grasping actions. The proposed method first samples the object surface to look for the best spots for creating an opposing grasp with two or three fingers, and then aligns the other fingers to match the local curvature...
We solve the problem of localizing and tracking household objects using a depth-camera sensor network. We design and implement Kin sight that tracks household objects indirectly -- by tracking human figures, and detecting and recognizing objects from human-object interactions. We devise two novel algorithms: (1) Depth Sweep -- that uses depth information to efficiently extract objects from an image,...
We present a new gait identification method based on dynamic time warping (DTW), as video surveillance system requires high accuracy and precision. It could reduce computational cost of gait recognition, significantly improve the recognition rate for gait and meet the demand of video surveillance. The characters of human appearance have been utilized to extract entire binary image of human silhouette...
Natural body gesture, as well as speech dialog, is crucial for human-robot interaction and human-robot symbiosis. We have already proposed a real-time gesture planning method. In this paper, we afford this method more flexibility by adding motion parameterization function. Especially in multi-person HRI, this function becomes more important because of its adaptation to changes of a speaker's and/or...
The study is about the influence of face in videos. In the experiment, the participants were instructed free viewing of various videos. The resulting eye positions are compared to the hand-labeled faces to evaluate the impact of location and number of faces in the visual field. Here, we defined three regions—Inside (I), Periphery (P), and Outside (O)—to categorize video frames with one or two faces...
In this paper we propose the first (to the best of our knowledge) overall quality assessment scheme for facial images based on statistical learning. The overall quality assessment system is trained on the subjective quality scores, and is with a high fidelity to the human vision system (HVS) model. This scheme employs a hierarchical binary decision tree classifier based on support vector machines...
This paper presents a new approach to enhancing the text readability of the quad RGBW color electrophoretic display (EPD). In the color EPD, text characters are jagged due to its low resolution and the jaggedness degrades the readability of the text. However, text characters are usually black-and-white, and for the black-and-white character, it is possible to improve readability by relocating the...
Gender and ethnicity classification are challenging topics in the field of face analysis. Some features, like skin color, are relevant only for ethnicity but not for gender; some others, like face geometry, are important for both. The impact of ethnicity in gender perception, as the effect of gender on ethnicity disambiguation, is not clear. This paper provides a study to check if gender and ethnicity...
This paper presents the result of a recent large-scale subjective study of image retargeting quality on a collection of images generated by several representative image retargeting methods. Owning to many approaches to image retargeting that are developed, there is a need for a diverse independent public database of the retargeted images and the corresponding subjective scores that is freely available...
In this paper we evaluate the impact of different encoding configurations such as compression ratios, frame rates and resolution on the perceived quality of high definition video-conference applications. After generating a high quality video database, degraded sequences had their quality assessed by state-of-the-art automatic metrics. Results have shown that, for low rates, it is preferable to decrease...
Landscapes are essential for society, tourism industries and local communities. The development of tools capable to assess their environmental and socioeconomic importance are fundamental to preserve their aesthetic integrity, especially in coastal areas facing strong anthropogenic pressure. Online photo databases enable users to localize their images via GPS coordinates and share their photo albums...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.