In this paper an efficient method for image retargeting is proposed. It relies on a Monte Carlo model that makes use of image saliency. Each random sample is drawn from a suitably defined deformation probability mass function and shrinks or enlarges the image by a fixed size. The shape of the function, which determines which regions of the image are affected by the deformations, depends on the image...
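The sampling idea in this abstract can be sketched as follows. This is a minimal illustration of the shrink case only, and it assumes a specific PMF shape (removal probability proportional to one minus column saliency); the function name, the image representation as lists of rows, and the PMF form are illustrative assumptions, not the paper's actual model:

```python
import random

def retarget_width(image, saliency, n_samples, rng=None):
    """Shrink an image's width by n_samples columns via Monte Carlo sampling.

    image:    list of rows, each a list of pixel values (illustrative format)
    saliency: per-column saliency scores in [0, 1]; low-saliency columns
              are more likely to be removed
    """
    rng = rng or random.Random(0)
    img = [row[:] for row in image]
    sal = saliency[:]
    for _ in range(n_samples):
        # Assumed deformation PMF: probability of removing column j is
        # proportional to (1 - saliency_j), so salient regions are
        # largely preserved (epsilon keeps all weights positive).
        weights = [max(1.0 - s, 1e-6) for s in sal]
        total = sum(weights)
        probs = [w / total for w in weights]
        # Draw one random sample (a column index) from the PMF and
        # apply a fixed-size deformation: remove that single column.
        j = rng.choices(range(len(sal)), weights=probs, k=1)[0]
        for row in img:
            del row[j]
        del sal[j]
    return img
```

Each call to `rng.choices` is one random sample from the deformation PMF; the enlarge case would duplicate the sampled column instead of deleting it.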
Unconstrained face recognition remains a challenging computer vision problem despite recent exceptionally high results (∼ 95% accuracy) on the current gold standard evaluation dataset: Labeled Faces in the Wild (LFW) (Huang et al., 2008; Chen et al., 2013). We offer a decomposition of the unconstrained problem into subtasks based on the idea that invariance to identity-preserving transformations is...
In this paper we present a novel method for the automatic analysis of mobile eye-tracking data in natural environments. Mobile eye-trackers generate large amounts of data, making manual analysis very time-consuming. Available solutions, such as marker-based analysis, minimize manual labour but require experimental control, making real-life experiments practically infeasible. We present a novel...
Public speaking is a non-trivial task, since its success is affected by how nonverbal behaviors are expressed. Practicing the appropriate expressions is difficult because they are mostly produced subconsciously. This paper presents our empirical study on the nonverbal behaviors of presenters. This information was used as the ground truth to develop an intelligent tutoring system. The system can capture...
In this paper we present a novel system to extract keyframes, shot clusters and structural storyboards for video content description, which can be used for a variety of summarization, visualization, classification, indexing and retrieval applications. The system automatically selects an appealing set of keyframes and creates meaningful clusters of shots. It further identifies sections that appear...
Movie summarization aims at condensing a full-length movie to a significantly shortened version that still preserves the movie's major semantic content. In this paper, we propose a learning-based movie summarization framework via role-community social network analysis and feature fusion. In our framework, scene-based movie summarization is formulated as a 0–1 knapsack problem, where the scene attention...
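The 0–1 knapsack formulation mentioned in this abstract can be sketched with standard dynamic programming; here scene durations act as item weights, scene attention scores as item values, and the target summary length as the knapsack capacity. The function name, the integer-second durations, and the score semantics are illustrative assumptions, not the paper's notation:

```python
def select_scenes(durations, attention, budget):
    """0-1 knapsack: pick scenes maximizing total attention score
    subject to a total-duration budget (capacity, in seconds).
    Returns (best_score, sorted indices of chosen scenes)."""
    n = len(durations)
    # dp[w] = best attention score achievable with total duration <= w
    dp = [0.0] * (budget + 1)
    keep = [[False] * (budget + 1) for _ in range(n)]
    for i in range(n):
        # Iterate capacities downward so each scene is used at most once.
        for w in range(budget, durations[i] - 1, -1):
            cand = dp[w - durations[i]] + attention[i]
            if cand > dp[w]:
                dp[w] = cand
                keep[i][w] = True
    # Backtrack to recover which scenes made up the optimum.
    chosen, w = [], budget
    for i in range(n - 1, -1, -1):
        if keep[i][w]:
            chosen.append(i)
            w -= durations[i]
    return dp[budget], sorted(chosen)
```

For example, with durations [3, 4, 5], attention scores [4.0, 5.0, 6.0], and a 7-second budget, the optimum keeps the first two scenes.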
This document presents a study of the potential of head movement for lie detection. The potential was analyzed using a non-invasive technique that detects head movement from video. Since the literature contains a great deal of information on lie indicators, we provide a short review of them. An application was built to detect head movement and head position by performing...
The study investigates gender disparity in the perception of video face replacement. It is inspired by the face replacement techniques prevalent in the film-making industry, as well as the gender disparity that exists in various aspects of human life. A user study was conducted that included a quality-rating task on face replacement videos. Results show that there is a significant difference...
Face More, a cloud-based face beautification platform for intelligent face manipulation, is developed in this work. It provides a flexible and efficient cloud API for developing automatic or interactive face retouching applications. A website, www.facemore.net, is built on Face More, where users can upload images and obtain various online face beautification services. To obtain automatic inhomogeneous editing...
Emoticons are often used in short messages to briefly describe actions, feelings, and so on. They can also convey sentimental intent that is difficult to express in language alone. Recently, sentiment analysis has focused on cases such as elections and economic markets. Considering emoticons is also useful in such cases, and as a first step, emoticon extraction from text is...
In this paper we propose synchronization rules between acoustic and visual laughter synthesis systems. Previous work has addressed acoustic and visual laughter synthesis separately, each following an HMM-based approach. The need for synchronization rules comes from the constraint that, in laughter, HMM-based synthesis cannot be performed with a unified system in which common transcriptions may be used...
Depression is a common mood disorder that affects people mentally and even physically. People suffering from depression often exhibit abnormalities in visual behavior and in the voice. In this paper, an audio-visual multimodal depression scale prediction system is proposed. First, features extracted from video and audio are fused at the feature level to represent audio-visual behavior....
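The feature-level (early) fusion step described in this abstract can be sketched as follows. The per-modality z-score normalization is a common choice added here as an assumption (so neither modality dominates the fused vector); the function names and the row-per-sample feature format are likewise illustrative, not the paper's pipeline:

```python
def zscore(rows):
    """Column-wise z-score normalization of a list of feature vectors."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    stds = [max((sum((r[j] - means[j]) ** 2 for r in rows) / n) ** 0.5, 1e-8)
            for j in range(d)]
    return [[(r[j] - means[j]) / stds[j] for j in range(d)] for r in rows]

def early_fusion(video_feats, audio_feats):
    """Feature-level fusion: normalize each modality separately, then
    concatenate the per-sample vectors into one audio-visual descriptor."""
    assert len(video_feats) == len(audio_feats), "modalities must be aligned"
    v, a = zscore(video_feats), zscore(audio_feats)
    return [vi + ai for vi, ai in zip(v, a)]
```

The fused descriptors would then feed a single regressor that predicts the depression scale score.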
We give an overview of engagement in human-agent interaction. We discuss the different definitions of engagement in human and social science, specify how they relate to certain other concepts, and give an overview of the high level behaviour that is often associated with engagement. This work serves to position our future research on engagement in human-agent interaction.
In this paper, we study the perception of intensity incongruence between the auditory and visual modalities of synthesized expressions of laughter. In particular, we investigate whether incongruent expressions are perceived as 1) regulated, and 2) unsuccessful in terms of animation synthesis. For this purpose, we conducted a perceptual study using a virtual agent. Congruent and incongruent multimodal...
How can affective information be decoded efficiently when computational resources and sensor systems are limited? This paper presents a framework for the analysis of affective behavior starting from a reduced amount of visual information related to human upper-body movements. The main goal is to identify a minimal representation of emotional displays based on non-verbal gesture features. The GEMEP (Geneva...
We propose an autism spectrum disorder (ASD) prediction system based on machine learning techniques. Our work features the novel development and application of machine learning methods over traditional ASD evaluation protocols. Specifically, we are interested in discovering the latent patterns that may indicate symptoms of ASD underlying the observed eye movements. A group of subjects...
In this paper, we address the problem of automatically detecting engagement in multi-party Human-Robot Interaction scenarios. The aim is to investigate to what extent we are able to infer the engagement of one entity in a group based solely on the cues of the other entities present in the interaction. In a scenario featuring three entities, two participants and a robot, we extract behavioural...
It is easy for human beings to discern whether an observed acoustic signal is direct speech, reflected speech, or noise simply by listening. Relying purely on acoustic cues is enough for humans to discriminate between these kinds of sound sources, but this is not straightforward for machines. A robot equipped with a current robot audition mechanism will, in most cases, fail to differentiate...
Infants' visual recognition abilities are typically studied using variations of preferential looking paradigms. In this broad class of tasks, the extent to which infants discriminate between, categorize, and recognize complex images is determined by which of two test images they prefer to look at. This preference is usually expressed by calculating the proportion of total looking time allocated to...
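The proportion-of-looking-time measure described in this abstract is simple arithmetic; a minimal sketch follows. The novelty-preference framing, the per-trial `(novel, familiar)` pair format, and the function name are illustrative assumptions about one common variant of the paradigm:

```python
def novelty_preference(trials):
    """trials: list of (novel_seconds, familiar_seconds) looking times,
    one pair per test trial. Returns the mean proportion of total looking
    time allocated to the novel image; values reliably above 0.5 are
    taken as evidence that the infant recognizes the familiar image."""
    props = [novel / (novel + familiar) for novel, familiar in trials]
    return sum(props) / len(props)
```

For example, trials of (6 s novel, 2 s familiar) and (3 s, 3 s) give a mean novelty preference of 0.625.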
Infants' processing of adult social cues develops late in the first year. Sensitivity before 6 months is limited to nonspecific motion-cuing by lateral eye movements. Results from naturalistic and experimental studies show that learning is sensitive to factors including target location, target salience, gaze-cue salience, and the presence of distractors or non-gaze social cues. Those results are consistent...