Search results

chapter

An efficient audiovisual saliency model to predict eye positions when looking at conversations

Antoine Coutrot, Nathalie Guyader

2015 23rd European Signal Processing Conference (EUSIPCO) > 1531 - 1535

2015 23rd European Signal Processing Conference (EUSIPCO)

Classic models of visual attention dramatically fail at predicting eye positions on visual scenes involving faces. While some recent models combine faces with low-level features, none of them consider sound as an input. Yet it is crucial in conversation or meeting scenes. In this paper, we describe and refine an audiovisual saliency model for conversation scenes. This model includes a speaker diarization...

chapter

Visualization of prostatic nerves using polarization-sensitive optical coherence tomography

Yeoreum Yoon, Yong Hyun Park, Seung Hwan Jeon, Won Hyuk Jang, et al.

2015 11th Conference on Lasers and Electro-Optics Pacific Rim (CLEO-PR) > 2 > 1 - 2

2015 11th Conference on Lasers and Electro-Optics Pacific Rim (CLEO-PR)

We demonstrate that polarization-sensitive optical coherence tomography (PS-OCT) can identify the cavernous nerve in the human and rat prostate ex vivo based on its birefringence. PS-OCT may be useful for nerve preservation during radical prostatectomy.

chapter

Towards the verification of image integrity in online news

Cecilia Pasquini, Carlo Brunetta, Andrea F. Vinci, Valentina Conotter, et al.

2015 IEEE International Conference on Multimedia & Expo Workshops (ICMEW) > 1 - 6

2015 IEEE International Conference on Multimedia & Expo Workshops (ICMEW)

The widespread use of social networking services allows users to share and quickly spread an enormous amount of digital content. Currently, a low level of security and trustworthiness is applied to such information, whose reliability cannot be taken for granted due to the wide availability of image editing software, which allows any user to easily manipulate digital content. This has a huge impact on...

chapter

Investigating human behaviors in selecting personal photos to preserve memories

Andrea Ceroni, Vassilis Solachidis, Mingxin Fu, Nattiya Kanhabua, et al.

2015 IEEE International Conference on Multimedia & Expo Workshops (ICMEW) > 1 - 6

2015 IEEE International Conference on Multimedia & Expo Workshops (ICMEW)

Photos are an excellent means of keeping and refreshing memories. Digital photography, however, imposes new challenges for keeping photos accessible in the long run due to threats such as hard disk crashes, format changes, or storage medium decay. Safe long-term preservation, ensuring the longevity of photos, comes at a cost, suggesting a restriction of this investment to the most valuable photos. Therefore,...

chapter

Face Retrieval on Large-Scale Video Data

Christian Herrmann, Jurgen Beyerer

2015 12th Conference on Computer and Robot Vision > 192 - 199

2015 12th Conference on Computer and Robot Vision (CRV)

Increasingly large amounts of video data raise the question of whether large-scale face retrieval is feasible. To find fast and accurate matching strategies, a corresponding face track descriptor is constructed by using local features, extended by an encoding of the respective measurement conditions. The feature encoding allows collecting all features of one face track together in a single feature set, where...

chapter

Interpretable video representation

Lukas Diem, Maia Zaharieva

2015 13th International Workshop on Content-Based Multimedia Indexing (CBMI) > 1 - 6

2015 13th International Workshop on Content-Based Multimedia Indexing (CBMI)

The immense amount of available video data poses novel requirements for video representation approaches: focusing on central and relevant aspects of the underlying story and facilitating an efficient overview and assessment of the content. In general, the assessment of content relevance and significance is a high-level task that usually requires human intervention. However, some filming...

chapter

A hybrid approach for retrieving diverse social images of landmarks

Duc-Tien Dang-Nguyen, Luca Piras, Giorgio Giacinto, Giulia Boato, et al.

2015 IEEE International Conference on Multimedia and Expo (ICME) > 1 - 6

2015 IEEE International Conference on Multimedia and Expo (ICME)

In this paper, we present a novel method that can produce a visual description of a landmark by choosing the most diverse pictures that best describe all the details of the queried location from community-contributed datasets. The main idea of this method is to filter out non-relevant images at a first stage and then cluster the images according to textual descriptors first, and then to visual descriptors...

chapter

Hierarchical clustering pseudo-relevance feedback for social image search result diversification

Bogdan Boteanu, Ionut Mironica, Bogdan Ionescu

2015 13th International Workshop on Content-Based Multimedia Indexing (CBMI) > 1 - 6

2015 13th International Workshop on Content-Based Multimedia Indexing (CBMI)

This article addresses the issue of social image search result diversification. We propose a novel perspective for the diversification problem via Relevance Feedback (RF). Traditional RF introduces the user in the processing loop by harvesting feedback about the relevance of the search results. This information is used for recomputing a better representation of the data needed. The novelty of our...

chapter

Learning Gaussian mixture model for saliency detection on face images

Yun Ren, Mai Xu, Ruihan Pan, Zulin Wang

2015 IEEE International Conference on Multimedia and Expo (ICME) > 1 - 6

2015 IEEE International Conference on Multimedia and Expo (ICME)

Previous work has demonstrated that integrating top-down features in bottom-up saliency methods can improve the saliency prediction accuracy. Therefore, for face images, this paper proposes a saliency detection method based on a Gaussian mixture model (GMM), which learns the distribution of saliency over face regions as the top-down feature. Specifically, we verify that fixations tend to cluster around...

chapter

Active crosstalk reduction system for multiview autostereoscopic displays

Philippe Hanhart, Carmelo di Nolfo, Touradj Ebrahimi

2015 IEEE International Conference on Multimedia and Expo (ICME) > 1 - 6

2015 IEEE International Conference on Multimedia and Expo (ICME)

Multiview autostereoscopic displays are considered the future of 3DTV. However, these displays suffer from a high level of crosstalk, which negatively impacts quality of experience (QoE). In this paper, we propose a system to improve 3D QoE on multiview autostereoscopic displays. First, the display is characterized in terms of luminance distribution. Then, the luminance profiles are modeled using...

chapter

You are what you tweet…pic! gender prediction based on semantic analysis of social media images

Michele Merler, Liangliang Cao, John R. Smith

2015 IEEE International Conference on Multimedia and Expo (ICME) > 1 - 6

2015 IEEE International Conference on Multimedia and Expo (ICME)

We propose a method to extract user attributes from the pictures posted in social media feeds, specifically gender information. While traditional approaches rely on text analysis or exploit visual information only from the user profile picture or colors, we propose to look at the distribution of semantics in the pictures coming from the whole feed of a person to estimate gender. In order to compute...

chapter

Steering All: Infinite Racing Game Controlled by Haar Cascade through OpenCV

Alison de Araujo Bento, Leonardo Cunha de Miranda

2015 XVII Symposium on Virtual and Augmented Reality > 245 - 254

2015 XVII Symposium on Virtual and Augmented Reality (SVR)

In the contemporary world, new forms of interaction for digital games are increasingly being developed. In this work, we present an infinite racing game inspired by the brick game racing. Through the application's interaction entities, called Interacts, and using Haar cascades through OpenCV, we extend the possibilities for controlling this game. Thus, beyond keyboard and mouse, it is also possible...

chapter

Segmentation Quality for Augmented Reality: An Objective Metric

Danilo Vitori Salioni, Silvio R.R. Sanches, Valdinei F. Silva, Tiago de Gaspari, et al.

2015 XVII Symposium on Virtual and Augmented Reality > 212 - 219

2015 XVII Symposium on Virtual and Augmented Reality (SVR)

Assessing the quality of segmentation algorithms with respect to user perception is an important problem in Computer Vision. For this purpose, a metric must take into account the impact of the different types of errors displayed to users. In this work we developed a new objective metric to assess the quality obtained by bilayer segmentation algorithms when they are used in Augmented Reality applications...

chapter

Prediction gradients for feature extraction and analysis from convolutional neural networks

Henry Z. Lo, Joseph Paul Cohen, Wei Ding

2015 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG) > 1 - 6

2015 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG)

Despite their impact on computer vision and face recognition, the inner workings of deep convolutional neural networks (CNNs) have traditionally been regarded as uninterpretable. We demonstrate this to be false by proposing prediction gradients to understand how neural networks encode concepts into individual units. In contrast, existing efforts to understand convolutional nets focus on visualizing...

chapter

Circle-based eye center localization (CECL)

Yustinus Eko Soelistio, Eric Postma, Alfons Maes

2015 14th IAPR International Conference on Machine Vision Applications (MVA) > 349 - 352

2015 14th IAPR International Conference on Machine Vision Applications (MVA)

The ability to automatically detect eye center locations in video images allows for estimating gaze direction. This, in turn, facilitates the study of human-computer interaction and behavioral analyses of social interactions. We propose an improved eye center localization method based on the Hough transform, called Circle-based Eye Center Localization (CECL), that is simple, robust, and achieves accuracy...

chapter

Tri-subjects kinship verification: Understanding the core of a family

Xiaoqian Qin, Xiaoyang Tan, Songcan Chen

2015 14th IAPR International Conference on Machine Vision Applications (MVA) > 580 - 583

2015 14th IAPR International Conference on Machine Vision Applications (MVA)

Recent research has demonstrated that computer vision algorithms understand individual face images fairly well. However, one major challenge in computer vision is to go beyond that and to investigate the bi- or tri-relationship among multiple visual entities, answering such questions as whether a child in a photo belongs to given parents. Indeed, the parent-child relationship plays a core role in...

chapter

Heterogeneous feature fusion-based optimal face image acquisition in visual sensor network

Kuicheng Lin, Xue Wang, Sujin Cui, Yuqi Tan

2015 IEEE International Instrumentation and Measurement Technology Conference (I2MTC) Proceedings > 1078 - 1083

2015 IEEE International Instrumentation and Measurement Technology Conference (I2MTC)

High-quality face image acquisition from the huge volume of video data obtained in visual sensor networks is of great significance in applications related to face processing, such as face recognition and reconstruction. This paper proposes an optimal face image acquisition method for visual sensor networks, which is based on collaborative face frame acquisition and heterogeneous feature fusion-based face quality...

chapter

Foveated Manifold Sensing for object recognition

Irina Burciu, Thomas Martinetz, Erhardt Barth

2015 IEEE International Black Sea Conference on Communications and Networking (BlackSeaCom) > 196 - 200

2015 IEEE International Black Sea Conference on Communications and Networking (BlackSeaCom)

We present a novel method, Foveated Manifold Sensing, for the adaptive and efficient sensing of the visual world. The method is based on algorithms that learn manifolds of increasing but low dimensionality for representative data. As opposed to Manifold Sensing, the new foveated version senses only the most salient areas of a scene. This leads to an efficient sensing strategy that requires only a...

chapter

Surface flow visualization using the closest point embedding

Mark Kim, Charles Hansen

2015 IEEE Pacific Visualization Symposium (PacificVis) > 17 - 23

2015 IEEE Pacific Visualization Symposium (PacificVis)

In this paper, we introduce a novel flow visualization technique for arbitrary surfaces. This new technique utilizes the closest point embedding to represent the surface, which allows for accurate particle advection on the surface as well as supports the unsteady flow line integral convolution (UFLIC) technique on the surface. This global approach is faster than previous parameterization techniques...

chapter

Photo-real talking head with deep bidirectional LSTM

Bo Fan, Lijuan Wang, Frank K. Soong, Lei Xie

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4884 - 4888

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Long short-term memory (LSTM) is a specific recurrent neural network (RNN) architecture that is designed to model temporal sequences and their long-range dependencies more accurately than conventional RNNs. In this paper, we propose to use deep bidirectional LSTM (BLSTM) for audio/visual modeling in our photo-real talking head system. An audio/visual database of a subject talking is first recorded...

INFONA - science communication portal