2016 23rd International Conference on Pattern Recognition (ICPR)

chapter

Large-scale Isolated Gesture Recognition using Convolutional Neural Networks

Pichao Wang, Wanqing Li, Song Liu, Zhimin Gao, more

2016 23rd International Conference on Pattern Recognition (ICPR) > 7 - 12

This paper proposes three simple, compact yet effective representations of depth sequences, referred to respectively as Dynamic Depth Images (DDI), Dynamic Depth Normal Images (DDNI) and Dynamic Depth Motion Normal Images (DDMNI). These dynamic images are constructed from a sequence of depth maps using bidirectional rank pooling to effectively capture the spatial-temporal information. Such image-based...

chapter

Large-scale Continuous Gesture Recognition Using Convolutional Neural Networks

Pichao Wang, Wanqing Li, Song Liu, Yuyao Zhang, more

2016 23rd International Conference on Pattern Recognition (ICPR) > 13 - 18

2016 23rd International Conference on Pattern Recognition (ICPR)

This paper addresses the problem of continuous gesture recognition from sequences of depth maps using Convolutional Neural networks (ConvNets). The proposed method first segments individual gestures from a depth sequence based on quantity of movement (QOM). For each segmented gesture, an Improved Depth Motion Map (IDMM), which converts the depth sequence into one image, is constructed and fed to a...

chapter

Large-scale Isolated Gesture Recognition using pyramidal 3D convolutional networks

Guangming Zhu, Liang Zhang, Lin Mei, Jie Shao, more

2016 23rd International Conference on Pattern Recognition (ICPR) > 19 - 24

2016 23rd International Conference on Pattern Recognition (ICPR)

Human gesture recognition is one of the central research fields of computer vision, and effective gesture recognition is still challenging up to now. In this paper, we present a pyramidal 3D convolutional network framework for large-scale isolated human gesture recognition. 3D convolutional networks are utilized to learn the spatiotemporal features from gesture video files. Pyramid input is proposed...

chapter

Large-scale gesture recognition with a fusion of RGB-D data based on the C3D model

Yunan Li, Qiguang Miao, Kuan Tian, Yingying Fan, more

2016 23rd International Conference on Pattern Recognition (ICPR) > 25 - 30

2016 23rd International Conference on Pattern Recognition (ICPR)

The gesture recognition has raised attention in computer vision owing to its many applications. However, video-based large-scale gesture recognition still faces many challenges, since many factors like background may disturb the accuracy. To achieve gesture recognition with large-scale videos, we propose a method based on RGB-D data. To learn gesture details better, the inputs are expanded into 32-frame...

chapter

Two streams Recurrent Neural Networks for Large-Scale Continuous Gesture Recognition

Xiujuan Chai, Zhipeng Liu, Fang Yin, Zhuang Liu, more

2016 23rd International Conference on Pattern Recognition (ICPR) > 31 - 36

2016 23rd International Conference on Pattern Recognition (ICPR)

In this paper, we tackle the continuous gesture recognition problem with a two streams Recurrent Neural Networks (2S-RNN) for the RGB-D data input. In our framework, the spotting-recognition strategy is used, that means the continuous gestures are first segmented into separated gestures, and then each isolated gesture is recognized by using the 2S-RNN. Concretely, the gesture segmentation is based...

chapter

Automatic personality prediction from audiovisual data using random forest regression

Berkay Aydin, Ahmet Alp Kindiroglu, Oya Aran, Lale Akarun

2016 23rd International Conference on Pattern Recognition (ICPR) > 37 - 42

2016 23rd International Conference on Pattern Recognition (ICPR)

In this paper, we focus on describing the method we designed for automatic perceived personality prediction. We present a simple model that uses three different sets of features: nonverbal audio cues, visual cues from video, and facial landmark points. The model uses a random decision forest to do regression from the extracted features. As we discuss in Section 4, this multimodal model performs relatively...

chapter

Multimodal fusion of audio, scene, and face features for first impression estimation

Furkan Gurpinar, Heysem Kaya, Albert Ali Salah

2016 23rd International Conference on Pattern Recognition (ICPR) > 43 - 48

2016 23rd International Conference on Pattern Recognition (ICPR)

Affective computing, particularly emotion and personality trait recognition, is of increasing interest in many research disciplines. The interplay of emotion and personality shows itself in the first impression left on other people. Moreover, the ambient information, e.g. the environment and objects surrounding the subject, also affect these impressions. In this work, we employ pre-trained Deep Convolutional...

chapter

Using Convolutional 3D Neural Networks for User-independent continuous gesture recognition

Necati Cihan Camgoz, Simon Hadfield, Oscar Koller, Richard Bowden

2016 23rd International Conference on Pattern Recognition (ICPR) > 49 - 54

2016 23rd International Conference on Pattern Recognition (ICPR)

In this paper, we propose using 3D Convolutional Neural Networks for large scale user-independent continuous gesture recognition. We have trained an end-to-end deep network for continuous gesture recognition (jointly learning both the feature representation and the classifier). The network performs three-dimensional (i.e. space-time) convolutions to extract features related to both the appearance...

chapter

Bi-modal regression for Apparent Personality trait Recognition

Nishant Rai

2016 23rd International Conference on Pattern Recognition (ICPR) > 55 - 60

2016 23rd International Conference on Pattern Recognition (ICPR)

The task of the ChaLearn Apparent Personality Analysis: First Impressions Challenge is to rate/quantify personality traits of users in short video sequences. Although the validity of personality judgments from short interactions is questionable, studies show the possibility of predicting attributed traits (First Impressions) using facial [15] and acoustic [13] features. The challenge introduces a...

chapter

Fusion of classifier predictions for audio-visual emotion recognition

Fatemeh Noroozi, Marina Marjanovic, Angelina Njegus, Sergio Escalera, more

2016 23rd International Conference on Pattern Recognition (ICPR) > 61 - 66

2016 23rd International Conference on Pattern Recognition (ICPR)

In this paper is presented a novel multimodal emotion recognition system which is based on the analysis of audio and visual cues. MFCC-based features are extracted from the audio channel and facial landmark geometric relations are computed from visual data. Both sets of features are learnt separately using state-of-the-art classifiers. In addition, we summarise each emotion video into a reduced set...

chapter

Deep convolutional neural network based HEp-2 cell classification

Xi Jia, Linlin Shen, Xiande Zhou, Shiqi Yu

2016 23rd International Conference on Pattern Recognition (ICPR) > 77 - 80

2016 23rd International Conference on Pattern Recognition (ICPR)

As different staining patterns of HEp-2 cells indicate different diseases, the classification of Indirect Immune Fluorescence (IIF) images on Human Epithelial-2 (HEp-2) cell is important for clinical applications. Different from traditional pattern recognition techniques, we use CNN to extract more high-level features for cell images classification. Compared to the existing CNN based HEp-2 classification...

chapter

HEp-2 cell classification using artificial neural network approach

Divya BS, Kamalraj Subramaniam, Nanjundaswamy HR

2016 23rd International Conference on Pattern Recognition (ICPR) > 84 - 89

2016 23rd International Conference on Pattern Recognition (ICPR)

Human Epithelial type-2 (HEp-2) cells are used as substrates for the detection of Anti Nuclear Antibodies (ANA) in the Indirect Immunofluorescence (IIF) test to diagnose autoimmune diseases. Pathologists in the laboratory examine the IIF slides to detect and recognize theHEp-2 cell patterns to generate the report. So, the IIF test is subjective and requires objective analysis. This paper introduces...

chapter

HEp-2 specimen classification with fully convolutional network

Yuexiang Li, Linlin Shen, Xiande Zhou, Shiqi Yu

2016 23rd International Conference on Pattern Recognition (ICPR) > 96 - 100

2016 23rd International Conference on Pattern Recognition (ICPR)

Reliable automatic system for Human Epithelial-2 (HEp-2) cell image classification can facilitate the diagnosis of systemic autoimmune diseases. In this paper, an automatic pattern recognition system using fully convolutional network (FCN) was proposed to address the HEp-2 specimen classification problem. The FCN in the proposed framework was adapted from VGG-16, which was trained with ICPR 2016 dataset...

chapter

Local descriptors fusion for mobile iris verification

N. Aginako, J.M. Martinez-Otzerta, B. Sierra, M. Castrillon-Santana, more

2016 23rd International Conference on Pattern Recognition (ICPR) > 165 - 169

2016 23rd International Conference on Pattern Recognition (ICPR)

This paper summarizes the proposal submitted by the joint team conformed by researchers from UPV and ULPGC to the Mobile Iris CHallenge Evaluation II. The approach makes use of a state-of-the-art iris segmentation technique, to later extract features making use of local descriptors. Those suitable to the problem are selected after evaluating a collection of 15 local descriptors, covering a range of...

chapter

Aligning the dissimilar: A probabilistic method for feature-based point set registration

Martin Danelljan, Giulia Meneghetti, Fahad Shahbaz Khan, Michael Felsberg

2016 23rd International Conference on Pattern Recognition (ICPR) > 247 - 252

2016 23rd International Conference on Pattern Recognition (ICPR)

3D-point set registration is an active area of research in computer vision. In recent years, probabilistic registration approaches have demonstrated superior performance for many challenging applications. Generally, these probabilistic approaches rely on the spatial distribution of the 3D-points, and only recently color information has been integrated into such a framework, significantly improving...

chapter

Implicit hybrid video emotion tagging by integrating video content and users' multiple physiological responses

Shiyu Chen, Shangfei Wang, Chongliang Wu, Zhen Gao, more

2016 23rd International Conference on Pattern Recognition (ICPR) > 295 - 300

2016 23rd International Conference on Pattern Recognition (ICPR)

The intrinsic interactions among a video's emotion tag, its content, and a user's spontaneous response while consuming the video can be leveraged to improve video emotion tagging, but this capability has not been thoroughly exploited yet. In this paper, we propose an implicit hybrid video emotion tagging approach by integrating video content and users' multiple physiological responses, which are only...

chapter

Employing subjects' information as privileged information for emotion recognition from EEG signals

Shan Wu, Shangfei Wang, Yachen Zhu, Zhen Gao, more

2016 23rd International Conference on Pattern Recognition (ICPR) > 301 - 306

2016 23rd International Conference on Pattern Recognition (ICPR)

Current research of emotion recognition from electroencephalogram (EEG) signals rarely considers common patterns embodied in multiple subjects and individual patterns for each subject simultaneously. Therefore, in this paper, we propose a novel emotion recognition approach using subjects or subject groups as privileged information, which is only available during training. First, five frequency features...

chapter

Online speaker emotion tracking with a dynamic state transition model

Ozgun Cirakman, Bilge Gunsel

2016 23rd International Conference on Pattern Recognition (ICPR) > 307 - 312

2016 23rd International Conference on Pattern Recognition (ICPR)

Although emotional state recognition from voice has been extensively studied, there is not much effort focusing on the online emotion recognition. Since duration and intensity of emotional experiences change over time it is hard to employ existing static transition models while monitoring emotional states especially in an online setting. To overcome this difficulty we introduce a method which incorporates...

chapter

Face alignment with Cascaded Bidirectional LSTM Neural Networks

Yu Chen, Jianjun Qian, Jian Yang, Zhong Jin

2016 23rd International Conference on Pattern Recognition (ICPR) > 313 - 318

2016 23rd International Conference on Pattern Recognition (ICPR)

Face alignment is an important issue in many computer vision problems. The key problem is to find the nonlinear mapping from face image or feature to landmark locations. In this paper, we propose a novel cascaded approach with bidirectional Long Short Term Memory (LSTM) neural networks to approximate this nonlinear mapping. The cascaded structure is used to reduce the complexity of this problem and...

chapter

Learning effective Gait features using LSTM

Yang Feng, Yuncheng Li, Jiebo Luo

2016 23rd International Conference on Pattern Recognition (ICPR) > 325 - 330

2016 23rd International Conference on Pattern Recognition (ICPR)

Human gait is an important biometric feature for person identification in surveillance videos because it can be collected at a distance without subject cooperation. Most existing gait recognition methods are based on Gait Energy Image (GEI). Although the spatial information in one gait sequence can be well represented by GEI, the temporal information is lost. To solve this problem, we propose a new...

INFONA - science communication portal

2016 23rd International Conference on Pattern Recognition (ICPR)

Large-scale Isolated Gesture Recognition using Convolutional Neural Networks

Large-scale Continuous Gesture Recognition Using Convolutional Neural Networks

Large-scale Isolated Gesture Recognition using pyramidal 3D convolutional networks

Large-scale gesture recognition with a fusion of RGB-D data based on the C3D model

Two streams Recurrent Neural Networks for Large-Scale Continuous Gesture Recognition

Automatic personality prediction from audiovisual data using random forest regression

Multimodal fusion of audio, scene, and face features for first impression estimation

Using Convolutional 3D Neural Networks for User-independent continuous gesture recognition

Bi-modal regression for Apparent Personality trait Recognition

Fusion of classifier predictions for audio-visual emotion recognition

Deep convolutional neural network based HEp-2 cell classification

HEp-2 cell classification using artificial neural network approach

HEp-2 specimen classification with fully convolutional network

Local descriptors fusion for mobile iris verification

Aligning the dissimilar: A probabilistic method for feature-based point set registration

Implicit hybrid video emotion tagging by integrating video content and users' multiple physiological responses

Employing subjects' information as privileged information for emotion recognition from EEG signals

Online speaker emotion tracking with a dynamic state transition model

Face alignment with Cascaded Bidirectional LSTM Neural Networks

Learning effective Gait features using LSTM

Filter options

Publication date

Keywords

INFONA - science communication portal

2016 23rd International Conference on Pattern Recognition (ICPR) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2016 23rd International Conference on Pattern Recognition (ICPR)