Search results

chapter

Modeling temporal information using discrete fourier transform for recognizing emotions in user-generated videos

Haimin Zhang, Min Xu

2016 IEEE International Conference on Image Processing (ICIP) > 629 - 633

With the spread of user-generated Internet videos, emotion recognition in those videos is attracting increasing research effort. However, most existing works are based on frame-level visual features and/or audio features, which might fail to model temporal information, e.g. characteristics accumulated over time. In order to capture video temporal information, in this paper, we propose to analyse...

chapter

How scenes imply actions in realistic videos?

Hongsong Wang, Wei Wang, Liang Wang

2016 IEEE International Conference on Image Processing (ICIP) > 1619 - 1623

People drive on the road and eat in the kitchen. Can the road imply driving or the kitchen imply eating? This paper addresses such a problem by studying the relations between actions and scenes. To get an effective scene representation, we use a deep convolutional neural network (CNN) model trained on a scene-centric database to predict scene responses for videos. We employ two encoding schemes based...

chapter

Towards temporal adaptive representation for video action recognition

Junjie Cai, Jie Yu, Francisco Imai, Qi Tian

2016 IEEE International Conference on Image Processing (ICIP) > 4155 - 4159

Action recognition has been one of the challenging problems in the computer vision community. Most of the recent research work in this area exploits the motion features captured by dense trajectory descriptors. On the other hand, static image classification has seen the rise of deep learning architectures, with evidence that the output of intermediate layers could be successfully employed as a low...

chapter

Fine-grained maize cultivar identification using filter-specific convolutional activations

Hao Lu, Zhiguo Cao, Yang Xiao, Zhiwen Fang, more

2016 IEEE International Conference on Image Processing (ICIP) > 3718 - 3722

Cultivar identification is an important aspect of agriculture and also a typical task of fine-grained visual categorization (FGVC). In comparison with other common topics in FGVC, studies on this problem lag somewhat behind and are limited. In this paper, targeting four Chinese maize cultivars, Jundan No.20, Wuyue No.3, Nongda No.108, and Zhengdan No.958, we first consider the problem of identifying...

chapter

Learning zeroth class dictionary for human action recognition

Jia-xin Cai, Xin Tang, Lifang Zhang, Guocan Feng

2016 IEEE International Conference on Image Processing (ICIP) > 4175 - 4179

Document is unavailable: This DOI was registered to an article that was not presented by the author(s) at this conference. As per section 8.2.1.B.13 of IEEE's "Publication Services and Products Board Operations Manual," IEEE has chosen to exclude this article from distribution. We regret any inconvenience.

chapter

Abnormal event detection using spatio-temporal feature and nonnegative locality-constrained linear coding

Yu Zhao, Lei Zhou, Keren Fu, Jie Yang

2016 IEEE International Conference on Image Processing (ICIP) > 3354 - 3358

In this paper, an approach using the spatio-temporal feature and nonnegative locality-constrained linear coding (NLLC) is proposed to detect abnormal events in videos. This approach utilizes position-based spatio-temporal descriptors as the low-level representations of a video clip. Each descriptor consists of the position information of a space-time interest point and an appearance feature vector...

chapter

Fisher vector encoded deep convolutional features for unconstrained face verification

Jun-Cheng Chen, Jingxiao Zheng, Vishal M. Patel, Rama Chellappa

2016 IEEE International Conference on Image Processing (ICIP) > 2981 - 2985

We present a method to combine the Fisher vector representation and the Deep Convolutional Neural Network (DCNN) features to generate a representation, called the Fisher vector encoded DCNN (FV-DCNN) features, for unconstrained face verification. One of the key features of our method is that spatial and appearance information are simultaneously processed when learning the Gaussian mixture model to...

chapter

Keypoint trajectory coding on compact descriptor for video analysis

Dong Tian, Huifang Sun, Anthony Vetro

2016 IEEE International Conference on Image Processing (ICIP) > 171 - 175

In contrast to still image analysis, motion information offers a powerful means to analyze video. In particular, motion trajectories determined from keypoints have become very popular in recent years for a variety of video analysis tasks, including search, retrieval and classification. Additionally, cloud-based analysis of media content has been gaining momentum, so efficient communication of salient...

chapter

A novel method for splice sites prediction using sequence component and hidden Markov model

Elham Pashaei, Alper Yilmaz, Mustafa Ozen, Nizamettin Aydin

2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC) > 3076 - 3079

With the increasing growth of DNA sequence data, there is an urgent demand for new methods to accurately predict genes. The performance of gene detection methods mainly depends on the efficiency of splice site prediction methods. In this paper, a novel method for detecting splice sites is proposed by using a new effective DNA encoding method and AdaBoost.M1 classifier. Our proposed DNA...

chapter

Differentiating facial incongruity and flatness in schizophrenia, using structured light camera data

Talia Tron, Abraham Peled, Alexander Grinsphoon, Daphna Weinshall

2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC) > 2427 - 2430

Incongruity between emotional experience and its outward expression is one of the prominent symptoms of schizophrenia. Though widely reported and used in clinical evaluation, this symptom is inadequately defined in the literature and may be confused with mere affect flattening. In this study we used a structured-light depth camera and dedicated software to automatically measure facial activity of...

chapter

Vehicle Classification in Acoustic Sensor Networks Based on Hybrid Dictionary Learning

Shuilin Guo, Rui Wang, Bin Liu, Qiyue Wei, more

2016 IEEE 14th Intl Conf on Dependable, Autonomic and Secure Computing, 14th Intl Conf on Pervasive Intelligence and Computing, 2nd Intl Conf on Big Data Intelligence and Computing and Cyber Science and Technology Congress(DASC/PiCom/DataCom/CyberSciTech) > 861 - 865

In this paper, we consider a single-sensor classification problem, focusing on classifying the types of moving vehicles. To improve the classification accuracy with low time complexity in complex scenes, acoustic sensor data sets were captured to measure the physical events and a novel hybrid dictionary learning method for vehicle classification is proposed. The efficient hybrid dictionary...

chapter

Decoding of responses to mixed frequency and phase coded visual stimuli using multiset canonical correlation analysis

Kaori Suefusa, Toshihisa Tanaka

2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC) > 1492 - 1495

Brain-computer interfacing (BCI) based on steady-state visual evoked potentials (SSVEPs) is one of the most practical BCIs because of its high recognition accuracy and the little user training it requires. Mixed frequency and phase coding, which can implement a number of commands and achieve a high information transfer rate (ITR), has recently been gaining much attention. In order to implement mixed-coded SSVEP-BCI...

chapter

Bidirectional sparse representations for multi-shot person re-identification

Solene Chan-Lang, Quoc Cuong Pham, Catherine Achard

2016 13th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS) > 263 - 270

With the development of surveillance cameras, person re-identification has gained much interest. However, re-identifying people across cameras remains a challenging problem that requires not only a good feature description but also a reliable matching scheme. Our method can be applied with any feature and focuses on the second requirement. We propose a robust bidirectional sparse coding method that...

chapter

Improving surface normals based action recognition in depth images

Xuan Son Nguyen, Thanh Phuong Nguyen, Francois Charpillet

2016 13th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS) > 109 - 114

In this paper, we propose a new local descriptor for action recognition in depth images. Our proposed descriptor jointly encodes the shape and motion cues using surface normals in 4D space of depth, time, spatial coordinates and higher-order partial derivatives of depth values along spatial coordinates. In a traditional Bag-of-words (BoW) approach, local descriptors extracted from a depth sequence...

chapter

A scene recognition method using sparse features with layout-sensitive pooling and extreme learning machine

Lingying Wu, Yuanlong Yu, Jason Gu

2016 IEEE International Conference on Information and Automation (ICIA) > 178 - 183

Scene recognition aims to find a semantic explanation of a scene, i.e., it helps intelligent machines to know where they are. It can be widely applied to various tasks in computer vision and robotics. Most pioneering methods extracted a set of low-level features and fed them directly into a classifier to identify the scene category. But it has been shown that low-level features do not work well. Currently...

chapter

Improved local ternary pattern features with application to pedestrian detection

Jiatao Song, Wei Wang, Tuozhong Yao, Chunfeng Zhang, more

2016 IEEE International Conference on Information and Automation (ICIA) > 1365 - 1369

Pedestrian detection is one of the challenging research topics in computer vision, and efficient feature representation of pedestrians is attracting more and more attention. Traditional features such as Histogram of Oriented Gradients (HOG) were widely used in pedestrian detection, but because of their poor texture description ability, these feature-based methods cannot achieve satisfactory pedestrian...

chapter

A robust feature detection algorithm for the binary encoded single-shot structured light system

Hualie Jiang, Zhan Song

2016 IEEE International Conference on Information and Automation (ICIA) > 264 - 269

This work introduces a novel feature detection algorithm for the decoding of a binary encoded structured light pattern. To make the structured light pattern insensitive to surface color and texture, some geometrical shapes are used as the pattern elements. The grid point between each two adjacent rhombic pattern elements is defined as a feature point. Affected by the inner structure of pattern element,...

chapter

Combining Nonlinear Dimension Reduction and Hashing Method for Efficient Image Retrieval

Yang Li, Zhuang Miao, Yulong Xu, Hang Li, more

2016 12th International Conference on Semantics, Knowledge and Grids (SKG) > 126 - 130

For large-scale image retrieval, high-dimensional image representations derived from pre-trained Convolutional Neural Networks (CNNs) make the retrieval system inefficient. In this paper, we propose to combine nonlinear dimension reduction and a hashing method for efficient image retrieval. We first extract 4096-dimensional features with a pre-trained CNN model. Second, we use t-Distributed Stochastic...

chapter

Designing a Better Data Representation for Deep Neural Networks and Text Classification

Joseph D. Prusa, Taghi M. Khoshgoftaar

2016 IEEE 17th International Conference on Information Reuse and Integration (IRI) > 411 - 416

Traditional machine learning requires data to be described by attributes prior to applying a learning algorithm. In text classification tasks, many feature engineering methodologies have been proposed to extract meaningful features; however, no best-practice approach has emerged. Traditional methods of feature engineering have inherent limitations due to loss of information and the limits of human...

chapter

Nonlinear dictionary learning based deep neural networks

Hui Zhang, Huaping Liu, Rui Song, Fuchun Sun

2016 International Joint Conference on Neural Networks (IJCNN) > 3771 - 3776

In this paper, we demonstrate that nonlinear features extracted by a deep neural network yield better results in the task of dictionary learning. A nonlinear dictionary learning model is constructed and the optimization algorithm is developed. In the learning algorithm, we use the deep neural network to map raw samples to a feature space and learn a nonlinear dictionary. The extensive experimental results...

INFONA - science communication portal
