In this paper, we introduce a joint model that learns both to localize the temporal bounds of actions in untrimmed videos and to precisely classify which actions occur. Most existing approaches scan the whole video to generate action instances, which is highly inefficient. Instead, inspired by human perception, our model is formulated as a recurrent neural network that observes...
Conventional pedestrian detection methods build models on hand-crafted features or deep learning. They are powerful but limited by the finite capability of any single classifier. Ensemble models sidestep this limitation by combining multiple classifiers under hand-designed criteria that jointly exploit information from all constituent models. However, these criteria lack theoretical support...
Motion information is a key factor in action recognition and has been actively pursued for decades. How to effectively learn motion features in Convolutional Networks (ConvNets) remains an open issue. Prevalent ConvNets often take several full video frames as input at a time, which imposes a heavy burden on network training. In this paper, we introduce a novel framework called Tube ConvNets, by...
This paper proposes an adaptive approach that learns class-specific pooling shapes (CSPS) for image classification. Prevalent spatial pooling methods operate on predefined image grids, an ad-hoc choice that lacks generalization power across categories. In contrast, CSPS is designed in a data-driven fashion by generating a large set of candidate shapes and selecting...
This paper presents a compact shot representation for video semantic indexing (SIN). The proposed representation consists of visual cues from only two frames, a key frame (KF) and a difference frame (DF), both built with a spatial pyramid. The KF describes static information, while the generated DF captures non-static information. Each region of the DF is derived from the same location...
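The KF/DF idea above can be illustrated with a minimal sketch. Since the abstract is truncated before the exact derivation, the `difference_frame` helper and its absolute-difference formulation are assumptions, not the paper's actual construction:

```python
import numpy as np

def difference_frame(key_frame, other_frame):
    """Hypothetical difference frame: per-pixel absolute difference
    between the key frame and another frame of the same shot, so that
    static regions go to zero and moving regions stand out."""
    # Compute in a signed type to avoid uint8 wrap-around.
    diff = np.abs(other_frame.astype(np.int16) - key_frame.astype(np.int16))
    return diff.astype(np.uint8)
```

A static background produces a near-zero DF, while motion leaves large values, which is one plausible way a DF could capture the non-static information the abstract mentions.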
This paper proposes to employ a deep learning model to encode local descriptors for image classification. Previous works that use deep architectures to learn higher-level representations typically operate at the pixel level, which generalizes poorly to large, complex images because of the computational burden and the difficulty of capturing their internal structure. Our method removes this limitation by starting...
Spatial pyramid (SP) representation is an extension of the bag-of-features model that embeds the spatial layout of local features by pooling feature codes over pre-defined spatial shapes. However, the uniform spatial pooling shapes used in the standard SP are chosen in an ad-hoc manner without theoretical motivation, and thus lack the generalization power to adapt to different distributions of geometric...
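The standard SP pooling that this abstract builds on can be sketched as follows. This is a generic illustration of pyramid max-pooling over uniform grids, not the paper's proposed method; the function name, the max-pooling choice, and the pyramid levels are assumptions:

```python
import numpy as np

def spatial_pyramid_pool(codes, positions, width, height, levels=(1, 2, 4)):
    """Max-pool local feature codes over a pyramid of uniform grids.

    codes:     (N, D) array of feature codes for N local features
    positions: (N, 2) array of (x, y) locations of those features
    levels:    grid sizes; level k splits the image into k x k cells

    Returns a vector of length D * sum(k*k for k in levels).
    """
    pooled = []
    for k in levels:
        cell_w, cell_h = width / k, height / k
        for i in range(k):
            for j in range(k):
                # Select features whose location falls in cell (i, j).
                in_cell = (
                    (positions[:, 0] >= i * cell_w) & (positions[:, 0] < (i + 1) * cell_w)
                    & (positions[:, 1] >= j * cell_h) & (positions[:, 1] < (j + 1) * cell_h)
                )
                if in_cell.any():
                    pooled.append(codes[in_cell].max(axis=0))
                else:
                    pooled.append(np.zeros(codes.shape[1]))
    return np.concatenate(pooled)
```

The fixed `levels=(1, 2, 4)` grid is exactly the kind of uniform, pre-defined pooling shape the abstract criticizes as ad-hoc, which motivates learning the shapes instead.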