Search results

Items from 1 to 20 out of 29 results

chapter

Learning to detect violent videos using convolutional long short-term memory

Swathikiran Sudhakaran, Oswald Lanz

2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS) > 1 - 6

2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS)

Developing a technique for the automatic analysis of surveillance videos in order to identify the presence of violence is of broad interest. In this work, we propose a deep neural network for the purpose of recognizing violent videos. A convolutional neural network is used to extract frame level features from a video. The frame level features are then aggregated using a variant of the long short term...

chapter

Temporal Action Localization by Structured Maximal Sums

Zehuan Yuan, Jonathan C. Stroud, Tong Lu, Jia Deng

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3215 - 3223

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We address the problem of temporal action localization in videos. We pose action localization as a structured prediction over arbitrary-length temporal windows, where each window is scored as the sum of frame-wise classification scores. Additionally, our model classifies the start, middle, and end of each action as separate components, allowing our system to explicitly model each actions temporal...

chapter

Video2Shop: Exact Matching Clothes in Videos to Online Shopping Images

Zhi-Qi Cheng, Xiao Wu, Yang Liu, Xian-Sheng Hua

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4169 - 4177

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In recent years, both online retail and video hosting service have been exponentially grown. In this paper, a novel deep neural network, called AsymNet, is proposed to explore a new cross-domain task, Video2Shop, targeting for matching clothes appeared in videos to the exactly same items in online shops. For the image side, well-established methods are used to detect and extract features for clothing...

chapter

Object Detection in Videos with Tubelet Proposal Networks

Kai Kang, Hongsheng Li, Tong Xiao, Wanli Ouyang, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 889 - 897

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Object detection in videos has drawn increasing attention recently with the introduction of the large-scale ImageNet VID dataset. Different from object detection in static images, temporal information in videos is vital for object detection. To fully utilize temporal information, state-of-the-art methods [15, 14] are based on spatiotemporal tubelets, which are essentially sequences of associated bounding...

chapter

Deep Learning for Domain-Specific Action Recognition in Tennis

Silvia Vinyes Mora, William J. Knottenbelt

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 170 - 178

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Recent progress in sports analytics has been driven by the availability of spatio-temporal and high level data. Video-based action recognition in sports can significantly contribute to these advances. Good progress has been made in the field of action recognition but its application to sports mainly focuses in detecting which sport is being played. In order for action recognition to be useful in sports...

chapter

Text detection algorithm on real scenes images and videos on the base of discrete cosine transform and convolutional neural network

Polina M. Osina, Yuliya A. Bolotova, Vladimir G. Spitsyn

2017 International Siberian Conference on Control and Communications (SIBCON) > 1 - 4

2017 International Siberian Conference on Control and Communications (SIBCON)

In this work we present algorithms which are applied in such task as text recognition on images and video. Proposed algorithm is based on the combination of discrete cosine transform and convolutional neural networks. Description of the applying features of discrete cosine transform for text detection is provided. We list the main advantages and disadvantages of CNN and DCT combination. Also in this...

chapter

Deep learning based image description generation

Philip Kinghorn, Li Zhang, Ling Shao

2017 International Joint Conference on Neural Networks (IJCNN) > 919 - 926

2017 International Joint Conference on Neural Networks (IJCNN)

Describing the contents of images is a challenging task for machines to achieve. It requires not only accurate recognition of objects and humans, but also their attributes and relationships as well as scene information. It would be even more challenging to extend this process to identify falls and hazardous objects to aid elderly or users in need of care. This research makes initial attempts to deal...

chapter

DeepVO: Towards end-to-end visual odometry with deep Recurrent Convolutional Neural Networks

Sen Wang, Ronald Clark, Hongkai Wen, Niki Trigoni

2017 IEEE International Conference on Robotics and Automation (ICRA) > 2043 - 2050

2017 IEEE International Conference on Robotics and Automation (ICRA)

This paper studies monocular visual odometry (VO) problem. Most of existing VO algorithms are developed under a standard pipeline including feature extraction, feature matching, motion estimation, local optimisation, etc. Although some of them have demonstrated superior performance, they usually need to be carefully designed and specifically fine-tuned to work well in different environments. Some...

chapter

Detection of motorcyclists without helmet in videos using convolutional neural network

C. Vishnu, Dinesh Singh, C. Krishna Mohan, Sobhan Babu

2017 International Joint Conference on Neural Networks (IJCNN) > 3036 - 3041

2017 International Joint Conference on Neural Networks (IJCNN)

In order to ensure the safety measures, the detection of traffic rule violators is a highly desirable but challenging task due to various difficulties such as occlusion, illumination, poor quality of surveillance video, varying whether conditions, etc. In this paper, we present a framework for automatic detection of motorcyclists driving without helmets in surveillance videos. In the proposed approach,...

chapter

Part-level fully convolutional networks for pedestrian detection

Xinran Wang, Cheolkon Jung, Alfred O Hero

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 2267 - 2271

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Since pedestrians in videos have a wide range of appearances such as body poses, occlusions, and complex backgrounds, pedestrian detection is a challengeable task. In this paper, we propose part-level fully convolutional networks (FCN) for pedestrian detection. We adopt deep learning to deal with the proposal shifting problem in pedestrian detection. First, we combine convolutional neural networks...

chapter

Deep-net fusion to classify shots in concert videos

Wen-Li Wei, Jen-Chun Lin, Tyng-Luh Liu, Yi-Hsuan Yang, more

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1383 - 1387

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Varying types of shots is a fundamental element in the language of film, commonly used by a visual storytelling director to convey the emotion, ideas, and art. To classify such types of shots from images, we present a new framework that facilitates the intriguing task by addressing two key issues. We first focus on learning more effective features by fusing the layer-wise outputs extracted from a...

chapter

KidsTube: Detection, characterization and analysis of child unsafe content & promoters on YouTube

Rishabh Kaushal, Srishty Saha, Payal Bajaj, Ponnurangam Kumaraguru

2016 14th Annual Conference on Privacy, Security and Trust (PST) > 157 - 164

2016 14th Annual Conference on Privacy, Security and Trust (PST)

YouTube draws large number of users who contribute actively by uploading videos or commenting on existing videos. However, being a crowd sourced and large content pushed onto it, there is limited control over the content. This makes malicious users push content (videos and comments) which is inappropriate (unsafe), particularly when such content is placed around cartoon videos which are typically...

chapter

In-vehicle Hand Gesture Recognition using Hidden Markov models

Nachiket Deo, Akshay Rangesh, Mohan Trivedi

2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC) > 2179 - 2184

2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC)

In this work we explore Hidden Markov models as an approach for modeling and recognizing dynamic hand gestures for the interface of in-vehicle infotainment systems. We train the HMMs on more complex shape descriptors such as HOG and CNN features, unlike typical HMM based approaches. An analysis of the optimal hyperparameters of the HMM for the task has been carried out. Also, dimensionality reduction...

chapter

Anomaly detection techniques in surveillance videos

Xiaoli Li, Ze-min Cai

2016 9th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI) > 54 - 59

2016 9th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)

In recent years, a dramatically increasing number of surveillance cameras have been installed to monitor private and public spaces and areas. Video surveillance is seen as an effective way to ensure our security. Therefore, modeling activity patterns and human behaviors for detection or recognition of peculiar event is a critical technology which has attracted remarkable research interest in the last...

article

Animal Detection From Highly Cluttered Natural Scenes Using Spatiotemporal Object Region Proposals and Patch Verification

Zhi Zhang, Zhihai He, Guitao Cao, Wenming Cao

IEEE Transactions on Multimedia > 2016 > 18 > 10 > 2079 - 2092

In this paper, we consider the animal object detection and segmentation from wildlife monitoring videos captured by motion-triggered cameras, called camera-traps. For these types of videos, existing approaches often suffer from low detection rates due to low contrast between the foreground animals and the cluttered background, as well as high false positive rates due to the dynamic background. To...

chapter

A machine learning approach to identify and track learning styles in MOOCs

Brahim Hmedna, Ali El Mezouary, Omar Baz, Driss Mammass

2016 5th International Conference on Multimedia Computing and Systems (ICMCS) > 212 - 216

2016 5th International Conference on Multimedia Computing and Systems (ICMCS)

This paper is devoted to describe a preliminary draft of our approach that aims to identify and track learners' learning styles based on their behavior and actions during a MOOC then to provide them with personalized recommendations based on their learning styles. Massive Open Online Courses are attracting a debate in the research community about their influence in online education. Indeed, with their...

chapter

Deep action classification via matrix completion

Sushma Bomma, Neil M Robertson

2016 24th European Signal Processing Conference (EUSIPCO) > 1886 - 1890

2016 24th European Signal Processing Conference (EUSIPCO)

Matrix completion is the task of predicting unknown or missing entries in a data matrix. The estimation of the missing entries is based on the assumption that the underlying matrix is a low rank one. Deep learning has evolved as an efficient tool for feature extraction in many large-scale image based applications. Exploiting the techniques from both domains, we propose a novel solution to the problem...

chapter

Human action recognition with DeepAction Kernel Gaussian Process

Yali Wang, Lin Li, Yu Qiao

2016 International Conference on Advanced Robotics and Mechatronics (ICARM) > 165 - 170

2016 International Conference on Advanced Robotics and Mechatronics (ICARM)

Human action recognition is a challenging vision task due to the complex action patterns in the real-world videos. In this work, we propose a DeepAction Kernel Gaussian Process, which takes advantage of Gaussian process (GP) and deep learning, to capture the distinctive action characteristics. Specifically, we design a unified, deep and non-adjacent kernel structure within Gaussian process to classify...

chapter

Multimedia event detection via deep spatial-temporal neural networks

Jingyi Hou, Xinxiao Wu, Feiwu Yu, Yunde Jia

2016 IEEE International Conference on Multimedia and Expo (ICME) > 1 - 6

2016 IEEE International Conference on Multimedia and Expo (ICME)

This paper proposes a novel method using deep spatial-temporal neural networks based on deep Convolutional Neural Network (CNN) for multimedia event detection. To sufficiently take advantage of the motion and appearance information of events from videos, our networks contain two branches: a temporal neural network and a spatial neural network. The temporal neural network captures motion information...

chapter

Efficient audio segmentation in soccer videos

M A Raghuram, Nikhil R. Chavan, Shashidhar G. Koolagudi, Pravin B. Ramteke

2016 IEEE Canadian Conference on Electrical and Computer Engineering (CCECE) > 1 - 4

2016 IEEE Canadian Conference on Electrical and Computer Engineering (CCECE)

Identifying different audio segments in videos is the first step for many important tasks such as event detection and speech transcription. Approaches using Mel-Frequency Cepstral coefficients (MFCCs) with Gaussian mixture models (GMMs) and hidden Markov models (HMMs) perform reasonably well in stationary conditions but do not scale to a broad range of environmental conditions. This paper focuses...

Data set:
ieee
Keywords:
FEATURE EXTRACTION
VIDEOS
NEURAL NETWORKS

Publication date

Set your own date range

Publication type

book (27)
article (2)

Keywords

MACHINE LEARNING (7)
TRAINING (6)
CONVOLUTION (4)
PROPOSALS (4)
VISUALIZATION (4)
KERNEL (3)
NEURAL NETWORK (3)
OBJECT DETECTION (3)
SUPPORT VECTOR MACHINES (3)
ACCURACY (2)
CAMERAS (2)
CONVOLUTIONAL CODES (2)
CONVOLUTIONAL NEURAL NETWORK (2)
CONVOLUTIONAL NEURAL NETWORKS (2)
DEEP LEARNING (2)
DETECTORS (2)
HEAD (2)
HIDDEN MARKOV MODELS (2)
LIGHTING (2)
LOGIC GATES (2)
OPTICAL IMAGING (2)
SURVEILLANCE (2)
TRACKING (2)
TRAJECTORY (2)
YOUTUBE (2)
2D VIEW FEATURE VECTOR (1)
3-D MODEL RETRIEVAL (1)
3D HEAD MODEL RETRIEVAL (1)
3D MODEL FEATURE VECTOR (1)
3D VIRTUAL CHARACTERS (1)
ACTION BANK FEATURES (1)
ACTION RECOGNITION (1)
ACTION UNITS (1)
ACTIVITIES OF DAILY LIVING (1)
ACTIVITY LEVEL RECOGNITION (1)
ADAPTATION (1)
ADAPTATION MODELS (1)
ADAPTIVE MAPPING (1)
ADAPTIVE OPTICS (1)
ADAPTIVE SYSTEMS (1)
ANIMALS (1)
APPLICATIONS OF DEEP NEURAL NETWORKS (1)
ARTIFICIAL NEURAL NETWORK (1)
ASSISTED LIVING (1)
AUDIO SEGMENTATION (1)
BACKGROUND MODELING (1)
BENCHMARK TESTING (1)
BIOLOGICAL NEURAL NETWORKS (1)
BOUNDING BOX ALIGNMENT (1)
BUILDINGS (1)
CAMERA-TRAP IMAGES (1)
CHARACTER GENERATION (1)
CLOTHING (1)
CNN-HMM HYBRID (1)
COLLABORATION (1)
COMPUTATIONAL EFFICIENCY (1)
COMPUTATIONAL MODELING (1)
COMPUTER ARCHITECTURE (1)
COMPUTER SCIENCE (1)
COMPUTER VISION (1)
CONTENT BASED RETRIEVAL (1)
CONTENT-BASED RETRIEVAL (1)
CONVOLUTION NEURAL NETWORKS (1)
CONVOLUTIONAL NEURAL NETWORKS (CNN) FOR FEATURE EXTRACTION (1)
CROSS-DATASET TRANSFER (1)
DATA MINING (1)
DEEP CONVOLUTIONAL NETWORK (1)
DEEP NEURAL NETWORK (1)
DEEP NEURAL NETWORKS (1)
DEEP TRAJECTORY DESCRIPTOR (1)
DIMENSIONALITY REDUCTION (1)
DISCRETE COSINE TRANSFORM (1)
DISCRETE COSINE TRANSFORMS (1)
ELECTRONIC MAIL (1)
ENCODING (1)
ENDOSCOPES (1)
EVENT DETECTION (1)
FACE (1)
FACE DETECTION (1)
FACE RECOGNITION (1)
FACIAL FEATURES (1)
FEATURE REPRESENTATION (1)
FULLY CONVOLUTIONAL NETWORKS (1)
GABOR FILTER (1)
GABOR FILTERS (1)
GAME DESIGN (1)
GAUSSIAN PROCESSES (1)
GESTURE RECOGNITION (1)
GOLD (1)
GRAPH CUT (1)
HAND GESTURE RECOGNITION (1)
HARALICK FEATURES (1)
HAZARDS (1)
HEATING SYSTEMS (1)
HELMET DETECTION (1)
HIDDEN MARKOV MODELS (HMM) (1)
HUMAN ACTION RECOGNITION (1)
more

INFONA - science communication portal

Search results

Learning to detect violent videos using convolutional long short-term memory

Temporal Action Localization by Structured Maximal Sums

Video2Shop: Exact Matching Clothes in Videos to Online Shopping Images

Object Detection in Videos with Tubelet Proposal Networks

Deep Learning for Domain-Specific Action Recognition in Tennis

Text detection algorithm on real scenes images and videos on the base of discrete cosine transform and convolutional neural network

Deep learning based image description generation

DeepVO: Towards end-to-end visual odometry with deep Recurrent Convolutional Neural Networks

Detection of motorcyclists without helmet in videos using convolutional neural network

Part-level fully convolutional networks for pedestrian detection

Deep-net fusion to classify shots in concert videos

KidsTube: Detection, characterization and analysis of child unsafe content & promoters on YouTube

In-vehicle Hand Gesture Recognition using Hidden Markov models

Anomaly detection techniques in surveillance videos

Animal Detection From Highly Cluttered Natural Scenes Using Spatiotemporal Object Region Proposals and Patch Verification

A machine learning approach to identify and track learning styles in MOOCs

Deep action classification via matrix completion

Human action recognition with DeepAction Kernel Gaussian Process

Multimedia event detection via deep spatial-temporal neural networks

Efficient audio segmentation in soccer videos

Filter options

Publication date

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options