Search results

Items from 1 to 14 out of 14 results

chapter

High-Level Feature Extraction Using SIFT GMMs and Audio Models

N Inoue, T Saito, K Shinoda, S Furui

2010 20th International Conference on Pattern Recognition > 3220 - 3223

2010 20th International Conference on Pattern Recognition (ICPR 2010)

We propose a statistical framework for high-level feature extraction that uses SIFT Gaussian mixture models (GMMs) and audio models. SIFT features were extracted from all the image frames and modeled by a GMM. In addition, we used mel-frequency cepstral coefficients and ergodic hidden Markov models to detect high-level features in audio streams. The best result obtained by using SIFT GMMs in terms...

chapter

Exploring Scale-Induced Feature Hierarchies in Natural Images

J. Perkio, T. Tuytelaars, W. Buntine

2009 International Conference on Machine Learning and Applications > 25 - 31

Eighth International Conference on Machine Learning and Applications (ICMLA 2009)

Recently there has been considerable interest in topic models based on the bag-of-features representation of images. The strong independence assumption inherent in the bag-of-features representation is not realistic however: patches often overlap and share underlying image structures. Moreover, important information with respect to relative scales of the features is completely ignored, for the sake...

chapter

Pilot Prototype Analysis: Ongoing Framework for Semantic Extraction Retrieval in Tennis Sports Video

V. Raman, P. Sumari, R. Abdullah

2009 Second International Conference on Computer and Electrical Engineering > 2 > 619 - 623

2009 Second International Conference on Computer and Electrical Engineering (ICCEE 2009)

Producing large amounts of digital media data every day requires fast transmission, efficient storage, flexible manipulation, and reuse of visual content. Since humans tend to use high-level semantic concepts when querying and browsing multimedia databases, there is an increasing need for semantic video indexing and analysis. For this purpose, we proposed a unified framework for semantic extraction...

chapter

VASD: Video Action Scene Detector Using Audio Visual Data

N.A. Lili

2009 International Conference on Computer Technology and Development > 2 > 303 - 307

2009 International Conference on Computer Technology and Development (ICCTD 2009)

This paper presents a method that is able to integrate audio and visual information for human action scene analysis. The approach is top-down for determining and extracting action scenes in video by analyzing both audio and video data. We proposed a framework for recognizing actions by measuring image and action-based information from video with the following characteristics: feature extraction is done...

chapter

A Framework for Evaluating Human Action Detection via Multidimensional Approach

N.A. Lili

2009 Sixth International Conference on Computer Graphics, Imaging and Visualization > 186 - 190

2009 Sixth International Conference on Computer Graphics, Imaging and Visualization (CGIV 2009)

This work discusses the application of an Artificial Intelligence technique called data extraction and a process-based ontology in constructing experimental qualitative models for video retrieval and detection. We present a framework architecture that uses multimodality features as the knowledge representation scheme to model the behaviors of a number of human actions in the video scenes. The main...

chapter

News Video Story Segmentation Based on Naïve Bayes Model

Wan Jianping, Peng Tianqiang, Li Bicheng

2009 Fifth International Conference on Natural Computation > 6 > 77 - 81

2009 Fifth International Conference on Natural Computation (ICNC 2009)

Story boundary detection is the foundation of content based news video retrieval. In this paper, Naive Bayes Model, which has been successfully used in multi-modal feature fusion, is implemented in news video story segmentation. Firstly, we get candidate boundaries through shot detection. Secondly, middle-level features such as visual features, audio type, motion and caption, are extracted from shots...

chapter

Hidden Markov Model for Content-Based Video Retrieval

N.A. Lili

2009 Third Asia International Conference on Modelling & Simulation > 353 - 358

2009 Third Asia International Conference on Modelling & Simulation (AMS 2009)

Content-based video retrieval systems are fairly recent, and it is currently necessary to examine where they would just replace existing systems, where they can really bring some improvement, and where they will open new possibilities. The users want to query the content instead of the raw video data. In this paper, we surveyed the art of video retrieval and proposed a basic framework for video retrieval based...

chapter

Summarizing raw video material using Hidden Markov Models

W. Bailer, G. Thallinger

2009 10th Workshop on Image Analysis for Multimedia Interactive Services > 53 - 56

2009 10th Workshop on Image Analysis for Multimedia Interactive Services. WIAMIS 2009

Besides the reduction of redundancy, the selection of representative segments is a core problem when summarizing collections of raw video material. We propose a novel approach for the selection of segments to be included in a video summary based on hidden Markov models (HMM), which are trained on an annotated subset of the content. The observations of the HMM are relevance judgments of content segments...

chapter

An integrated multi-sensing framework for pervasive healthcare monitoring

M ElHelw, J Pansiot, D McIlwraith, R Ali, et al.

2009 3rd International Conference on Pervasive Computing Technologies for Healthcare > 1 - 7

3rd International Conference on Pervasive Computing Technologies for Healthcare (Pervasive Health 2009)

Pervasive healthcare provides an effective solution for monitoring the wellbeing of the elderly, quantifying post-operative patient recovery and monitoring the progression of neurodegenerative diseases such as Parkinson's. However, developing functional pervasive systems is a complex task that entails the creation of appropriate sensing platforms, integration of versatile technologies for data stream...

chapter

Estimation of crowd behavior using sensor networks and sensor fusion

M. Andersson, J. Rydell, J. Ahlberg

2009 12th International Conference on Information Fusion > 396 - 403

2009 12th International Conference on Information Fusion (FUSION)

Commonly, surveillance operators are today monitoring a large number of CCTV screens, trying to solve the complex cognitive tasks of analyzing crowd behavior and detecting threats and other abnormal behavior. Information overload is a rule rather than an exception. Moreover, CCTV footage lacks important indicators revealing certain threats, and can also in other respects be complemented by data from...

article

Learning Video Preferences Using Visual Features and Closed Captions

D. Brezeale, D.J. Cook

IEEE MultiMedia > 2009 > 16 > 3 > 39 - 47

An approach to identifying a viewer's video preferences uses hidden Markov models by combining visual features and closed captions.

chapter

Lip feature extraction and reduction for HMM-based visual speech recognition systems

S. Alizadeh, R. Boostani, V. Asadpour

2008 9th International Conference on Signal Processing > 561 - 564

2008 9th International Conference on Signal Processing (ICSP 2008)

Lipreading is a main part of audio-visual speech recognition systems which are mostly faced with redundancy of extracted features. In this paper, a new approach has been proposed to increase the lipreading performance by extraction of discriminant features. In this way, first, faces are detected; then, lip key points are extracted in which four cubic curves characterize lip contours. Next, the visual...

chapter

Metadata generation process for video action detection

L.N. Abdullah, S.A.M. Noah

2008 International Symposium on Information Technology > 2 > 1 - 5

2008 International Symposium on Information Technology

This research proposes a model of the multidimensional metadata generation approach for detecting human action in video. The idea is to develop a multidimensional multimodal framework, which will use a semantic approach on the action recognition and classification level. The main idea of the model is that the inputs/outputs in the model will be the results of recognition processes from different modalities...

chapter

Integrating Audio Visual Data for Human Action Detection

L.N. Abdullah, S. Noah

2008 Fifth International Conference on Computer Graphics, Imaging and Visualisation > 242 - 246

2008 5th International Conference on Computer Graphics, Imaging and Visualisation (CGIV)

This paper presents a method that is able to integrate audio and visual information for action scene analysis in any movie. The approach is top-down for determining and extracting action scenes in video by analyzing both audio and video data. In this paper, we directly modelled the hierarchy and shared structures of human behaviours, and we present a framework of the hidden Markov model based application...

Filter options

Data set:
ieee
Keywords:
DATA MINING
FEATURE EXTRACTION
HIDDEN MARKOV MODELS
VISUALIZATION
