We propose a statistical framework for high-level feature extraction that uses SIFT Gaussian mixture models (GMMs) and audio models. SIFT features were extracted from all the image frames and modeled by a GMM. In addition, we used mel-frequency cepstral coefficients and ergodic hidden Markov models to detect high-level features in audio streams. The best result obtained by using SIFT GMMs in terms...
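The pipeline summarized above pools local descriptors (such as SIFT) from all frames and fits a Gaussian mixture model to them. A minimal sketch of the GMM step, using a diagonal-covariance EM loop in plain NumPy, is shown below; the function name, hyper-parameters, and NumPy-only implementation are illustrative assumptions, not the paper's actual system:

```python
import numpy as np

def fit_gmm(X, k=3, n_iter=50, seed=0):
    """Fit a diagonal-covariance GMM to the rows of X via EM.

    A minimal sketch: a real system would pool 128-D SIFT descriptors
    from all image frames into X before fitting.
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    # Initialise means from random data points, unit variances, uniform weights.
    mu = X[rng.choice(n, k, replace=False)]
    var = np.ones((k, d))
    w = np.full(k, 1.0 / k)
    for _ in range(n_iter):
        # E-step: per-point component responsibilities (log-space for stability).
        log_p = (-0.5 * (((X[:, None, :] - mu) ** 2) / var
                         + np.log(2 * np.pi * var)).sum(-1) + np.log(w))
        log_p -= log_p.max(axis=1, keepdims=True)
        r = np.exp(log_p)
        r /= r.sum(axis=1, keepdims=True)
        # M-step: re-estimate weights, means, and variances from responsibilities.
        nk = r.sum(axis=0)
        w = nk / n
        mu = (r.T @ X) / nk[:, None]
        var = (r.T @ (X ** 2)) / nk[:, None] - mu ** 2 + 1e-6
    return w, mu, var
```

The fitted mixture can then serve as a frame-level likelihood model for a given high-level feature, scoring new frames by their density under the mixture.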
Automatically extracting rhythmic information from musical recordings is inarguably one of the most critical subtasks in many music information retrieval systems. This paper presents a system for automatically extracting rhythm features from audio music signals in the WAV format, using a new approach based on metric structure and Bayesian theory. In this system, a detection method is applied in the...
In this paper, we propose a new general low-level feature representation for audio signals. Our approach, called the Dominant Audio Descriptor, is inspired by the MPEG-7 Dominant Color Descriptor. It is based on clustering time-local features and identifying dominant components. The features used to illustrate this approach are the well-known Mel Frequency Cepstral Coefficients. The performance of the...
As an important information component of multimedia, audio enriches information perception and acquisition. The analysis and extraction of audio features form the basis of audio classification, and effective feature extraction is important for content-based audio retrieval. In this paper, based on rough set theory, audio features are reduced and a lower-dimensional feature set is obtained...
This paper presents a method that is able to integrate audio and visual information for human action scene analysis. The approach is top-down, determining and extracting action scenes in video by analyzing both audio and video data. We propose a framework for recognizing actions by measuring image- and action-based information from video with the following characteristics: feature extraction is done...
Audio segmentation and classification can provide useful information for multimedia content analysis. In this paper, we present an approach to segmenting and categorizing sports audio into speech, music, and other environmental sounds for sports video classification and highlight detection. We investigate the performance of mel frequency cepstral coefficients (MFCC) in a Gaussian mixture model frame...
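Several of the abstracts above rely on mel frequency cepstral coefficients as the front-end feature. The standard MFCC pipeline (framing, windowed FFT, mel filterbank, log, DCT) can be sketched as follows; the parameter defaults are common illustrative choices, not the settings used in any of the papers listed here:

```python
import numpy as np

def mfcc(signal, sr=16000, n_fft=512, hop=256, n_mels=26, n_ceps=13):
    """Compute MFCCs from a mono signal: a minimal sketch of the
    standard pipeline (framing -> FFT -> mel filterbank -> log -> DCT)."""
    # Slice the signal into overlapping frames and apply a Hann window.
    n_frames = 1 + (len(signal) - n_fft) // hop
    idx = np.arange(n_fft)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = signal[idx] * np.hanning(n_fft)
    # Power spectrum of each frame.
    power = np.abs(np.fft.rfft(frames, n_fft)) ** 2
    # Triangular mel filterbank spanning 0 Hz to Nyquist.
    mel = lambda f: 2595 * np.log10(1 + f / 700)
    imel = lambda m: 700 * (10 ** (m / 2595) - 1)
    pts = imel(np.linspace(mel(0), mel(sr / 2), n_mels + 2))
    bins = np.floor((n_fft + 1) * pts / sr).astype(int)
    fbank = np.zeros((n_mels, n_fft // 2 + 1))
    for i in range(n_mels):
        l, c, r = bins[i], bins[i + 1], bins[i + 2]
        fbank[i, l:c] = (np.arange(l, c) - l) / max(c - l, 1)
        fbank[i, c:r] = (r - np.arange(c, r)) / max(r - c, 1)
    # Log mel energies, then DCT-II to decorrelate into cepstral coefficients.
    logmel = np.log(power @ fbank.T + 1e-10)
    n = np.arange(n_mels)
    dct = np.cos(np.pi * np.outer(np.arange(n_ceps), (2 * n + 1) / (2 * n_mels)))
    return logmel @ dct.T
```

The resulting per-frame coefficient vectors are what a GMM or HMM back-end would then model for segmentation and classification.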
A novel affective video segment retrieval method based on the correlation between emotion and emotional audio events (EAEs) is presented. The proposed method focuses on retrieving three types of affective video segments: joy, sadness, and excitement, by utilizing correlations between emotions and EAEs. The correlation between these emotions and EAEs is investigated by a subjective evaluation. The proposed...
We introduce a regularized kernel-based rule for unsupervised change detection based on a simpler version of the recently proposed kernel Fisher discriminant ratio. Compared to other kernel-based change detectors found in the literature, the proposed test statistic is easier to compute and has a known asymptotic distribution which can effectively be used to set the false alarm rate a priori. This...
In this paper, we consider representing a musical signal as a dynamic texture, a model for both the timbral and rhythmical qualities of sound. We apply the new representation to the task of automatic song segmentation. In particular, we cluster sequences of audio feature-vectors, extracted from the song, using a dynamic texture mixture model (DTM). We show that the DTM model can both detect transition...
This paper presents a method that is able to integrate audio and visual information for action scene analysis in any movie. The approach is top-down, determining and extracting action scenes in video by analyzing both audio and video data. In this paper, we directly model the hierarchy and shared structures of human behaviours, and we present a framework for a hidden Markov model based application...
This paper presents two approaches for speaker role recognition in multiparty audio recordings. The experiments are performed over a corpus of 96 radio bulletins corresponding to roughly 19 h of material. Each recording involves, on average, 11 speakers playing one among six roles belonging to a predefined set. Both proposed approaches start by segmenting automatically the recordings into single speaker...