Advanced search

Advanced search in people

From:

To:

Items from 1 to 9 out of 9 results

chapter

Realistic Human Action Recognition with Audio Context

Qiuxia Wu, Zhiyong Wang, Feiqi Deng, D D Feng

2010 International Conference on Digital Image Computing: Techniques and Applications > 288 - 293

2010 International Conference on Digital Image Computing: Techniques and Applications (DICTA 2010)

Recognizing human actions in realistic scenes has emerged as a challenging topic due to various aspects such as dynamic backgrounds. In this paper, we present a novel approach to taking audio context into account for better action recognition performance, since audio can provide strong evidence to certain actions such as phone-ringing to answer-phone. At first, classifiers are established for visual...

chapter

Learning Naive Bayes Classifiers for Music Classification and Retrieval

Zhouyu Fu, Guojun Lu, Kai Ming Ting, Dengsheng Zhang

2010 20th International Conference on Pattern Recognition > 4589 - 4592

2010 20th International Conference on Pattern Recognition (ICPR 2010)

In this paper, we explore the use of naive Bayes classifiers for music classification and retrieval. The motivation is to employ all audio features extracted from local windows for classification instead of just using a single song-level feature vector produced by compressing the local features. Two variants of naive Bayes classifiers are studied based on the extensions of standard nearest neighbor...

chapter

Homogeneous segmentation and classifier ensemble for audio tag annotation and retrieval

Hung-Yi Lo, Ju-Chiang Wang, Hsin-Min Wang

2010 IEEE International Conference on Multimedia and Expo > 304 - 309

2010 IEEE International Conference on Multimedia and Expo (ICME)

Audio tags describe different types of musical information such as genre, mood, and instrument. This paper aims to automatically annotate audio clips with tags and retrieve relevant clips from a music database by tags. Given an audio clip, we divide it into several homogeneous segments by using an audio novelty curve, and then extract audio features from each segment with respect to various musical...

chapter

A SVM-Based Audio Event Detection System

Li Lu, Fengpei Ge, Qingwei Zhao, Yonghong Yan

2010 International Conference on Electrical and Control Engineering > 292 - 295

2010 International Conference on Electrical and Control Engineering (ICECE 2010)

This paper proposes a SVM-based method to deal with the problem of detecting audio events(cheering and applause) by audio analysis. In our framework, a sliding window is first used to pre-segment the audio stream into short segments by moving from start to the end. Second, various kinds of audio features are extracted to represent different audio sounds in each segment. Third, SVM(super vector machine)...

chapter

Modified Local Discriminant Bases and Its Application in Audio Feature Extraction

Zheng Jiming, Wei Guohua, Yang Chunde

2009 International Forum on Information Technology and Applications > 3 > 49 - 52

2009 International Forum on Information Technology and Applications (IFITA)

One of the major challenges in classification problems based on signal decomposition approach is to identify the right basis function and its derivatives that can provide optimal features to distinguish the classes. Local discriminant bases (LDB) algorithm is one such algorithm, which efficiently selects a set of significant basis functions from the library of orthonormal bases based on certain defined...

chapter

Video scene classification and segmentation based on Support Vector Machine

Yingying Zhu, Yingying Zhu, Zhong Ming, Jun Zhang

2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence) > 3571 - 3576

2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence)

Video scene classification and segmentation are fundamental steps for multimedia retrieval, indexing and browsing. In this paper, a robust scene classification and segmentation approach based on support vector machine (SVM) is presented, which extracts both audio and visual features and analyzes their inter-relations to identify and classify video scenes. Our system works on content from a diverse...

chapter

SVM-Based Video Scene Classification and Segmentation

Yingying Zhu, Zhong Ming

2008 International Conference on Multimedia and Ubiquitous Engineering (mue 2008) > 407 - 412

2008 2nd International Conference on Multimedia and Ubiquitous Engineering (MUE '08)

article

A Noise-Robust FFT-Based Auditory Spectrum With Application in Audio Classification

Wei Chu, B. Champagne

IEEE Transactions on Audio, Speech, and Language Processing > 2008 > 16 > 1 > 137 - 150

In this paper, we investigate the noise robustness of Wang and Shamma's early auditory (EA) model for the calculation of an auditory spectrum in audio classification applications. First, a stochastic analysis is conducted wherein an approximate expression of the auditory spectrum is derived to justify the noise-suppression property of the EA model. Second, we present an efficient fast Fourier transform...

chapter

Unsupervised speech/music classification using one-class support vector machines

S.O. Sadjadi, S.M. Ahadi, O. Hazrati

2007 6th International Conference on Information, Communications&Signal Processing > 1 - 5

Sixth International Conference on Information, Communications and Signal Processing

Audio classification is an important issue in current audio processing and content analysis researches. Speech/music classification is one of the most interesting branches of audio signal classification. In this paper we present an unsupervised clustering method, based on one-class support vector machines (OCSVM) and inspired by the classical K-means algorithm, which effectively classifies speech/music...

Filter options

Keywords:
SUPPORT VECTOR MACHINES
AUDIO SIGNAL PROCESSING
AUDIO FEATURE EXTRACTION

Publication date

Set your own date range

Publication type

book (8)
article (1)

Keywords

SIGNAL CLASSIFICATION (4)
SUPPORT VECTOR MACHINE (3)
TRAINING (3)
ARTIFICIAL NEURAL NETWORKS (2)
CLASSIFICATION ALGORITHMS (2)
IMAGE CLASSIFICATION (2)
IMAGE SEGMENTATION (2)
INFORMATION RETRIEVAL (2)
MULTIMEDIA BROWSING (2)
MULTIMEDIA INDEXING (2)
MULTIMEDIA RETRIEVAL (2)
MUSIC (2)
SCENE CHANGE BOUNDARY DETECTION (2)
SPEECH (2)
SPEECH PROCESSING (2)
SUPPORT VECTOR MACHINE CLASSIFICATION (2)
SVM (2)
VIDEO RETRIEVAL (2)
VIDEO SCENE CLASSIFICATION (2)
VIDEO SCENE SEGMENTATION (2)
VIDEO SIGNAL PROCESSING (2)
VISUAL FEATURE EXTRACTION (2)
ACCURACY (1)
ADABOOST CLASSIFIER (1)
AUDIO ANALYSIS (1)
AUDIO CLASSIFICATION (1)
AUDIO CONTEXT (1)
AUDIO DETECTION TASK (1)
AUDIO FILES (1)
AUDIO MODALITY (1)
AUDIO NOVELTY CURVE (1)
AUDIO PROCESSING (1)
AUDIO RECOGNITION (1)
AUDIO SEGMENTATION (1)
AUDIO SIGNAL CLASSIFICATION (1)
AUDIO SOUNDS (1)
AUDIO STREAM (1)
AUDIO STREAMING (1)
AUDIO TAG ANNOTATION (1)
AUDIO TAG RETRIEVAL (1)
BACKGROUND NOISE (1)
BAG OF VISUAL-WORDS MODEL (1)
BAYES METHODS (1)
C4.5 (1)
CALIBRATED PROBABILITY SCORES (1)
CEPSTRAL ANALYSIS (1)
CLASSIFICATION TECHNIQUE (1)
CLASSIFICATION TREE ANALYSIS (1)
CLASSIFIER ENSEMBLE (1)
COMPUTATIONAL COMPLEXITY (1)
CONFERENCES (1)
CONTENT ANALYSIS (1)
CONTEXT (1)
DECISION FUSION SCHEME (1)
DECISION RULES (1)
DECISION THEORY (1)
DECISION TREE LEARNING ALGORITHM (1)
DECISION TREES (1)
EARLY AUDITORY (EA) MODEL (1)
EDGE DETECTION (1)
ENSEMBLE CLASSIFIER (1)
ENSEMBLE METHOD (1)
EQUATIONS (1)
EVENT DETECTION (1)
F-MEASURE (1)
FALSE ALARMS (1)
FAST FOURIER TRANSFORMS (1)
FEATURE SELECTION (1)
FRAME-BASED FEATURE VECTOR SEQUENCE FORMAT (1)
GAUSSIAN PROCESSES (1)
GMM-BASED AUDIO EVENT DETECTION SYSTEM (1)
HISTOGRAMS (1)
HOLLYWOOD HUMAN ACTIONS DATASET (1)
HOMOGENEOUS SEGMENTATION (1)
HUMAN ACTION RECOGNITION (1)
HUMAN ACTION REPRESENTATION (1)
HUMANS (1)
INDEXING (1)
INTERFERENCE SUPPRESSION (1)
ITERATIVE K-MEANS LIKE ALGORITHM (1)
ITERATIVE METHODS (1)
JOINTS (1)
LDB (1)
LEARNING (ARTIFICIAL INTELLIGENCE) (1)
MEASUREMENT (1)
MEL FREQUENCY CEPSTRAL COEFFICIENT (1)
MEL-FREQUENCY CEPSTRAL COEFFICIENT (1)
MIREX 2009 AUDIO TAG CLASSIFICATION TASK (1)
MODIFIED LOCAL DISCRIMINANT BASES (1)
MOTION PICTURES (1)
MULTIMEDIA COMPUTING (1)
MUSIC CLASSIFICATION (1)
MUSIC DATABASE (1)
MUSIC RETRIEVAL (1)
NAIVE BAYES (1)
NAIVE BAYES CLASSIFIERS (1)
more

INFONA - science communication portal

Advanced search

Advanced search in people

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options