Over the last decade, a great deal of research has been done on sound event classification, but a major problem is that performance degrades sharply in the presence of noise. As spectrogram-based image features and denoising autoencoders reportedly perform better in noisy conditions, this paper proposes a new robust feature called the denoising autoencoder image...
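The abstract is truncated before the feature is defined, but its core ingredient is a denoising autoencoder. As an illustration only (not the paper's architecture), here is a minimal tied-weight denoising autoencoder in NumPy, trained to reconstruct clean feature frames from noise-corrupted copies; all sizes and hyperparameters are arbitrary:

```python
import numpy as np

def train_dae(X, n_hidden=8, noise=0.3, lr=0.1, epochs=200, seed=0):
    """Tiny tied-weight denoising autoencoder with sigmoid units:
    corrupt the input with Gaussian noise, then learn to reconstruct
    the clean frames (plain batch gradient descent on squared error)."""
    rng = np.random.default_rng(seed)
    n_in = X.shape[1]
    W = rng.normal(scale=0.1, size=(n_in, n_hidden))
    b_h = np.zeros(n_hidden)
    b_o = np.zeros(n_in)
    sig = lambda z: 1.0 / (1.0 + np.exp(-z))
    for _ in range(epochs):
        Xn = X + noise * rng.normal(size=X.shape)  # corrupted input
        H = sig(Xn @ W + b_h)                      # encode
        R = sig(H @ W.T + b_o)                     # decode (tied weights)
        dR = (R - X) * R * (1.0 - R)               # grad at decoder pre-activation
        dH = (dR @ W) * H * (1.0 - H)              # grad at encoder pre-activation
        W -= lr * (Xn.T @ dH + dR.T @ H) / len(X)
        b_h -= lr * dH.mean(axis=0)
        b_o -= lr * dR.mean(axis=0)
    return W, b_h, b_o

# illustrative data: 50 "frames" of 6 values in (0, 1)
rng = np.random.default_rng(1)
X = rng.uniform(0.2, 0.8, size=(50, 6))
W, b_h, b_o = train_dae(X)
code = 1.0 / (1.0 + np.exp(-(X @ W + b_h)))  # hidden activations used as the feature
```

The hidden activations `code` play the role of the learned robust representation; a real system would train on spectrogram patches rather than random data.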
This paper focuses on cover song identification over a large-scale dataset. Identifying all covers of a query song in a music collection is a challenging task, since covers vary in multiple aspects such as tempo, key, and structure. On large-scale datasets, cover song identification is even more challenging, and few works have been published. Previous works usually use a single representation for a...
In the audio event classification and detection research field, the representation of the audio itself is important. Many researchers have tried to apply Deep Belief Networks (DBNs) to learn new representations of audio. The mel filter-bank feature, computed on the mel scale, is commonly used as the low-level representation of audio in the pre-processing stage of a DBN. However, the mel...
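For context on the mel filter-bank front end mentioned above (a standard construction, not taken from the truncated abstract), a minimal NumPy sketch of triangular filters spaced evenly on the mel scale; the filter count, FFT size, and sample rate are illustrative:

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_filters=26, n_fft=512, sr=16000):
    """Triangular filters with centers spaced evenly on the mel scale;
    applied to a power spectrum they yield the mel filter-bank feature."""
    mels = np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mels) / sr).astype(int)
    fb = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(n_filters):
        l, c, r = bins[i], bins[i + 1], bins[i + 2]
        fb[i, l:c] = (np.arange(l, c) - l) / max(c - l, 1)  # rising slope
        fb[i, c:r] = (r - np.arange(c, r)) / max(r - c, 1)  # falling slope
    return fb

fb = mel_filterbank()  # shape: (26 filters, 257 FFT bins)
```

Multiplying a frame's power spectrum by `fb.T` gives the 26-dimensional mel filter-bank vector typically fed to the DBN.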
Audio event classification plays an important role in surveillance systems. Due to the constraints of the short-time Fourier transform (STFT), extracting audio frequency-domain features, the essential work in audio event classification, remains difficult for large audio frames. The traditional method of concatenating the feature vectors of successive audio windows...
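A minimal sketch of the traditional concatenation front end the abstract refers to, assuming NumPy; the frame length, hop size, and context size `k` are illustrative, not the paper's values:

```python
import numpy as np

def stft_magnitude(x, frame_len=256, hop=128):
    """Windowed magnitude STFT: slice the signal into overlapping frames,
    apply a Hann window, and take |rFFT| of each frame."""
    n_frames = 1 + (len(x) - frame_len) // hop
    win = np.hanning(frame_len)
    frames = np.stack([x[i * hop : i * hop + frame_len] * win
                       for i in range(n_frames)])
    return np.abs(np.fft.rfft(frames, axis=1))

def concat_windows(spec, k=3):
    """Traditional concatenation: stack k successive frame spectra into one
    long feature vector -- its dimension grows linearly with k, which is the
    usual objection to this method for big frames."""
    n = spec.shape[0] - k + 1
    return np.stack([spec[i : i + k].ravel() for i in range(n)])

x = np.random.default_rng(0).normal(size=4096)  # toy 1-channel signal
spec = stft_magnitude(x)        # (31 frames, 129 bins)
feats = concat_windows(spec)    # (29 vectors, 387 dims each)
```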
For music identification, conventional bag-of-audio-words methods generally compute a single histogram for a piece of music, which ignores the temporal characteristics of the music and hurts accuracy. In addition, they are usually based on the DFT spectrogram, which does not represent music as well as the Constant Q (CQ) spectrogram. To address these problems, we propose a two-layer...
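For reference, the conventional bag-of-audio-words histogram the abstract criticizes can be sketched as follows (illustration only; the codebook here is random, whereas in practice it would come from, e.g., k-means on training frames):

```python
import numpy as np

def bag_of_audio_words(frames, codebook):
    """Assign each spectral frame to its nearest codeword (vector
    quantization) and return the normalized histogram of counts.
    Note that all temporal ordering of the frames is discarded."""
    dists = np.linalg.norm(frames[:, None, :] - codebook[None, :, :], axis=2)
    assignments = dists.argmin(axis=1)
    hist = np.bincount(assignments, minlength=len(codebook)).astype(float)
    return hist / hist.sum()

rng = np.random.default_rng(1)
frames = rng.normal(size=(200, 12))   # e.g. 200 CQ-spectrogram frames
codebook = rng.normal(size=(16, 12))  # 16 codewords (normally from k-means)
h = bag_of_audio_words(frames, codebook)
```

Because `h` is invariant to any permutation of the frames, two pieces with the same notes in different order get the same histogram, which is exactly the temporal weakness the abstract points out.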
This paper addresses the detection and recognition of impulsive sounds in surveillance systems, such as door slams, footsteps, glass breaks, gunshots, and human screams. We build an acoustic event dataset of about 1,000 sound clips and a ground-truth dataset from a surveillance system. We investigate the influence of different frame sizes in audio feature extraction when classifying acoustic events...
This paper presents a detection-based method for tracking an uncertain number of persons in complex scenarios with frequent occlusions. Frame-by-frame, data-association-based particle filters are adopted to track targets in occlusion-free regions. When an occlusion is detected, the associated trackers are deactivated; they are re-activated when the tracked persons are re-identified after occlusion...
Audio fingerprints can be used to implement an efficient music identification system on a million-song library, but such a system requires a huge amount of memory to hold the fingerprints and indexes. For a large-scale music library, memory therefore restricts the speed of music identification. In this paper, we propose an efficient music identification system that utilizes a kind of space-saving...
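The abstract is truncated before its space-saving structure is described, but the standard baseline it improves on is an inverted index from sub-fingerprint hashes to (song, offset) pairs, with offset-consistent voting at query time; a toy sketch (hash values here are placeholders):

```python
from collections import defaultdict

def build_index(library):
    """Inverted index mapping each sub-fingerprint hash to every
    (song_id, frame_offset) position where it occurs. For a million-song
    library this index is what dominates memory."""
    index = defaultdict(list)
    for song_id, hashes in library.items():
        for offset, h in enumerate(hashes):
            index[h].append((song_id, offset))
    return index

def identify(index, query):
    """Vote for (song, time-shift) pairs; a true match accumulates many
    hash hits at one consistent shift between query and song."""
    votes = defaultdict(int)
    for q_off, h in enumerate(query):
        for song_id, s_off in index.get(h, ()):
            votes[(song_id, s_off - q_off)] += 1
    if not votes:
        return None
    (song_id, _shift), _count = max(votes.items(), key=lambda kv: kv[1])
    return song_id

library = {"song_a": [10, 42, 42, 7, 99], "song_b": [3, 42, 8, 10, 5]}
index = build_index(library)
print(identify(index, [42, 7, 99]))  # song_a: three hits at a consistent shift
```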
Emotion is a useful means of organizing a music library, and automatic music emotion recognition is drawing more and more attention. Music structure information is incorporated to improve music emotion regression. A music dataset with emotion and structure annotations is built, and features concerning lyrics, audio, and MIDI are extracted. For each emotion dimension, regressors are built using...
We adopt a two-layer regression model for music pleasure regression. The pleasure orientation of a song is estimated first, and then different regressors are used to predict the degree of pleasure according to the estimated orientation. Using the corresponding regressor for each instance yields a large improvement over the one-layer model when the first layer is assumed to be perfect. By tuning the...
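A toy rendering of the two-layer idea under the "perfect first layer" assumption (orientation labels given, not predicted), using plain least-squares regressors in NumPy; the threshold and model family are illustrative, not the paper's:

```python
import numpy as np

def fit_linear(X, y):
    """Least-squares linear regressor with a bias term."""
    A = np.hstack([X, np.ones((len(X), 1))])
    w, *_ = np.linalg.lstsq(A, y, rcond=None)
    return w

def predict_linear(w, X):
    return np.hstack([X, np.ones((len(X), 1))]) @ w

def fit_two_layer(X, y, threshold=0.0):
    """Two-layer scheme: partition training songs by pleasure orientation
    (label above/below a threshold), then fit one regressor per side."""
    pos = y >= threshold
    return {"pos": fit_linear(X[pos], y[pos]),
            "neg": fit_linear(X[~pos], y[~pos])}

def predict_two_layer(models, X, orientation):
    """'orientation' plays the role of the first-layer decision; here it
    is supplied directly, i.e. the perfect-first-layer case."""
    out = np.empty(len(X))
    out[orientation] = predict_linear(models["pos"], X[orientation])
    out[~orientation] = predict_linear(models["neg"], X[~orientation])
    return out

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))
y = X @ np.array([1.0, 2.0, 3.0]) + 0.5   # synthetic pleasure scores
models = fit_two_layer(X, y)
pred = predict_two_layer(models, X, y >= 0.0)
```

In a real system the first layer would be a trained classifier, and its errors are exactly what separates the idealized gain from the achievable one.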
In this paper, we present a new approach to content-based music mood classification. Music, especially song, is inherently multi-modal, but current studies mainly focus on the audio modality, and their classification capability is not good enough. In this paper we use three modalities: audio, lyrics, and MIDI. After extracting features from these three modalities respectively, we...
Mood annotation of music is challenging, as it concerns not only audio content but also extra-musical information. It is a representative research topic on how to traverse the well-known semantic gap. In this paper, we propose a new music-mood-specific ontology. Novel ontology-based semantic reasoning methods are applied to effectively bridge content-based information with web-based resources. Also,...
Recently, class labels have been commonly used to structure the increasing amounts of music available in digital form on the Web, and they are important for music information retrieval. An evaluation of the automatic classification of Chinese folk music according to an audio taxonomy is presented. The audio taxonomy is organized hierarchically, resulting in good coverage of Chinese folk music. Continuous Hidden...