2014 International Conference on Audio, Language and Image Processing

Most of short time-frequency feature (TFF) extraction methods in the literature only consider scale and frequency of the selected atoms, which neglects the effect of expansion coefficient and time of the selected atoms. In order to classify movie audio signals better, an effective and flexible time-frequency feature extraction method using expansion coefficient, scale, time and frequency of the selected...

chapter

Author index

2014 International Conference on Audio, Language and Image Processing > 1 - 5

2014 International Conference on Audio, Language and Image Processing (ICALIP)

chapter

Front matter volume 1

2014 International Conference on Audio, Language and Image Processing > 1 - 33

2014 International Conference on Audio, Language and Image Processing (ICALIP)

chapter

Front matter volume 2

2014 International Conference on Audio, Language and Image Processing > 1 - 33

2014 International Conference on Audio, Language and Image Processing (ICALIP)

chapter

Speeding up audio fingerprinting over GPUs

Chung-Che Wang, Jyh-Shing Roger Jang, Wenshan Liou

2014 International Conference on Audio, Language and Image Processing > 5 - 10

2014 International Conference on Audio, Language and Image Processing (ICALIP)

This paper presents the use of GPUs (graphic processing units) for implementing an efficient audio fingerprinting (AFP) system for audio music retrieval. Such a music retrieval system can compare a 10-second recording of exact but noisy audio clip to the database of more than 100K songs on a single PC with GPU cards. Due to the use of GPUs, we can achieve a speedup factor of 14 for audio comparison,...

chapter

Subjective evaluation on the timbre of horizontal ambisonics reproduction

Liu Yang, Xie Bosun

2014 International Conference on Audio, Language and Image Processing > 11 - 15

2014 International Conference on Audio, Language and Image Processing (ICALIP)

Ambisonics are a series of flexible sound reproduction systems that decompose and reconstruct sound field by each order approximation of horizontal Fourier or spatial spherical harmonics decomposition. For a given order reproduction, providing that the product of wave number and the radius of the rendering region is approximately less than the order, the system is able to recreate the target sound...

chapter

An intrinsic mode function basis dictionary for auditory signal processing

Chang Gao, Haifeng Li, Lin Ma

2014 International Conference on Audio, Language and Image Processing > 16 - 21

2014 International Conference on Audio, Language and Image Processing (ICALIP)

As one important field of sparse representation, the research of dictionary learning attracts most researchers interest in signal processing study. Empirical Mode Decomposition (EMD), as an efficient and adaptive signal decomposition method that depends completely on the signal, is considered as an innovative and appropriative the basis function theory. The Intrinsic Mode Functions (IMFs) obtained...

chapter

Suitability of speech quality evaluation measures in speech enhancement

Zhang Jie, Xiaoqun Zhao, Jingyun Xu, Zhang Yang

2014 International Conference on Audio, Language and Image Processing > 22 - 26

2014 International Conference on Audio, Language and Image Processing (ICALIP)

In this paper, we discuss the suitability of speech quality evaluation measures under various noise environments in the application of spectral subtraction speech enhancement. We take three kinds of typical noise and evaluate comprehensively the speech quality under the standard of global signal-to-noise ratio of noisy speech. We take six kinds of quality measures which include mean opinion score,...

chapter

DCT based algorithm on dimension reduction of residual frequency magnitude parameters

Jingyun Xu, Xiaoqun Zhao, Rongyun Li, Qiao Wang

2014 International Conference on Audio, Language and Image Processing > 27 - 30

2014 International Conference on Audio, Language and Image Processing (ICALIP)

Statistics and analysis of residual FM parameters correlation of male and female voices in English and Chinese, an algorithm for the dimensionality reduction of residual FM parameters based on two-dimensional DCT transform is presented. Remarkable reduction of the correlation of residual fm parameters is obtained. In multi-frame joint coding, DCT dimensionality reduction algorithm achieves coded bits...

chapter

Measure and model of vocal-tract length discrimination in cochlear implants

Etienne Gaudrain, Lucas Stam, Deniz Baskent

2014 International Conference on Audio, Language and Image Processing > 31 - 34

2014 International Conference on Audio, Language and Image Processing (ICALIP)

Voice discrimination is crucial to selectively listen to a particular talker in a crowded environment. In normalhearing listeners, it strongly relies on the perception of two dimensions: the fundamental frequency and the vocal-tract length. Yet, very little is known about the perception of the latter in cochlear implants. The present study reports discrimination thresholds for vocal-tract length in...

chapter

Relative distance estimation in multi-channel spatial audio signal

Zheng-yang Sun, Chang-chun Bao, Mao-shen Jia, Bing Bu

2014 International Conference on Audio, Language and Image Processing > 35 - 38

2014 International Conference on Audio, Language and Image Processing (ICALIP)

With the development of 3D audio, distance rendering in multi-channel spatial audio becomes a hot topic of great interest. In this paper, the directional-to-diffuse energy ratio (DDR), a novel relative distance cue, is presented based on Fast Independent Components Analysis (FastICA). DDR is used to trace the relative distance of recreated sound image, by extracting the energy ratio of the directional...

chapter

Segmentation and 3D visualization of pheochromocytoma in contrast-enhanced CT images

San Tang, Yi Guo, Yuanyuan Wang, Wanli Cao, more

2014 International Conference on Audio, Language and Image Processing > 39 - 43

2014 International Conference on Audio, Language and Image Processing (ICALIP)

As a kind of adrenal tumors, pheochromocytoma is commonly present with serious and potentially lethal cardiovascular complications. In this paper, a novel image segmentation and three-dimensional (3D) visualization framework is proposed to extract and visualize the pheochromocytoma and intratumoral necrosis in multiphase contrast-enhanced computed tomography (CECT) images in “Digital Imaging and Communications...

chapter

Atrial fibrillation detection using spectra of FSD recurrence complex network

Yajuan Zhang, Yuanyuan Wang, Cuiwei Yang, Xiaomei Wu, more

2014 International Conference on Audio, Language and Image Processing > 44 - 47

2014 International Conference on Audio, Language and Image Processing (ICALIP)

Complex network spectra features are proposed to be used by the classifier to classify atrial fibrillation (AF) and normal sinus rhythm (NSR). This novel complex network construction method utilizes the fuzzy symbolic dynamics (FSD) and recurrence complex network to analyze the synchronization of cardiac electrical activity. Firstly, the multi-lead epicardial signals recorded from dogs are transformed...

INFONA - science communication portal

2014 International Conference on Audio, Language and Image Processing

Copyright page

Back cover volume 2

Front cover volume 2

Back cover volume 1

Side cover volume 2

Side cover volume 1

Front cover volume 1

A novel time-frequency feature extraction for movie audio signals classification

Author index

Front matter volume 1

Front matter volume 2

Speeding up audio fingerprinting over GPUs

Subjective evaluation on the timbre of horizontal ambisonics reproduction

An intrinsic mode function basis dictionary for auditory signal processing

Suitability of speech quality evaluation measures in speech enhancement

DCT based algorithm on dimension reduction of residual frequency magnitude parameters

Measure and model of vocal-tract length discrimination in cochlear implants

Relative distance estimation in multi-channel spatial audio signal

Segmentation and 3D visualization of pheochromocytoma in contrast-enhanced CT images

Atrial fibrillation detection using spectra of FSD recurrence complex network

Filter options

Publication date

Keywords

INFONA - science communication portal

2014 International Conference on Audio, Language and Image Processing $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2014 International Conference on Audio, Language and Image Processing