The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Most of short time-frequency feature (TFF) extraction methods in the literature only consider scale and frequency of the selected atoms, which neglects the effect of expansion coefficient and time of the selected atoms. In order to classify movie audio signals better, an effective and flexible time-frequency feature extraction method using expansion coefficient, scale, time and frequency of the selected...
This paper presents the use of GPUs (graphic processing units) for implementing an efficient audio fingerprinting (AFP) system for audio music retrieval. Such a music retrieval system can compare a 10-second recording of exact but noisy audio clip to the database of more than 100K songs on a single PC with GPU cards. Due to the use of GPUs, we can achieve a speedup factor of 14 for audio comparison,...
Ambisonics are a series of flexible sound reproduction systems that decompose and reconstruct sound field by each order approximation of horizontal Fourier or spatial spherical harmonics decomposition. For a given order reproduction, providing that the product of wave number and the radius of the rendering region is approximately less than the order, the system is able to recreate the target sound...
As one important field of sparse representation, the research of dictionary learning attracts most researchers interest in signal processing study. Empirical Mode Decomposition (EMD), as an efficient and adaptive signal decomposition method that depends completely on the signal, is considered as an innovative and appropriative the basis function theory. The Intrinsic Mode Functions (IMFs) obtained...
In this paper, we discuss the suitability of speech quality evaluation measures under various noise environments in the application of spectral subtraction speech enhancement. We take three kinds of typical noise and evaluate comprehensively the speech quality under the standard of global signal-to-noise ratio of noisy speech. We take six kinds of quality measures which include mean opinion score,...
Statistics and analysis of residual FM parameters correlation of male and female voices in English and Chinese, an algorithm for the dimensionality reduction of residual FM parameters based on two-dimensional DCT transform is presented. Remarkable reduction of the correlation of residual fm parameters is obtained. In multi-frame joint coding, DCT dimensionality reduction algorithm achieves coded bits...
Voice discrimination is crucial to selectively listen to a particular talker in a crowded environment. In normalhearing listeners, it strongly relies on the perception of two dimensions: the fundamental frequency and the vocal-tract length. Yet, very little is known about the perception of the latter in cochlear implants. The present study reports discrimination thresholds for vocal-tract length in...
With the development of 3D audio, distance rendering in multi-channel spatial audio becomes a hot topic of great interest. In this paper, the directional-to-diffuse energy ratio (DDR), a novel relative distance cue, is presented based on Fast Independent Components Analysis (FastICA). DDR is used to trace the relative distance of recreated sound image, by extracting the energy ratio of the directional...
As a kind of adrenal tumors, pheochromocytoma is commonly present with serious and potentially lethal cardiovascular complications. In this paper, a novel image segmentation and three-dimensional (3D) visualization framework is proposed to extract and visualize the pheochromocytoma and intratumoral necrosis in multiphase contrast-enhanced computed tomography (CECT) images in “Digital Imaging and Communications...
Complex network spectra features are proposed to be used by the classifier to classify atrial fibrillation (AF) and normal sinus rhythm (NSR). This novel complex network construction method utilizes the fuzzy symbolic dynamics (FSD) and recurrence complex network to analyze the synchronization of cardiac electrical activity. Firstly, the multi-lead epicardial signals recorded from dogs are transformed...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.