Search results

Items from 1 to 14 out of 14 results

chapter

Feature mapping, score-, and feature-level fusion for improved normal and whispered speech speaker verification

Milton Sarria-Paja, Mohammed Senoussaoui, Douglas O'Shaughnessy, Tiago H. Falk

2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5480 - 5484

ICASSP 2016 - 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, automatic speaker verification using normal and whispered speech is explored. Typically, for speaker verification systems with varying vocal effort inputs, standard solutions such as feature mapping or addition of data during parameter estimation (training) and enrollment stages result in a trade-off between accuracy gains with whispered test data and accuracy losses (up to 70% in equal...

chapter

Comparison of different representations based on nonlinear features for music genre classification

Athanasia Zlatintsi, Petros Maragos

2014 22nd European Signal Processing Conference (EUSIPCO) > 1547 - 1551

2014 22nd European Signal Processing Conference (EUSIPCO)

In this paper, we examine the descriptiveness and recognition properties of different feature representations for the analysis of musical signals, aiming in the exploration of their microand macro-structures, for the task of music genre classification. We explore nonlinear methods, such as the AM-FM model and ideas from fractal theory, so as to model the timevarying harmonic structure of musical signals...

chapter

Adaptive order of fractional Fourier transform for whispered speaker identification

Qian Xiaohong, Zhao Heming

International Conference on Automatic Control and Artificial Intelligence (ACAI 2012) > 363 - 366

International Conference on Automatic Control and Artificial Intelligence (ACAI 2012)

A method widely used in speech signal analysis is based on short-time Fourier transform (STFT), but STFT only provides “average” characteristics of a signal, which can't depict the refined structure of speech. Therefore, a new speech analysis tool called fractional Fourier transform (FRFT) is introduced into this article. The transform orders for FRFT are adaptively set according to piecewise linear...

article

An Improved Scheme for Full Fingerprint Reconstruction

Sheng Li, Alex C. Kot

IEEE Transactions on Information Forensics and Security > 2012 > 7 > 6 > 1906 - 1912

Different fingerprint recognition systems store minutiae-based fingerprint templates differently. Some store them inside a small token; some can be found in a server database. As the minutiae template is very compact, many take it for granted that the template does not contain sufficient information for reconstructing the original fingerprint. This paper proposes a scheme to reconstruct a full fingerprint...

chapter

Spectral-envelope and group-delay models for transient signals—Applications to castanets and stop consonants

Ravi R. Shenoy, Chandra Sekhar Seelamantula

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 521 - 524

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We present a novel approach to represent transients using spectral-domain amplitude-modulated/frequency-modulated (AM-FM) functions. The model is applied to the real and imaginary parts of the Fourier transform (FT) of the transient. The suitability of the model lies in the observation that since transients are well-localized in time, the real and imaginary parts of the Fourier spectrum have a modulation...

chapter

Speaker identification using pykfec and AANN

Shanthini Pandiaraj, D. Synthiya Vinothini, H. Nisha Rachel Keziah, Lineeta Gloria, more

2011 3rd International Conference on Electronics Computer Technology > 3 > 313 - 316

2011 3rd International Conference on Electronics Computer Technology (ICECT)

This paper presents the parameterization of speech based on amplitude and frequency modulation (AM-FM) model and its application to speaker identification. Speech parameterization is based on three different bandwidths. The speaker identification is done using auto associative neural network. The AANN is trained with SOLO speaking style speech signal, and a network is created for each speaker. The...

chapter

Detection of voice onset time using FB expansion and AM-FM model

Ram Bilas Pachori, Suryakanth V Gangashetty

10th International Conference on Information Science, Signal Processing and their Applications (ISSPA 2010) > 149 - 152

2010 10th International Conference on Information Sciences, Signal Processing and their Applications (ISSPA 2010)

The voice onset time (VOT) combines the temporal and frequency structure over very short duration. This makes the VOT detection task difficult. But the VOT is an important temporal feature. In this paper we propose a new method for the detection of VOT in speech utterances. The method uses Fourier-Bessel (FB) expansion followed by amplitude and frequency modulated (AM-FM) signal model. The FB expansion...

chapter

Coherent texture decomposition using AM-FM model

Chuong T Nguyen, Joseph P Havlicek

2010 IEEE Southwest Symposium on Image Analysis&Interpretation (SSIAI) > 81 - 84

2010 IEEE Southwest Symposium on Image Analysis & Interpretation (SSIAI)

We introduce a novel decomposition algorithm capable of extracting locally coherent and visually meaningful texture components from images. The algorithm estimates texture dominant orientation for each coherent component and iteratively extracts it from the image based on a new quantitative coherency measure formulated in the modulation domain. The original image is perfectly reconstructed from extracted...

chapter

Robust Q Features for Speaker Identification

M.S. Deshpande, R.S. Holambe

2009 International Conference on Advances in Recent Technologies in Communication and Computing > 209 - 213

2009 International Conference on Advances in Recent Technologies in Communication and Computing. ARTCom 2009

In this paper, a nonlinear AM-FM speech model is used to extract robust features for speaker identification. The proposed features measure the amount of amplitude and frequency modulation that the commonly used linear source-filter model and the Mel frequency cepstral coefficients (MFCC) feature fails to capture. From the short time estimates of the frequency and bandwidth, a novel set of features...

chapter

Statistical analysis of amplitude modulation in speech signals using an AM-FM model

P. Tsiakoulis, A. Potamianos

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 3981 - 3984

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

Several studies have been dedicated to the analysis and modeling of AM-FM modulations in speech and different algorithms have been proposed for the exploitation of modulations in speech applications. This paper details a statistical analysis of amplitude modulations using a multiband AM-FM analysis framework. The aim of this study is to analyze the phonetic- and speaker-dependency of modulations in...

chapter

Feature extraction using an AM-FM model for gait pattern classification

Ning Wang, E. Ambikairajah, B.G. Celler, N.H. Lovell

2008 IEEE Biomedical Circuits and Systems Conference > 25 - 28

BioCAS 2008. IEEE Biomedical Circuits and Systems Conference - Intelligent Biomedical Systems

This paper describes classification of gait patterns from a waist-mounted triaxial accelerometer. A feature extraction technique using empirical mode decomposition (EMD) and an amplitude/frequency modulation (AM-FM) model is proposed for the classification of walking activities from accelerometry data. A set of novel features, including AM, instantaneous frequency (IF) and instantaneous amplitude...

chapter

Content based image retrieval: The foundation for future case-based and evidence-based ophthalmology

S.T. Acton, P. Soliz, S. Russell, M.S. Pattichis

2008 IEEE International Conference on Multimedia and Expo > 541 - 544

2008 IEEE International Conference on Multimedia and Expo (ICME)

For medical and epidemiologic investigators and caregivers, one powerful functionality yet to be developed is the ability to group retinal images based upon common pathologic appearance. Such a tool would enable advances in evidence-based medicine and would accelerate automated or computer-assisted screening and diagnosis. In this report, we show that current, traditional content based image retrieval...

chapter

An AM-FM model for Motion Estimation in Atherosclerotic Plaque Videos

V. Murray, S.E. Murillo, M.S. Pattichis, C.P. Loizou, more

2007 Conference Record of the Forty-First Asilomar Conference on Signals, Systems and Computers > 746 - 750

2007 41st Asilomar Conference on Signals, Systems and Computers (ACSSC '07)

We present new multidimensional amplitude-modulation frequency-modulation (AM-FM) methods for motion estimation. For a single AM-FM component we show that the optical flow constraint leads to separate equations for amplitude modulation (AM) and frequency modulation (FM). We compare our approach with phase-based estimation developed by Fleet and Jepson and also the original optical flow method by Horn...

chapter

Speech Analysis using Fourier-Bessel Expansion and Discrete Energy Separation Algorithm

Ram Pachori, Pradip Sircar

2006 IEEE 12th Digital Signal Processing Workshop&4th IEEE Signal Processing Education Workshop > 423 - 428

2006 IEEE 12th Digital Signal Processing Workshop & 4th IEEE Signal Processing Education Workshop

In this paper, a new technique based on the Fourier-Bessel (FB) expansion is presented for separating multiple formants of a speech signal. The discrete energy separation algorithm (DESA) is applied to an isolated speech formant to extract the instantaneous frequency (IF) and the time-varying amplitude envelope (AE) of the formant. It is demonstrated that the proposed technique which is called the...

Filter options

Publication date

Set your own date range

Publication type

book (13)
article (1)

Keywords

AM-FM MODEL (14)
FREQUENCY MODULATION (9)
AMPLITUDE MODULATION (6)
SPEECH (5)
FEATURE EXTRACTION (4)
BANDWIDTH (2)
BESSEL FUNCTIONS (2)
DISCRETE ENERGY SEPARATION ALGORITHM (2)
FOURIER-BESSEL EXPANSION (2)
IMAGE RECONSTRUCTION (2)
INSTANTANEOUS FREQUENCY (2)
MEDICAL IMAGE PROCESSING (2)
NOISE (2)
SPEAKER IDENTIFICATION (2)
SPEAKER RECOGNITION (2)
SPEECH ANALYSIS (2)
SPEECH PROCESSING (2)
SPEECH RECOGNITION (2)
SPEECH SIGNAL (2)
AANN (1)
ACCELEROMETERS (1)
ACCELEROMETRY (1)
ACCELEROMETRY DATA (1)
ADAPTIVE (1)
AGE-RELATED MACULAR DEGENERATION (1)
AM-FM (1)
AM-FM IMAGE MODEL (1)
AMPLITUDE ENVELOPE (1)
AMPLITUDE ENVELOPE ESTIMATE (1)
AMPLITUDE ESTIMATION (1)
AMPLITUDE MODULATED SIGNAL (1)
AMPLITUDE/FREQUENCY MODULATION (1)
ANALYTICAL MODELS (1)
AREA MORPHOLOGY (1)
ARTIFICIAL NEURAL NETWORKS (1)
ATHEROSCLEROTIC PLAQUE VIDEO (1)
AUTOMATED SCREENING (1)
BAG-OF-WORDS (1)
BAND PASS FILTERS (1)
BIOMEDICAL IMAGING (1)
CASE-BASED OPHTHALMOLOGY (1)
CEPSTRAL ANALYSIS (1)
CLASSIFICATION ALGORITHMS (1)
COHERENT COMPONENT (1)
COHERENT DEMODULATION (1)
COMPUTATIONAL MODELING (1)
COMPUTER-ASSISTED DIAGNOSIS (1)
COMPUTER-ASSISTED SCREENING (1)
CONTENT BASED IMAGE RETRIEVAL (1)
CONTENT-BASED RETRIEVAL (1)
DECOMPOSITION ALGORITHM (1)
DELAY (1)
DEMODULATION (1)
DICHOTOMOUS IMAGE SORTING (1)
DICTIONARIES (1)
EMPIRICAL MODE DECOMPOSITION (1)
ENERGY SEPARATION ALGORITHM (1)
ERROR ANALYSIS (1)
EVIDENCE-BASED MEDICINE (1)
EVIDENCE-BASED OPHTHALMOLOGY (1)
FB-DESA (1)
FEATURE MAPPING (1)
FINGERPRINT (1)
FINGERPRINT RECOGNITION (1)
FOURIER ANALYSIS (1)
FOURIER SERIES (1)
FOURIER TRANSFORMS (1)
FRACTALS (1)
FRACTIONAL FOURIER TRANSFORM (1)
FREQUENCY ESTIMATION (1)
FREQUENCY MODULATED SIGNAL (1)
FREQUENCY STRUCTURE (1)
GABOR FILTERS (1)
GAIT ANALYSIS (1)
GAIT FEATURE EXTRACTION TECHNIQUE (1)
GAIT PATTERN CLASSIFICATION (1)
GAIT PATTERN CLASSIFICATION ERROR RATE (1)
GAUSSIAN DISTRIBUTION (1)
GAUSSIAN MIXTURE MODEL CLASSIFIER (1)
GMM (1)
GRANULOMETRY (1)
GRAY-SCALE (1)
GROUP DELAY ESTIMATION (1)
HUMANS (1)
I-VECTORS (1)
IMAGE ANALYSIS (1)
IMAGE ANALYSIS TOOLS (1)
IMAGE COLOR ANALYSIS (1)
IMAGE DECOMPOSITION (1)
IMAGE RETRIEVAL (1)
IMAGE SEGMENTATION (1)
IMAGE SEQUENCES (1)
IMAGE TEXTURE (1)
INCLINED WALKING CONDITION (1)
INDEXES (1)
INSTANTANEOUS FREQUENCY EXTRACTION (1)
LEGGED LOCOMOTION (1)
LINEAR SOURCE-FILTER MODEL (1)
MANUALS (1)
MEDICAL DIAGNOSTIC IMAGING (1)
more

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options