Search results

Items from 1 to 20 out of 42 results

chapter

Development of TEO phase for speaker recognition

H A Patil, K K Parhi

2010 International Conference on Signal Processing and Communications (SPCOM) > 1 - 5

2010 International Conference on Signal Processing and Communications (SPCOM 2010)

Most of the speaker recognition systems use system features for speaker recognition which are mostly spectral in nature. Recently, there has been significant work on using source features, viz., prosodies and pitch dynamics, glottal flow derivative, Linear Prediction (LP) residual and its phase, wavelet-domain representation of LP residual, etc for speaker recognition. In this paper, a new source-like...

chapter

2D Wavelet Transform Based Compression of Pseudo-periodic Signals

D. Mathew, V.P. Devassia, T. Thomas

2009 International Conference on Advances in Recent Technologies in Communication and Computing > 724 - 726

2009 International Conference on Advances in Recent Technologies in Communication and Computing. ARTCom 2009

This paper attempts to utilize the pitch synchronous property of Pseudo-periodic signals to increase the efficiency of compression, to minimize losses and thus to enhance the quality of the reconstruction. Results show higher signal to noise ratio, higher compression ratio and lower percentage distortion with the new method of 2-D compression as compared to 1-D compression. A new method is used for...

chapter

Glottal closure instant detection using Lines of Maximum Amplitudes (LOMA) of thewavelet transform

N. Sturmel, C. d'Alessandro, F. Rigaud

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 4517 - 4520

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

The Lines Of Maximum Amplitude (LOMA) of the wavelet transform are used for glottal closure instant detection. Following Kadambe & al. (1992), the wavelet transform modulus maxima can be used for singularity detection. The LOMA method extends this idea. All the lines chaining maxima of a wavelet transform across scales are built. Then a back-tracking procedure allows for selection of the optimal...

chapter

Speech signal enhancement using neural network and wavelet transform

K. Daqrouq, I.N. Abu-Isbeih, M. Alfauri

2009 6th International Multi-Conference on Systems, Signals and Devices > 1 - 6

2009 6th International Multi-Conference on Systems, Signals and Devices

Speech enhancement is concerned with the processing of corrupted or noisy speech signal in order to improve the quality or intelligibility of the signal. Our goal is to enhance speech signal corrupted by noise to obtain a clean signal with higher quality. However, the presence of noise in speech signals will contribute to a high degree of inaccuracy in a system that requires speech processing. This...

chapter

Complex Wavelet Modulation Subbands for Speech Compression

J.-M. Luneau, J. Lebrun, S.H. Jensen

2009 Data Compression Conference > 457

2009 Data Compression Conference. DCC 2009

Low-frequency modulation of sound carry essential information for speech and music. They must be preserved for compression. The complex modulation spectrum has already been used for audio compression and is commonly obtained by spectral analysis of the sole temporal envelopes of the subbands out of a time/frequency analysis (modified discrete cosine transform combined with a modified discrete sine...

chapter

Preliminary segmentation of speech signals for the tasks of their recognition

V. Pavlysh, Y. Romanyshyn, V. Tkachenko

2009 5th International Conference on Perspective Technologies and Methods in MEMS Design > 144

2009 Vth International Conference on Perspective Technologies and Methods in MEMS Design. MEMSTECH'2009

In this paper some questions of analysis of methods of preliminary segmentation of speech signals and their features for the tasks of recognition are considered.

chapter

Spectral Multi-Scale Analysis for Multi-Pitch Tracking

M.A. Ben Messaoud, A. Bouzid, N. Ellouze

2009 IEEE 13th Digital Signal Processing Workshop and 5th IEEE Signal Processing Education Workshop > 26 - 31

2009 IEEE 13th Digital Signal Processing Workshop and 5th IEEE Signal Processing Education Workshop

This paper proposes a robust and accurate multi-pitch estimation method for multiple voices. This method is based on the spectral analysis of the mixture sound multi-scale product. The multi-scale product (PM) consists of making the product of wavelet transform coefficients. The wavelet used is the quadratic spline function. Simulation results showed that the proposed method can robustly estimate...

chapter

A New Algorithm for Speech Enhancement Using Wavelet Packet Transform Based on Auditory Model

Wang Na, Zheng Dezhong, Xu Shuang, Zhang Shuqing

2008 International Conference on Computer Science and Software Engineering > 4 > 1000 - 1003

2008 International Conference on Computer Science and Software Engineering (CSSE 2008)

Human auditory has non-linear characteristics, while wavelet packet transform (WPT) has flexible analysis ability to time-frequency property so that it is more compatible to simulate the human auditory model. In this paper, human auditory model is analyzed, after which a new algorithm for speech enhancement using node-threshold wavelet packet transform based on bark-scaled decomposition is established,...

chapter

A novel fast noise robust Vietnamese speech recognition applied for robot control

Phung Trung Nghia, Thai Quang Vinh

2008 10th International Conference on Control, Automation, Robotics and Vision > 821 - 826

2008 10th International Conference on Control, Automation, Robotics and Vision (ICARCV 2008)

Most of researches on speech recognition in the world concentrate on improving the large vocabulary of the corpus. In real-time robot control by speech commands, speech recognition is usually no need very large vocabulary but the fast implementation and the noise robust is prerequisite. This study proposes a novel fast noise robust wavelet-based Vietnamese speech recognition applied for robot control...

chapter

Complex wavelet based modulation analysis

J.-M. Luneau, J. Lebrun, S.H. Jensen

2008 42nd Asilomar Conference on Signals, Systems and Computers > 1224 - 1228

2008 42nd Asilomar Conference on Signals, Systems and Computers

Low-frequency modulation of sound carry important information for speech and music. The modulation spectrum is commonly obtained by spectral analysis of the sole temporal envelopes of the sub-bands out of a time-frequency analysis. Processing in this domain usually creates undesirable distortions because only the magnitudes are taken into account and the phase data is often neglected. We remedy this...

chapter

Pitch detection method for noisy speech signals based on pre-filter and weighted wavelet coefficients

Ru-wei Li, Chang-chun Bao, Hui-jing Dou

2008 9th International Conference on Signal Processing > 530 - 533

2008 9th International Conference on Signal Processing (ICSP 2008)

Most of the current pitch detection algorithms can not work well under the high noise environment. For this reason, a pitch detection algorithm for noisy speech signal based on pre-filtering and weighted wavelet coefficients is proposed. Firstly, the noisy speech signals are pre-filtered. Secondly, the speech pre-filtered is decomposed by the quadratic spline wavelet. Thirdly, the wavelet coefficients...

chapter

Robust Endpoint Detection Algorithm of Chinese

Jing Dong, Xiaohui Zhao, Shifeng Ou

2008 4th International Conference on Wireless Communications, Networking and Mobile Computing > 1 - 5

2008 4th International Conference on Wireless Communications, Networking and Mobile Computing (WiCOM)

In order to solve the problem of endpoint detection in presence of multi noises, this paper presents a robust algorithm of Chinese. Without any priori information of noise statistics, this approach employs the autocorrelation of low frequency coefficients in wavelet transform to detect the endpoint of voiced signal, and combines with the power spectral density of noisy speech to determine the commence...

chapter

Phoneme recognition using neural networks

D. Vassallo, E. Gatt

2008 15th IEEE International Conference on Electronics, Circuits and Systems > 506 - 509

2008 15th IEEE International Conference on Electronics, Circuits and Systems (ICECS 2008)

A phoneme recognition system using an anti-symmetric multi-stage filter bank structure is presented. In a filter bank the input signal is convolved with digital filters having different cut-off frequencies so that the signal is analysed at different frequencies with different resolutions. The percentage energy content in each signal decomposition level is calculated and used as input to the artificial...

chapter

Robust voice conversion systems using MFDWC

M. Farhid, M.A. Tinati

2008 International Symposium on Telecommunications > 778 - 781

2008 International Symposium on Telecommunications

Voice conversion is a method used to transform one speakerpsilas voice into another speakerpsilas voice. New modification approach for voice conversion is proposed in this paper. We take Mel-frequency Discrete Wavelet coefficients (MFDWC) as the basic feature. This feature copes well with small training sets of high dimension, which is a problem often encountered in voice conversion. The proposed...

chapter

Speaker Identification Wavelet Transform based method

K. Daqrouq, W. Al-Sawalmeh, A.-R. Al-Qawasmi, I.N. Abu-Isbeih

2008 5th International Multi-Conference on Systems, Signals and Devices > 1 - 5

2008 5th International Multi-Conference on Systems, Signals and Devices

One of the most important signal processing method in digital signal processing discipline is speaker identification method (SIM). Because of the difficult nature of speech signals and their fast variation with time, the wavelet transform is used to reduce the complexity of such signals. In this paper two identification methods are presented based on Continuous Wavelet Transform CWT. The first method...

chapter

Pathological speech deformation degree assessment based on integrating feature and neural network

Wang Xu, Han Zhiyan, Wang Jian

2008 27th Chinese Control Conference > 441 - 444

2008 Chinese Control Conference (CCC)

In tasks related to the analysis and recognition of pathological speech it is often more important to provide the respective person (e.g. physician) with guidelines for a deformation degree assessment of speech signal than to achieve a very accurate automated recognition. By ear it is easy to judge whether the speech is regular or deformed, but any attempt of a deformation degree evaluation is not...

chapter

Speech recognition based on wavelet packet transform and K-L expansion

Xu Wang, Zhiyan Han, Jian Wang, Yujuan Ma

2008 Chinese Control and Decision Conference > 2490 - 2493

2008 Chinese Control and Decision Conference (CCDC)

Based on the dynamic characteristic of speech signal, we proposed a new method of number speech recognition using wavelet packet transform and K-L expansion. Firstly, speech signals underwent a series of preprocessing course including pre-filtering, quantification, pre-emphasizing and endpoint detector. Secondly, using wavelet packet transform extracted the relative energies in 32 sub-bands and the...

chapter

Speech feature extraction of cochlear implants on the basis of auditory perception wavelet transform

Zhi Tao, Heming Zhao, Jihua Gu, Xuedan Tan, more

2008 International Conference on Audio, Language and Image Processing > 80 - 86

2008 International Conference on Audio, Language and Image Processing

A method, which is on the basis of auditory perception wavelet transform, is proposed to model the speech process and extract features for cochlear implants. First, the original speech signal is decomposed by using an auditory perception wavelet transform. Second, a linear predictive coding method is used to extract the fundamental frequency and formant frequency in the perception channel. Experimental...

chapter

Enhanced human-computer speech interface using wavelet computing

S. Ayat

2008 IEEE Conference on Virtual Environments, Human-Computer Interfaces and Measurement Systems > 37 - 40

2008 IEEE Conference on Virtual Environments, Human-Computer Interfaces and Measurement Systems

In this paper, we design an enhanced human-computer speech interface by wavelet transform. By using a new thresholding algorithm and shrink function, we improve the efficiency of the speech interface. This shrink function tries to decrease sharp time-frequency spectrogram discontinuities by attenuating the wavelet coefficients instead of setting them to zero. This attenuation will be done regarding...

chapter

Logitboost weka classifier speech segmentation

B. Ziolko, S. Manandhar, R.C. Wilson, M. Ziolko

2008 IEEE International Conference on Multimedia and Expo > 1297 - 1300

2008 IEEE International Conference on Multimedia and Expo (ICME)

Segmenting the speech signals on the basis of time-frequency analysis is the most natural approach. Boundaries are located in places where energy of some frequency subband rapidly changes. Speech segmentation method which bases on discrete wavelet transform, the resulting power spectrum and its derivatives is presented. This information allows to locate the boundaries of phonemes. A statistical classification...

Keywords:
WAVELET TRANSFORMS
TRANSFORMS
SPEECH
Publication type:
book

Publication date

Set your own date range

Keywords

SPEECH PROCESSING (28)
NOISE (20)
SIGNAL PROCESSING (19)
ARTIFICIAL NEURAL NETWORKS (15)
ACOUSTICS (13)
FEATURE EXTRACTION (13)
SIGNAL PROCESSING ALGORITHMS (13)
EQUATIONS (11)
SPEECH RECOGNITION (11)
FREQUENCY DOMAIN ANALYSIS (10)
IMAGE PROCESSING (10)
SIGNAL TO NOISE RATIO (10)
WAVELET PACKETS (10)
ALGORITHM DESIGN AND ANALYSIS (9)
CONFERENCES (9)
DISCRETE WAVELET TRANSFORMS (9)
EDUCATIONAL INSTITUTIONS (9)
FILTERING (9)
FILTERING THEORY (9)
TIME FREQUENCY ANALYSIS (9)
TESTING (8)
WHITE NOISE (8)
ACCURACY (7)
COMPUTERS (7)
MATHEMATICAL MODEL (7)
ROBUSTNESS (7)
SIGNAL RESOLUTION (7)
SPEECH ENHANCEMENT (7)
TRAINING (7)
WAVELET DOMAIN (7)
WAVELET TRANSFORM (7)
ANALYTICAL MODELS (6)
DATA MINING (6)
ELECTRONIC MAIL (6)
ESTIMATION (6)
FREQUENCY MODULATION (6)
LOW PASS FILTERS (6)
MAXIMUM LIKELIHOOD DETECTION (6)
SIGNAL DETECTION (6)
SPEECH SIGNAL (6)
WAVELET ANALYSIS (6)
CLASSIFICATION ALGORITHMS (5)
COMPUTATIONAL EFFICIENCY (5)
COMPUTER VISION (5)
CORRELATION (5)
DETECTORS (5)
DISCRETE COSINE TRANSFORMS (5)
FILTER BANK (5)
GAIN (5)
IMAGE CODING (5)
MANGANESE (5)
NOISE MEASUREMENT (5)
REVIEWS (5)
SPEECH CODING (5)
TIME DOMAIN ANALYSIS (5)
ADDITIVE NOISE (4)
APPROXIMATION METHODS (4)
AUDITORY SYSTEM (4)
COMPLEXITY THEORY (4)
COMPUTATIONAL MODELING (4)
DATABASES (4)
DISCRETE WAVELET TRANSFORM (4)
ENCODING (4)
FOURIER TRANSFORMS (4)
FREQUENCY ESTIMATION (4)
GAUSSIAN NOISE (4)
IEEE TRANSACTIONS ON IMAGE PROCESSING (4)
IMAGE EDGE DETECTION (4)
IMAGE RECOGNITION (4)
IMAGE RECONSTRUCTION (4)
IMAGE RESOLUTION (4)
IMAGE SEGMENTATION (4)
MACHINE LEARNING (4)
MEL FREQUENCY CEPSTRAL COEFFICIENT (4)
MONITORING (4)
NONLINEAR FILTERS (4)
PERIODIC STRUCTURES (4)
PRESSES (4)
SUPPORT VECTOR MACHINE CLASSIFICATION (4)
WAVELET PACKET TRANSFORM (4)
ADAPTATION MODEL (3)
APPROXIMATION ALGORITHMS (3)
AUDIO SIGNAL PROCESSING (3)
AUTOMATION (3)
BACKGROUND NOISE (3)
BANDWIDTH (3)
BRIGHTNESS (3)
COMPUTER ARCHITECTURE (3)
COMPUTER LANGUAGES (3)
COUPLINGS (3)
DATA COMPRESSION (3)
DATA MODELS (3)
DETECTION ALGORITHMS (3)
DIGITAL FILTERS (3)
DIGITAL SIGNAL PROCESSING (3)
FAULT DIAGNOSIS (3)
FILTERING ALGORITHMS (3)
more

INFONA - science communication portal

Search results

Development of TEO phase for speaker recognition

2D Wavelet Transform Based Compression of Pseudo-periodic Signals

Glottal closure instant detection using Lines of Maximum Amplitudes (LOMA) of thewavelet transform

Speech signal enhancement using neural network and wavelet transform

Complex Wavelet Modulation Subbands for Speech Compression

Preliminary segmentation of speech signals for the tasks of their recognition

Spectral Multi-Scale Analysis for Multi-Pitch Tracking

A New Algorithm for Speech Enhancement Using Wavelet Packet Transform Based on Auditory Model

A novel fast noise robust Vietnamese speech recognition applied for robot control

Complex wavelet based modulation analysis

Pitch detection method for noisy speech signals based on pre-filter and weighted wavelet coefficients

Robust Endpoint Detection Algorithm of Chinese

Phoneme recognition using neural networks

Robust voice conversion systems using MFDWC

Speaker Identification Wavelet Transform based method

Pathological speech deformation degree assessment based on integrating feature and neural network

Speech recognition based on wavelet packet transform and K-L expansion

Speech feature extraction of cochlear implants on the basis of auditory perception wavelet transform

Enhanced human-computer speech interface using wavelet computing

Logitboost weka classifier speech segmentation

Filter options

Publication date

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options