Search results

Items from 1 to 20 out of 52 results

chapter

Glottal pathology discrimination using ANN and SVM

Ashwini Visave, Pramod Kachare, Amutha Jeyakumar, Alice Cheeran, more

2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 1377 - 1381

2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

Use of modern technological advances in real-time biomedical analysis is very crucial. Current work focuses on glottal pathology discrimination based on non-invasive speech analysis techniques. Primary set back in developing such method is irregular performance depreciation of several state of the art acoustic features. To excuse such problems, we have used glottal to noise excitation ratio, which...

chapter

Robust impaired speech segmentation using neural network mixture model

Sunday Iliya, Dylan Menzies, Ferrante Neri, Pip Cornelius, more

2014 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT) > 444 - 449

2014 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT)

This paper presents a signal processing technique for segmenting short speech utterances into unvoiced and voiced sections and identifying points where the spectrum becomes steady. The segmentation process is part of a system for deriving musculoskeletal articulation data from disordered utterances, in order to provide training feedback for people with speech articulation problem. The approach implement...

chapter

Ensemble integration of calibrated speaker localization and statistical speech detection in domestic environments

Yuuki Tachioka, Tomohiro Narita, Shinji Watanabe, Jonathan Le Roux

2014 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA) > 162 - 166

2014 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA)

This paper describes speaker localization and speech detection techniques for domestic environments. In real environments, it is hard to localize speakers because reverberation causes discrepancy from the simple spherical wave assumption. We propose a template-based method that calibrates the localization errors included in conventional methods. In addition, we use statistical speech detection methods...

chapter

Speech/music indexing for audio life-logs from portable device record

Yali Zheng, Yoshifumi Chisaki, Tsuyoshi Usagawa

2013 International Conference on Advanced Computer Science and Information Systems (ICACSIS) > 173 - 178

2013 International Conference on Advanced Computer Science and Information Systems (ICACSIS)

Audio plays an important role among information sources in our life. As a result of current technology, it is available to record people's huge personal life activities as life-logs from long-term and multi-dimensional point of view on portable device. In order to make record be effective, this paper focuses on implementing the classification among speech, music and other kinds of sound around, which...

chapter

Speaker identification using pykfec and AANN

Shanthini Pandiaraj, D. Synthiya Vinothini, H. Nisha Rachel Keziah, Lineeta Gloria, more

2011 3rd International Conference on Electronics Computer Technology > 3 > 313 - 316

2011 3rd International Conference on Electronics Computer Technology (ICECT)

This paper presents the parameterization of speech based on amplitude and frequency modulation (AM-FM) model and its application to speaker identification. Speech parameterization is based on three different bandwidths. The speaker identification is done using auto associative neural network. The AANN is trained with SOLO speaking style speech signal, and a network is created for each speaker. The...

chapter

An endpoint detection algorithm based on MFCC and spectral entropy using BP NN

Haiying Zhang, Hailong Hu

2010 2nd International Conference on Signal Processing Systems > 2 > V2-509 - V2-513

2010 2nd International Conference on Signal Processing Systems (ICSPS 2010)

Endpoint detection is the preliminary job of speech signal processing, it is vital to speech recognition. Most of recent endpoint detection algorithms will give a satisfied result at high SNRs (signal-to-noise ratio), while they might fail in occasion where the noise level is too excessive. In this paper, a novel endpoint detection algorithm based on 12-order MFCC and spectral entropy in the framework...

chapter

A distributed model of memory for the McGurk effect

I Sporea, A Gruning

The 2010 International Joint Conference on Neural Networks (IJCNN) > 1 - 4

2010 International Joint Conference on Neural Networks (IJCNN 2010)

The present paper is investigating the modelling of the McGurk effect, an audio-visual speech perceptual illusion, with a distributed model of memory. The network is trained with congruent auditory and visual patterns and tested with incongruent sets of patterns considered to produce the McGurk effect.

chapter

An approach for recofnition of speech sifnal throuf h Neural Networks

Chiranjib nur

2010 The 2nd International Conference on Computer and Automation Engineering (ICCAE) > 1 > 215 - 217

2nd International Conference on Computer and Automation Engineering (ICCAE 2010)

Speech sifnal is very unpredictable, inconsistent and produces various curves each time it is plotted. This makes speech sifnal a very challenfinf field in terms of its reception, modulation, demodulation, filterinf and processinf. So it very difficult to formulate a feneralized mathematical model for such a sifnal. However it can be shown that flexible systems like Neural Networks can be implemented...

chapter

Extended Minimum Classification Error Training in Voice Activity Detection

T. Arakawa, H. Al-Hassanieh, M. Tsujikawa, R. Isotani

2009 IEEE Workshop on Automatic Speech Recognition&Understanding > 232 - 236

2009 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU 2009)

Voice activity detection (VAD) is a fundamental part of speech processing. Combination of multiple acoustic features is an effective approach to make VAD more robust against various noise conditions. There have been proposed several feature combination methods, in which weights for feature values are optimized based on minimum classification error (MCE) training. We improve these MCE-based methods...

chapter

Noise reduction algorithm for robust speech recognition using MLP neural network

M.P. Ghaemmaghami, F. Razzazi, H. Sameti, S. Dabbaghchian, more

2009 Asia-Pacific Conference on Computational Intelligence and Industrial Applications (PACIIA) > 1 > 377 - 380

2009 Asia-Pacific Conference on Computational Intelligence and Industrial Applications (PACIIA 2009)

We propose an efficient and effective nonlinear feature domain noise suppression algorithm, motivated by the minimum mean square error (MMSE) optimization criterion. Multi layer perceptron (MLP) neural network in the log spectral domain minimizes the difference between noisy and clean speech. By using this method as a pre-processing stage of a speech recognition system, the recognition rate in noisy...

chapter

Application of Uni-Directional Microphone Array for Identifying English Pronunciation Errors

Bo Zhang, Xin Zhuang, Pan Huang, Chen Feng, more

2009 2nd International Congress on Image and Signal Processing > 1 - 5

2009 2nd International Congress on Image and Signal Processing (CISP)

To identify the English pronunciation errors made by Chinese learners, this paper utilizes uni-directional microphones to construct a superdirective beamformer for capturing high quality input speech, and integrates the techniques of anti-model and confidence measure into the speech recognizer for accurate identification of the speaker's pronunciation errors. As to the beamformer, although designing...

chapter

A Speaker Verification System Using SVM over a Spanish Corpus

Juan Gabriel Pedroza Bernal, Alfonso Prieto Guerrero, John Goddard Close

2009 Mexican International Conference on Computer Science > 381 - 386

Tenth Mexican International Conference on Computer Science (ENC 2009)

This paper presents a description of the principal aspects employed in the development of a speaker verification system based on a Spanish corpus. The main goal is to obtain classification results and behavior using Support Vector Machines (SVM) as the classifier technique. The most relevant aspects involved in developing a Spanish corpus are given. For the front end processing a novel method to suppress...

chapter

Improved wavelet pre-enhancement and hybrid model applied in speech recognition system

Wanliang Wang, Jianwei Zheng, Wang Lei

2009 7th Asian Control Conference > 1600 - 1604

2009 7th Asian Control Conference (ASCC 2009)

After study on the robust optimization of speech recognition system, we propose an improved wavelet thresholds de-noising method and combine it with the temporal filter to pre-enhance the noisy speech signals before recognition, which leads to good results. Then a hybrid model of hidden Markov and BP neural network is proposed, using BP to get the HMM (hidden Markov model) observation probability,...

chapter

Time Delay Estimation in Spatial Noisy and Reverberant Environments

Yi Zhang, Fuliang Yin

2008 Second International Symposium on Intelligent Information Technology Application > 3 > 459 - 463

2008 Second International Symposium on Intelligent Information Technology Application

This paper proposes an eigenvalue decomposition algorithm for robust time-delay estimation (TDE) based on triple microphone in situations where reverberation and spatial noise are present. This algorithm regards time delay estimation with spatial noise and reverberation as a blind channel identification issue of a double-input triple-output system, and use lag correlation matrix to reduce the spatial...

chapter

FDM array based dual channel speech enhancement method

Weiwei Cui, Zhigang Cao

2008 9th International Conference on Signal Processing > 382 - 385

2008 9th International Conference on Signal Processing (ICSP 2008)

With more and more miniature speech communication devices coming out, two-element microphone array draws a lot of attention due to its simplicity and the ability to suppress directional noise. This paper develops a dual channel speech enhancement method by combining the first- order differential microphone (FDM) array and the single-channel spectral enhancement techniques. The method can obtain an...

chapter

Microphone array speech enhancement based on a generalized post-filter and a novel perceptual filter

Ning Cheng, Wen-Ju Liu, Peng Li, Bo Xu

2008 9th International Conference on Signal Processing > 370 - 373

2008 9th International Conference on Signal Processing (ICSP 2008)

The theoretic foundation of traditional microphone array post-filters is the assumption that the noise between sensors is uncorrelated. However, this assumption is inaccurate in real environments since the correlated noise exists. In this paper, a generalized microphone array post-filter is proposed to deal with both the correlated and uncorrelated noise in environments and a novel perceptual filter...

chapter

Robust speech endpoint detection in aircraft cockpit voice background

Lei Ming, Li Guo, Li Xue-ren

2008 9th International Conference on Signal Processing > 676 - 679

2008 9th International Conference on Signal Processing (ICSP 2008)

This paper addresses the problem of robust speech endpoint detection in aircraft cockpit voice background. The proposed method described in this paper is based on a statistical model approach. Based on the voice background characteristics analysis, the complex Laplacian distribution model that directly aim at noisy speech is established; then the likelihood ratio test (LRT) based on binary hypothesis...

chapter

LP-based over-sampled subband Adaptive Noise Canceller for speech enhancement in diffuse noise fields

S. Khorram, H. Sameti, H. Veisi

2008 9th International Conference on Signal Processing > 157 - 161

2008 9th International Conference on Signal Processing (ICSP 2008)

Adaptive noise cancellers (ANCs) do not provide sufficient noise reduction in the diffuse noise fields. In this paper, a new hybrid structure is proposed as a solution to this problem. The proposed system is a combination of two subsystems, an ANC and a new multistage post-filter. The post-filter is based on linear prediction (LP) and attempts to extract speech component by using intermediate ANC...

chapter

A Neural Network based local SNR estimation for estimating spectral masks

A.H. Hadjahmadi, M.M. Homayounpour, S.M. Ahadi

2008 International Symposium on Telecommunications > 608 - 613

2008 International Symposium on Telecommunications

In this work, we present a new mask estimation technique that uses a neural network classifier to determine the reliability of spectrographic elements. In addition some different kinds of features used for classification were compared that make no assumptions about the corrupting noise signal, but rather exploit spectrographic characteristics of the speech signal. The performance of the proposed method...

chapter

Speech enhancement using a Kalman-based normalized LMS algorithm

A. Mahmoodzadeh, H.R. Abutalebi, H. Agahi

2008 International Symposium on Telecommunications > 555 - 558

2008 International Symposium on Telecommunications

This paper deals with the problem of Adaptive Noise Cancellation (ANC) for the speech signal corrupted with an additive white Gaussian noise. After explaining the least Mean Square (LMS)-based adaptive filter and Kalman filter, we examine the hybrid Kalman-based LMS (KLMS) technique for adaptation of the ANC. The proposed technique suggests a way to normalize LMS algorithm using Kalman filter. Our...

Keywords:
NOISE
SPEECH
ARTIFICIAL NEURAL NETWORKS

Publication date

Set your own date range

Publication type

book (51)
article (1)

Keywords

SIGNAL PROCESSING (24)
SPEECH PROCESSING (24)
SIGNAL PROCESSING ALGORITHMS (21)
ESTIMATION (19)
SIGNAL TO NOISE RATIO (19)
ALGORITHM DESIGN AND ANALYSIS (18)
EQUATIONS (18)
EDUCATIONAL INSTITUTIONS (15)
ACOUSTICS (14)
CORRELATION (14)
MATHEMATICAL MODEL (14)
SPEECH RECOGNITION (14)
TRANSFORMS (14)
ACCURACY (13)
FEATURE EXTRACTION (12)
FILTERING (12)
ROBUSTNESS (12)
ELECTRONIC MAIL (11)
SIMULATION (11)
TRAINING (11)
DATA MODELS (10)
SPEECH ENHANCEMENT (10)
COMPLEXITY THEORY (9)
COMPUTERS (9)
CONFERENCES (9)
MICROPHONES (9)
REAL TIME SYSTEMS (9)
COMPUTATIONAL MODELING (8)
FREQUENCY MODULATION (8)
GAUSSIAN NOISE (8)
WAVELET TRANSFORMS (8)
WHITE NOISE (8)
ADDITIVE NOISE (7)
DATA MINING (7)
FILTERING ALGORITHMS (7)
FREQUENCY DOMAIN ANALYSIS (7)
MAXIMUM LIKELIHOOD ESTIMATION (7)
MULTIMEDIA COMMUNICATION (7)
NOISE MEASUREMENT (7)
SUPPORT VECTOR MACHINE CLASSIFICATION (7)
CLASSIFICATION ALGORITHMS (6)
COMPUTATIONAL EFFICIENCY (6)
EIGENVALUES AND EIGENFUNCTIONS (6)
FREQUENCY ESTIMATION (6)
IMAGE CODING (6)
IMAGE PROCESSING (6)
IMAGE RESOLUTION (6)
MAXIMUM LIKELIHOOD DETECTION (6)
NEURAL NETS (6)
NOISE REDUCTION (6)
SONAR (6)
TIME FREQUENCY ANALYSIS (6)
ADAPTATION MODEL (5)
ADAPTIVE FILTERS (5)
ADDITIVES (5)
ARRAY SIGNAL PROCESSING (5)
ARRAYS (5)
BANDWIDTH (5)
DEMODULATION (5)
FILTERING THEORY (5)
IEEE TRANSACTIONS ON SIGNAL PROCESSING (5)
IMAGE EDGE DETECTION (5)
IMAGE RECONSTRUCTION (5)
MANGANESE (5)
MEL FREQUENCY CEPSTRAL COEFFICIENT (5)
OPTIMIZATION (5)
PARAMETER ESTIMATION (5)
POLYNOMIALS (5)
SIGNAL DETECTION (5)
SIGNAL RESOLUTION (5)
SPEAKER RECOGNITION (5)
TIME DOMAIN ANALYSIS (5)
VECTORS (5)
ANALYTICAL MODELS (4)
APPROXIMATION ALGORITHMS (4)
APPROXIMATION METHODS (4)
BACKGROUND NOISE (4)
CIRCUITS AND SYSTEMS (4)
CLUSTERING ALGORITHMS (4)
COMPUTER LANGUAGES (4)
CONVERGENCE (4)
DATABASES (4)
DELAY (4)
DISCRETE COSINE TRANSFORMS (4)
DISCRETE FOURIER TRANSFORMS (4)
GAIN (4)
GALLIUM NITRIDE (4)
IEEE TRANSACTIONS ON IMAGE PROCESSING (4)
IMAGE QUALITY (4)
IMAGE RECOGNITION (4)
INTERFERENCE (4)
INTERPOLATION (4)
ITERATIVE METHODS (4)
LABORATORIES (4)
MICROPHONE ARRAYS (4)
MODULATION (4)
MULTIMEDIA SYSTEMS (4)
more

INFONA - science communication portal

Search results

Glottal pathology discrimination using ANN and SVM

Robust impaired speech segmentation using neural network mixture model

Ensemble integration of calibrated speaker localization and statistical speech detection in domestic environments

Speech/music indexing for audio life-logs from portable device record

Speaker identification using pykfec and AANN

An endpoint detection algorithm based on MFCC and spectral entropy using BP NN

A distributed model of memory for the McGurk effect

An approach for recofnition of speech sifnal throuf h Neural Networks

Extended Minimum Classification Error Training in Voice Activity Detection

Noise reduction algorithm for robust speech recognition using MLP neural network

Application of Uni-Directional Microphone Array for Identifying English Pronunciation Errors

A Speaker Verification System Using SVM over a Spanish Corpus

Improved wavelet pre-enhancement and hybrid model applied in speech recognition system

Time Delay Estimation in Spatial Noisy and Reverberant Environments

FDM array based dual channel speech enhancement method

Microphone array speech enhancement based on a generalized post-filter and a novel perceptual filter

Robust speech endpoint detection in aircraft cockpit voice background

LP-based over-sampled subband Adaptive Noise Canceller for speech enhancement in diffuse noise fields

A Neural Network based local SNR estimation for estimating spectral masks

Speech enhancement using a Kalman-based normalized LMS algorithm

Filter options

Publication date

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options