Search results

Items from 1 to 6 out of 6 results

chapter

Dynamic selection of a speech enhancement method for robust speech recognition in moving motorcycle environment

Iosif Mporas, Todor Ganchev, Otilia Kocsis, Nikos Fakotakis

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5176 - 5179

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We present a speech pre-processing scheme (SPPS) for robust speech recognition in the moving motorcycle environment. The SPPS is dynamically adapted during the run-time operation of the speech front-end, depending on short-time characteristics of the acoustic environment. In detail, the fast varying acoustic environment is modeled by GMM clusters based on which a selection function determines the...

chapter

Learning-based auditory encoding for robust speech recognition

Yu-Hsiang Bosco Chiu, Bhiksha Raj, Richard M Stern

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 4278 - 4281

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

This paper describes ways of speeding up the optimization process for learning physiologically-motivated components of a feature computation module directly from data. During training, word lattices generated by the speech decoder and conjugate gradient descent were included to train the parameters of logistic functions in a fashion that maximizes the a posteriori probability of the correct class...

chapter

Improved voice activity detection using static harmonic features

T Fukuda, O Ichikawa, M Nishimura

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 4482 - 4485

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

Accurate voice activity detection (VAD) is important for robust automatic speech recognition (ASR) systems. We have proposed a statistical-model-based VAD using the long-term temporal information in speech, which shows good robustness against noise in an automobile environment. For further improvement, this paper describes a new method to exploit harmonic structure information with statistical models...

chapter

A pattern recognition system for environmental sound classification based on MFCCs and neural networks

F. Beritelli, R. Grasso

2008 2nd International Conference on Signal Processing and Communication Systems > 1 - 4

2008 2nd International Conference on Signal Processing and Communication Systems. ICSPCS'2008

The paper proposes a study of a background noise classifier based on a pattern recognition approach using a neural network. The signals submitted to the neural network are characterised by means of a set of 12 MFCC (Mel frequency cepstral coefficient) parameters typically present in the front end of a mobile terminal. The performance of the classifier, evaluated in terms of percent misclassification,...

chapter

Audio Noise Classification using Bark scale features and K-NN Technique

C. Eamdeelerd, K. Songwatana

2008 International Symposium on Communications and Information Technologies > 131 - 134

2008 International Symposium on Communications and Information Technologies (ISCIT)

This paper presents the audio noise classification using Bark scale features and K-NN technique. This paper uses audio noise signal from NOISEX-92 (12 types). We determine the transfer functions from linear predictive coding (LPC) coefficient of noise signal on Bark scale and use K-NN technique to classify them. The results will be used for optimization of speech recognition model in the presence...

article

SNR-Adaptive Stream Weighting for Audio-MES ASR

Ki-Seung Lee

IEEE Transactions on Biomedical Engineering > 2008 > 55 > 8 > 2001 - 2010

Myoelectric signals (MESs) from the speaker's mouth region have been successfully shown to improve the noise robustness of automatic speech recognizers (ASRs), thus promising to extend their usability in implementing noise-robust ASR. In the recognition system presented herein, extracted audio and facial MES features were integrated by a decision fusion method, where the likelihood score of the audio-MES...

Filter options

Keywords:
ACCURACY
NOISE
SPEECH RECOGNITION
ACOUSTIC NOISE

Publication date

Set your own date range

Publication type

book (5)
article (1)

Keywords

SPEECH (5)
FEATURE EXTRACTION (3)
HIDDEN MARKOV MODELS (3)
TRAINING (3)
ACOUSTIC SIGNAL PROCESSING (2)
ACOUSTICS (2)
ARTIFICIAL NEURAL NETWORKS (2)
AUTOMATIC SPEECH RECOGNITION (2)
MEL FREQUENCY CEPSTRAL COEFFICIENT (2)
NOISE ROBUSTNESS (2)
SIGNAL CLASSIFICATION (2)
SPEECH CODING (2)
SPEECH PROCESSING (2)
SUPPORT VECTOR MACHINE CLASSIFICATION (2)
ADAPTATION MODEL (1)
AIRCRAFT (1)
AIRCRAFT MANUFACTURE (1)
ASR SYSTEMS (1)
AUDIO CODING (1)
AUDIO MES FEATURE (1)
AUDIO NOISE CLASSIFICATION (1)
AUDIO-MES ASR (1)
AUDITORY MODELS (1)
AUTOMATIC SPEECH RECOGNITION SYSTEMS (1)
AUTOMATIC SPEECH RECOGNIZERS (1)
AUTOMOBILE ENVIRONMENT (1)
BABBLE NOISE (1)
BACKGROUND NOISE (1)
BACKGROUND NOISE CLASSIFIER (1)
BARK SCALE FEATURE (1)
CEPSTRAL ANALYSIS (1)
CLASSIFICATION ALGORITHMS (1)
COLOR (1)
CONJUGATE GRADIENT (1)
CONJUGATE GRADIENT DESCENT OPTIMIZATION (1)
CONJUGATE GRADIENT METHODS (1)
CONVERGENCE (1)
COVARIANCE MATRIX (1)
CURVE FITTING (1)
DATA ANALYSIS (1)
DATA MODELS (1)
DECISION FUSION (1)
DECISION FUSION METHOD (1)
DECISION WINDOW (1)
DECODING (1)
DISCRIMINATIVE TRAINING (1)
DISTANCE MEASUREMENT (1)
ELECTRODES (1)
ELECTROMYOGRAPHY (1)
ENVIRONMENTAL SOUND CLASSIFICATION (1)
ESTIMATION (1)
FACIAL MES FEATURE (1)
FEATURE COMPUTATION MODULE (1)
GAUSSIAN DISTRIBUTION (1)
HARMONIC ANALYSIS (1)
HARMONIC STRUCTURE (1)
HARMONIC STRUCTURE INFORMATION (1)
HEARING (1)
HELIUM (1)
INDEXES (1)
INDUSTRIES (1)
K-NN TECHNIQUE (1)
LATTICES (1)
LEARNING-BASED AUDITORY ENCODING (1)
LINEAR PREDICTIVE CODING (1)
LOGISTIC FUNCTIONS (1)
LONG TERM TEMPORAL SPEECH INFORMATION (1)
LONG-TERM TEMPORAL INFORMATION (1)
MACHINE LEARNING (1)
MAMMALIAN AUDITORY SYSTEMS (1)
MAXIMUM LIKELIHOOD ESTIMATION (1)
MAXIMUM MUTUALINFORMATION (MMI) CRITERION (1)
MEDICAL SIGNAL PROCESSING (1)
MFCC (1)
MOBILE TERMINAL (1)
MOTORCYCLES (1)
MUSCLES (1)
MUTUAL INFORMATION (1)
MYOELECTRIC SIGNALS (1)
MYOELECTRIC SIGNALS (MESS) (1)
NEURAL NETS (1)
NEURAL NETWORKS (1)
NOISE MEASUREMENT (1)
OBJECT RECOGNITION (1)
OPTIMALWEIGHTING (1)
OPTIMIZATION (1)
OPTIMIZATION METHODS (1)
PARALLEL ARCHITECTURES (1)
PATTERN CLASSIFICATION (1)
PATTERN RECOGNITION SYSTEM (1)
PHYSIOLOGICALLY-MOTIVATED COMPONENTS (1)
POSTERIORI PROBABILITY (1)
RATE-LEVEL NONLINEARITIES (1)
RELIABILITY (1)
RELIABILITY THEORY (1)
ROBUST SPEECH RECOGNITION (1)
more

INFONA - science communication portal

Search results

Dynamic selection of a speech enhancement method for robust speech recognition in moving motorcycle environment

Learning-based auditory encoding for robust speech recognition

Improved voice activity detection using static harmonic features

A pattern recognition system for environmental sound classification based on MFCCs and neural networks

Audio Noise Classification using Bark scale features and K-NN Technique

SNR-Adaptive Stream Weighting for Audio-MES ASR

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options