Search results for: Babak Nasersharif

Items from 1 to 7 out of 7 results

chapter

Mapping Mel sub-band energies using Deep belief network for robust speech recognition

Mojtaba Gholamipour, Babak Nasersharif

2016 8th International Symposium on Telecommunications (IST) > 510 - 514

2016 8th International Symposium on Telecommunications (IST)

Sub-band speech processing is well-known in robust speech recognition. On the other hand, in recent years, deep neural networks (DNNs) have been widely used in speech recognition for acoustic modeling and also feature extraction and transformation. In this paper, we propose to use deep belief network (DBN) as a post-processing method for de-noising in Mel sub-band level where we enhance logarithm...

chapter

Improved HMM entropy for robust sub-band speech recognition

Babak Nasersharif, Ahmad Akbari

2005 13th European Signal Processing Conference > 1 - 4

2005 13th European Signal Processing Conference

In recent years, sub-band speech recognition has been found useful in robust speech recognition, especially for speech signals contaminated by band-limited noise. In sub-band speech recognition, full band speech is divided into several frequency sub-bands and then sub-band feature vectors or their generated likelihoods by corresponding sub-band recognizers are combined to give the result of recognition...

chapter

Speech/music separation using non-negative matrix factorization with combination of cost functions

Babak Nasersharif, Sara Abdali

2015 The International Symposium on Artificial Intelligence and Signal Processing (AISP) > 107 - 111

2015 International Symposium on Artificial Intelligence and Signal Processing (AISP)

A solution for separating speech from music signal as a single channel source separation is Non-negative Matrix Factorization (NMF). In this approach spectrogram of each source signal is factorized as multiplication of two matrices which are known as basis and weight matrices. To achieve proper estimation of signal spectrogram, weight and basis matrices are updated iteratively. To estimate distance...

chapter

Factored language model adaptation using Dirichlet class language model for speech recognition

Ali Hatami, Ahmad Akbari, Babak Nasersharif

The 5th Conference on Information and Knowledge Technology > 438 - 442

2013 5th Conference on Information and Knowledge Technology (IKT)

Language model (LM) is essential for speech recognition systems. Efficiency of this model depends on its adaptation to the linguistic characteristics. According to this, adaptation methods attempt to use syntactic and semantic features for language modelling. The previous adaptation methods such as family of Dirichlet class language model (DCLM) exploit class of history words. These methods due to...

chapter

Two-microphone speech enhancement using a learned binary mask

Roohollah Abdipour, Ahmad Akbari, Mohsen Rahmani, Babak Nasersharif

20th Iranian Conference on Electrical Engineering (ICEE2012) > 1359 - 1362

2012 20th Iranian Conference on Electrical Engineering (ICEE)

Ideal binary mask speech enhancement is shown to increase the speech quality as well as speech intelligibility. But, this property depends highly on the accurate separation of speech and masker time-frequency units of the input spectrum, which is a difficult task in real situations. Ordinary binary mask methods are single-microphone methods and so, can obtain little information from the environment...

chapter

An evolutionary based discriminative system for keyword spotting

Shima Tabibian, Ahmad Akbari, Babak Nasersharif

2011 International Symposium on Artificial Intelligence and Signal Processing (AISP) > 83 - 88

2011 International Symposium on Artificial Intelligence and Signal Processing (AISP)

Keyword spotting refers to detection of all occurrences of any given word in a speech utterance. In this paper, we define the keyword spotting problem as a binary classification problem and propose a discriminative approach for solving it. Our approach exploits evolutionary algorithm to determine the separating hyper plane between two classes: class of sentences containing the target keywords and...

chapter

Robust speech recognition using spectral subtraction and temporal structure normalization

Naghmeh Moradi, Babak Nasersharif, Ahmad Akbari

2011 19th Iranian Conference on Electrical Engineering > 1 - 4

2011 19th Iranian Conference on Electrical Engineering (ICEE)

Filtering approaches in spectral domain and features domain have been shown their effectiveness for robust speech recognition. In this paper, we propose a two step filtering method. In the first step, spectral subtraction filter is applied to speech spectrum. In the second step, we design a temporal structure normalization filter in order to apply to features extracted from the filtered spectrum....

Filter options

Keywords:
SPEECH

Publication date

Set your own date range

Keywords

SPEECH RECOGNITION (5)
HIDDEN MARKOV MODELS (3)
FEATURE EXTRACTION (2)
NOISE MEASUREMENT (2)
NOISE REDUCTION (2)
ROBUSTNESS (2)
SIGNAL TO NOISE RATIO (2)
TRAINING (2)
ACCURACY (1)
ACOUSTICS (1)
ADAPTATION MODELS (1)
BINARY MASK (1)
BIOLOGICAL CELLS (1)
COMPUTATIONAL MODELING (1)
DBN (1)
DISCRIMINATIVE MODELS (1)
ENTROPY (1)
EVOLUTIONARY ALGORITHM (1)
EVOLUTIONARY COMPUTATION (1)
FACTORED LANGUAGE MODEL (1)
FINITE IMPULSE RESPONSE FILTER (1)
HISTORY (1)
ITAKURA-SAITO DIVERGENCE (1)
KEYWORD SPOTTING (1)
KULLBACK-LEIBLER DIVERGENCE (1)
LMFB (1)
MEL FREQUENCY CEPSTRAL COEFFICIENT (1)
MUSIC (1)
NOISE (1)
NON-NEGATIVE MATRIX FACTORIZATION (NMF) (1)
PART-OF-SPEECH (1)
PERPLEXITY (1)
ROBUST SPEECH RECOGNITION (1)
SINGLE CHANNEL SOURCE SEPARATION (1)
SPECTRAL SUBTRACTION (1)
SPEECH PROCESSING (1)
SPEECH/NOISE CLASSIFICATION (1)
TANDEM FEATURES (1)
TEMPORAL STRUCTURE NORMALIZATION (1)
TWO-MICROPHONE FEATURES (1)
VECTORS (1)
WORD ERROR RATE (1)
more

INFONA - science communication portal

Search results for: Babak Nasersharif

Mapping Mel sub-band energies using Deep belief network for robust speech recognition

Improved HMM entropy for robust sub-band speech recognition

Speech/music separation using non-negative matrix factorization with combination of cost functions

Factored language model adaptation using Dirichlet class language model for speech recognition

Two-microphone speech enhancement using a learned binary mask

An evolutionary based discriminative system for keyword spotting

Robust speech recognition using spectral subtraction and temporal structure normalization

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options