Search results for: Hardik B. Sailor

Items from 1 to 10 out of 10 results

chapter

Novel TEO-based Gammatone features for environmental sound classification

Dharmesh M. Agrawal, Hardik B. Sailor, Meet H. Soni, Hemant A. Patil

2017 25th European Signal Processing Conference (EUSIPCO) > 1809 - 1813

2017 25th European Signal Processing Conference (EUSIPCO)

In this paper, we propose to use modified Gammatone filterbank with Teager Energy Operator (TEO) for environmental sound classification (ESC) task. TEO can track energy as a function of both amplitude and frequency of an audio signal. TEO is better for capturing energy variations in the signal that is produced by a real physical system, such as, environmental sounds that contain amplitude and frequency...

article

Novel Unsupervised Auditory Filterbank Learning Using Convolutional RBM for Speech Recognition

Hardik B. Sailor, Hemant A. Patil

IEEE/ACM Transactions on Audio, Speech, and Language Processing > 2016 > 24 > 12 > 2341 - 2353

To learn auditory filterbanks, recently, we have proposed an unsupervised learning model based on convolutional restricted Boltzmann machine (RBM) with rectified linear units. In this paper, theory, training algorithm of our proposed model, and detailed analysis of learned filterbank are being presented. Learning of the model with different databases shows that the model is able to learn cochlear-like...

chapter

Unsupervised learning of temporal receptive fields using convolutional RBM for ASR task

Hardik B. Sailor, Hemant A. Patil

2016 24th European Signal Processing Conference (EUSIPCO) > 873 - 877

2016 24th European Signal Processing Conference (EUSIPCO)

There has been a significant research attention for unsupervised representation learning to learn the features for speech processing applications. In this paper, we investigate unsupervised representation learning using Convolutional Restricted Boltzmann Machine (ConvRBM) with rectified units for speech recognition task. Temporal modulation representation is learned using log Mel-spectrogram as an...

chapter

Analysis of hierarchical bottleneck framework for improved phoneme recognition

Mohammadi Zaki, Hardik B. Sailor, Hemant A. Patil

2016 International Conference on Signal Processing and Communications (SPCOM) > 1 - 5

2016 International Conference on Signal Processing and Communications (SPCOM)

In this paper, an attempt is made to examine and evaluate the effect of bottleneck and the hierarchical bottleneck (HBN) framework in MLP-based Automatic Speech Recognition (ASR) systems. In particular, the bottleneck and hierarchical bottleneck framework are analyzed using Volterra series. Experiments on several architectures with incorporation of systematic hierarchical and bottleneck properties...

chapter

Filterbank learning using Convolutional Restricted Boltzmann Machine for speech recognition

Hardik B. Sailor, Hemant A. Patil

2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5895 - 5899

ICASSP 2016 - 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Convolutional Restricted Boltzmann Machine (ConvRBM) as a model for speech signal is presented in this paper. We have developed ConvRBM with sampling from noisy rectified linear units (NReLUs). ConvRBM is trained in an unsupervised way to model speech signal of arbitrary lengths. Weights of the model can represent an auditory-like filterbank. Our proposed learned filterbank is also nonlinear with...

chapter

Spectro-temporal analysis of HIE and asthma infant cries using auditory spectrogram

Anshu Chittora, Hemant A. Patil, Hardik B. Sailor

2015 International Conference on BioSignal Analysis, Processing and Systems (ICBAPS) > 145 - 150

2015 International Conference on BioSignal Analysis, Processing and Systems (ICBAPS)

In this paper, auditory spectrogram is proposed for analysis of HIE and asthma infant cries. Auditory spectrogram represents a 2-dimensional (i.e., 2-D) pattern of neural activity, distributed along a logarithmic frequency-axis. Features are derived from the auditory spectrograms of each class. These features are then used to train support vector machine (SVM) classifier. Effectiveness of the proposed...

chapter

Deterministic annealing EM algorithm for developing TTS system in Gujarati

Nirmesh J. Shah, Hemant A. Patil, Maulik C. Madhavi, Hardik B. Sailor, more

The 9th International Symposium on Chinese Spoken Language Processing > 526 - 530

2014 9th International Symposium on Chinese Spoken Language Processing (ISCSLP)

The generalized statistical framework of Hidden Markov Model (HMM) has been successfully applied from the field of speech recognition to speech synthesis. In this work, we have applied HMM-based Speech Synthesis System (HTS) method to Gujarati language. Adaption and evaluation of HTS for Gujarati language has been done here. Evaluation of HTS system built using Gujarati data is done in terms of naturalness...

chapter

Fusion of magnitude and phase-based features for objective evaluation of TTS voice

Hardik B. Sailor, Hemant A. Patil

The 9th International Symposium on Chinese Spoken Language Processing > 521 - 525

2014 9th International Symposium on Chinese Spoken Language Processing (ISCSLP)

This paper analyzes the distance-based objective measures for evaluation of Text-to-Speech (TTS) systems (which is generally used objective measures). In this paper, we discuss some aspects of evaluation of speech quality of synthesized speech. Some of the limitations and issues of subjective evaluation are discussed and importance of objective measures is presented. Traditional objective measure...

chapter

Effectiveness of PLP-based phonetic segmentation for speech synthesis

Nirmesh J. Shah, Bhavik B. Vachhani, Hardik B. Sailor, Hemant A. Patil

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 270 - 274

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, use of Viterbi-based algorithm and spectral transition measure (STM)-based algorithm for the task of speech data labeling is being attempted. In the STM framework, we propose use of several spectral features such as recently proposed cochlear filter cepstral coefficients (CFCC), perceptual linear prediction cepstral coefficients (PLPCC) and RelAtive SpecTrAl (RASTA)-based PLPCC in addition...

chapter

A syllable-based framework for unit selection synthesis in 13 Indian languages

Hemant A Patil, Tanvina B Patel, Nirmesh J Shah, Hardik B Sailor, more

2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE) > 1 - 8

2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE)

In this paper, we discuss a consortium effort on building text to speech (TTS) systems for 13 Indian languages. There are about 1652 Indian languages. A unified framework is therefore attempted required for building TTSes for Indian languages. As Indian languages are syllable-timed, a syllable-based framework is developed. As quality of speech synthesis is of paramount interest, unit-selection synthesizers...

Filter options

Publication date

Set your own date range

Publication type

book (9)
article (1)

Keywords

MEL FREQUENCY CEPSTRAL COEFFICIENT (5)
FEATURE EXTRACTION (4)
CONVOLUTION (3)
DATABASES (3)
SPEECH (3)
CONVOLUTIONAL RBM (2)
FILTERBANK (2)
HIDDEN MARKOV MODELS (2)
SPEECH RECOGNITION (2)
AUDITORY PROCESSING (1)
AUDITORY SPECTROGRAM (1)
BOTTLENECK (1)
COCHLEAR MODEL (1)
CONVRBM (1)
CORRELATION COEFFICIENT (1)
DECODING (1)
DEEP NEURAL NETWORKS (1)
DETERMINISTIC ANNEALING EXPECTATION-MAXIMIZATION (DAEM) (1)
DYNAMIC TIME WRAPPING (DTW) (1)
EXPECTATION-MAXIMIZATION (1)
FREQUENCY MODULATION (1)
HBN (1)
HIDDEN MARKOV MODEL (HMM) (1)
HIERARCHICAL MULTILAYER PERCEPTRON (1)
HMM (1)
INDIAN LANGUAGES (1)
LABELING (1)
MATHEMATICAL MODEL (1)
MFCC (1)
MODIFIED GROUP DELAY (1)
MODULATION (1)
OBJECTIVE MEASURES (1)
PATHOLOGY (1)
PEDIATRICS (1)
PLPCC (1)
POOLING (1)
PRONUNCIATION DICTIONARY (1)
RECORDING (1)
RECTIFIED LINEAR UNITS (1)
SPEAKER SELECTION (1)
SPECTRAL TRANSITION MEASURE (STM) (1)
SPECTROGRAM (1)
SPEECH PROCESSING (1)
SUBBAND FILTERS (1)
SUPPORT VECTOR MACHINE (SVM) CLASSIFIER (1)
SUPPORT VECTOR MACHINES (1)
TANDEM (1)
TEMPORAL MODULATIONS (1)
TEXT OPTIMIZATION (1)
TEXT-TO-SPEECH (TTS) (1)
TIME-FREQUENCY ANALYSIS (1)
TRAINING (1)
more

INFONA - science communication portal

Search results for: Hardik B. Sailor

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options