Search results

Items from 1 to 20 out of 41 results

chapter

3D human behavior recognition based on spatiotemporal texture features

Chunxiao Fan, Lei Tian, Guangchao Wang, Yue Ming, more

2015 8th International Conference on Human System Interaction (HSI) > 350 - 356

2015 8th International Conference on Human System Interactions (HSI)

Nowadays, more and more activity recognition algorithms begin to improve recognition performance by combining the RGB and depth information. Although, the space-time volumes (STV) algorithm and the space-time local features algorithm can combine the RGB and depth information effectively, they also have their own defects. Such as they need expensive computational cost and they are not suitable for...

chapter

Noise level classification for EEG using Hidden Markov Models

Sherif Haggag, Shady Mohamed, Asim Bhatti, Hussein Haggag, more

2015 10th System of Systems Engineering Conference (SoSE) > 439 - 444

2015 10th System of Systems Engineering Conference (SoSE)

EEG signal is one of the most important signals for diagnosing some diseases. EEG is always recorded with an amount of noise, the more noise is recorded the less quality is the EEG signal. The included noise can represent the quality of the recorded EEG signal, this paper proposes a signal quality assessment method for EEG signal. The method generates an automated measure to detect the noise level...

chapter

A unified probabilistic framework for robust decoding of linear barcodes

Umut Simsekli, Tolga Birdal

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1946 - 1950

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Both consumer market and manufacturing industry makes heavy use of 1D (linear) barcodes. From helping the visually impaired to identifying the products to industrial automated industry management, barcodes are the prevalent source of item tracing technology. Because of this ubiquitous use, in recent years, many algorithms have been proposed targeting barcode decoding from high-accessibility devices...

chapter

Online handwriting Farsi character and number recognition based on hand movement direction using Hidden Markov Models

Zohre Sadrnezhad, Atefeh Nekouie, Majid Vafaei Jahan

2014 International Congress on Technology, Communication and Knowledge (ICTCK) > 1 - 6

2014 International Congress on Technology, Communication and Knowledge (ICTCK)

Online handwriting recognition has many applications and the recognition with high accuracy is essential. In this paper, we introduce a method for online handwriting Farsi character and number recognition using Hidden Markov Models (HMM). First we recognize handwriting direction then we get some statistical and formatting features. The letters are classified by means of these features and then we...

chapter

A fuzzy clustering algorithm via enhanced spatially constraint for brain MR image segmentation

Zexuan Ji, Jinyao Liu, Guannan Li

2014 International Conference on Orange Technologies > 105 - 108

2014 IEEE International Conference on Orange Technologies (ICOT)

Fuzzy clustering has been extensively used in brain magnetic resonance (MR) image segmentation. However, due to the existence of noise and intensity inhomogeneity, many segmentation algorithms suffer from limited accuracy. In this paper, we propose a fuzzy clustering algorithm via enhanced spatially constraint for brain MR image segmentation. A novel spatial factor is proposed by incorporating the...

chapter

Performance of a hierarchical temporal memory network in noisy sequence learning

Daniel E. Padilla, Russell Brinkworth, Mark D. McDonnell

2013 IEEE International Conference on Computational Intelligence and Cybernetics (CYBERNETICSCOM) > 45 - 51

2013 IEEE International Conference on Computational Intelligence and Cybernetics (CYBERNETICSCOM)

As neurobiological evidence points to the neocortex as the brain region mainly involved in high-level cognitive functions, an innovative model of neocortical information processing has been recently proposed. Based on a simplified model of a neocortical neuron, and inspired by experimental evidence of neocortical organisation, the Hierarchical Temporal Memory (HTM) model attempts at understanding...

chapter

Security in the cloud based systems: Structure and breaches

Vivek Shandilya, Sajjan Shiva

2013 IEEE Third International Conference on Information Science and Technology (ICIST) > 542 - 547

2013 IEEE Third International Conference on Information Science and Technology (ICIST)

Cloud based systems(CBSs) are increasing in the computing world. These systems derive their complexity due to both the disparate components and the diverse stake holders involved in them. The component wise security alone does not solve the problem of securing CBSs, but the stakeholder's computational space spanning across many components of the CBS, needs to be secured too. There have been initial...

chapter

Fourier-Bessel cepstral coefficients for robust speech recognition

Chetana Prakash, Suryakanth V. Gangashetty

2012 International Conference on Signal Processing and Communications (SPCOM) > 1 - 5

2012 International Conference on Signal Processing and Communications (SPCOM)

In this paper we propose Fourier-Bessel cepstral coefficients (FBCC) features for robust speech recognition. The Fourier-Bessel representation of the speech signal is obtained using Bessel function as a basis set. The FBCC are extracted from zero^th order Bessel coefficients taking into account of the perceptual characteristics of human auditory system. Recognition accuracy is measured using the CMU...

chapter

Cepstral analysis based on the glimpse proportion measure for improving the intelligibility of HMM-based synthetic speech in noise

Cassia Valentini-Botinhao, Ranniery Maia, Junichi Yamagishi, Simon King, more

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 3997 - 4000

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

In this paper we introduce a new cepstral coefficient extraction method based on an intelligibility measure for speech in noise, the Glimpse Proportion measure. This new method aims to increase the intelligibility of speech in noise by modifying the clean speech, and has applications in scenarios such as public announcement and car navigation systems. We first explain how the Glimpse Proportion measure...

chapter

Fine-tuning HMMS for nonverbal vocalizations in spontaneous speech: A multicorpus perspective

Dmytro Prylipko, Bjorn Schuller, Andreas Wendemuth

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4625 - 4628

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

Phenomena like filled pauses, laughter, breathing, hesitation, etc. play significant role in everyday human-to-human conversation and have a significant influence on speech recognition accuracy [1]. Because of their nature (e. g. long duration), they should be modeled with different number of emitting states and Gaussian mixtures. In this paper we address this question and try to determine the most...

chapter

Improvements to VTS feature enhancement

Jinyu Li, Michael L. Seltzer, Yifan Gong

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4677 - 4680

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

By explicitly modelling the distortion of speech signals, model adaptation based on vector Taylor series (VTS) approaches have been shown to significantly improve the robustness of speech recognizers to environmental noise. However, the computational cost of VTS model adaptation (MVTS) methods hinders them from being widely used because they need to adapt all the HMM parameters for every utterance...

chapter

Combining eigenvoice speaker modeling and VTS-based environment compensation for robust speech recognition

Zhijian Ou, Kan Deng

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4673 - 4676

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

Eigenvoice and vector Taylor series (VTS) are good models for speaker differences and environmental variations separately. However, speaker and environmental variation always coexist in real-world speech. In this paper, we propose to combine eigenvoice and VTS. Specifically, we introduce eigenvoice speaker modeling for the clean speech into VTS's nonlinear mismatch function. In contrast, the standard...

chapter

A Fast Alignment Scheme for Automatic OCR Evaluation of Books

Ismet Zeki Yalniz, R. Manmatha

2011 International Conference on Document Analysis and Recognition > 754 - 758

2011 International Conference on Document Analysis and Recognition (ICDAR)

This paper aims to evaluate the accuracy of optical character recognition (OCR) systems on real scanned books. The ground truth e-texts are obtained from the Project Gutenberg website and aligned with their corresponding OCR output using a fast recursive text alignment scheme (RETAS). First, unique words in the vocabulary of the book are aligned with unique words in the OCR output. This process is...

chapter

A robust Hidden Markov Model based clustering algorithm

Shitong Yao

2011 6th IEEE Joint International Information Technology and Artificial Intelligence Conference > 2 > 259 - 264

2011 6th IEEE Joint International Information Technology and Artificial Intelligence Conference (ITAIC)

Hidden Markov models (HMMs) are widely employed in sequential data modeling both because they are capable of handling multivariate data of varying length, and because they capture the underlying hidden properties of time-series. Over the years, HMM-based clustering methods have been widely investigated and improved. However, their performance on noisy data and the effectiveness of similarity measure...

chapter

Accelerometer-based swinging gesture detection for an electronic handbell

Liyanaarachchi Lekamalage Chamara Kasun, Wooi-Boon Goh

2011 IEEE 15th International Symposium on Consumer Electronics (ISCE) > 272 - 277

2011 IEEE 15th International Symposium on Consumer Electronics - (ISCE 2011)

This paper tackles the problem of detecting the swinging action of an electronic handbell. It describes a threshold-based algorithm that is able to detect an orientation-free swinging motion using only the X and Y axis signals of an accelerometer that is mounted at the end of a handle. Equations governing the accelerations of the accelerometer are defined. The equations are used to select the appropriate...

chapter

Acoustic features for detection of aspirated stops

V Patil, P Rao

2011 National Conference on Communications (NCC) > 1 - 5

2011 National Conference on Communications (NCC)

Aspiration is an important phonemic feature in several Indian languages. Unlike English, languages such as Marathi have lexicons in which words with different meanings differ only in the aspiration feature of the initial voiced or unvoiced stop. Thus the reliable discrimination of aspirated stops from their unaspirated counterparts is important in automatic speech recognition for such languages. The...

article

Early fusion of Sparse Classification and GMM for noise robust ASR

Yang Sun, Jort F. Gemmeke, Bert Cranen, Louis ten Bosch, more

02011 00019th European Signal Processing Conference > 2011 > 1495 - 1499

2011 19th European Signal Processing Conference

In previous work we have shown that an ASR system consisting of a dual-input DBN which simultaneously observes MFCC acoustic features and predicted phone labels that are generated by an exemplar-based Sparse Classification (SC) system can achieve better word recognition accuracies in noise than a system observing only one of those input streams. This paper explores two modifications of the SC input...

chapter

Multi-Class Classification Using a New Sigmoid Loss Function for Minimum Classification Error (MCE)

M V Ratnagiri, L Rabiner, Biing-Hwang Juang

2010 Ninth International Conference on Machine Learning and Applications > 84 - 89

2010 Ninth International Conference on Machine Learning and Applications (ICMLA 2010)

A new loss function has been introduced for Minimum Classification Error, that approaches optimal Bayes' risk and also gives an improvement in performance over standard MCE systems when evaluated on the Aurora connected digits database.

chapter

Performance evaluation of MLPC and MFCC for HMM based noisy speech recognition

M Rahman, M B I Islam

2010 13th International Conference on Computer and Information Technology (ICCIT) > 273 - 276

13th International Conference on Computer and Information Technology (ICCIT 2010)

In this paper auditory like features MLPC and MFCC have been used as front-end and their performance has been evaluated on Aurora-2 database for Hidden Markov Model (HMM) based noisy speech recognition. The clean data set is used for training and test set A is used to examine the performance. It has been found that almost the same recognition performance has been obtained both for MLPC and MFCC and...

chapter

New robust speech recognition using DTW in noise

Zhang Yuxin, Y Miyanaga, C Siriteanu

2010 10th International Symposium on Communications and Information Technologies > 34 - 38

2010 10th International Symposium on Communications and Information Technologies (ISCIT 2010)

This paper proposes a new robust speech recognition method. Since the hidden Markov model (HMM) algorithm need a lot of training calculation, The dynamic time warping (DTW) algorithm based on median filter is used instead in our system. According to the short-term energy method, the non-speech segment can be removed. Recognition accuracy is thus improved. The cepstral mean subtraction (CMS), running...

Data set:
ieee
Keywords:
ACCURACY
NOISE
HIDDEN MARKOV MODELS

Publication date

Set your own date range

INFONA - science communication portal

Search results

3D human behavior recognition based on spatiotemporal texture features

Noise level classification for EEG using Hidden Markov Models

A unified probabilistic framework for robust decoding of linear barcodes

Online handwriting Farsi character and number recognition based on hand movement direction using Hidden Markov Models

A fuzzy clustering algorithm via enhanced spatially constraint for brain MR image segmentation

Performance of a hierarchical temporal memory network in noisy sequence learning

Security in the cloud based systems: Structure and breaches

Fourier-Bessel cepstral coefficients for robust speech recognition

Cepstral analysis based on the glimpse proportion measure for improving the intelligibility of HMM-based synthetic speech in noise

Fine-tuning HMMS for nonverbal vocalizations in spontaneous speech: A multicorpus perspective

Improvements to VTS feature enhancement

Combining eigenvoice speaker modeling and VTS-based environment compensation for robust speech recognition

A Fast Alignment Scheme for Automatic OCR Evaluation of Books

A robust Hidden Markov Model based clustering algorithm

Accelerometer-based swinging gesture detection for an electronic handbell

Acoustic features for detection of aspirated stops

Early fusion of Sparse Classification and GMM for noise robust ASR

Multi-Class Classification Using a New Sigmoid Loss Function for Minimum Classification Error (MCE)

Performance evaluation of MLPC and MFCC for HMM based noisy speech recognition

New robust speech recognition using DTW in noise

Filter options

Publication date

Content availability

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options