Search results

Items from 1 to 20 out of 23 results

chapter

Using k-Nearest Neighbor and Speaker Ranking for Phoneme Prediction

Muhammad Rizwan, David V. Anderson

2014 13th International Conference on Machine Learning and Applications > 383 - 387

2014 13th International Conference on Machine Learning and Applications (ICMLA)

Speech recognition systems are either based on parametric approach or non-parametric approach. Parametric based systems such as HMMs have been the dominant technology for speech recognition in the past decade. Despite a lot of advancements and enhancements in the design of these systems: key problems such as long term temporal dependence, etc. Has not yet been solved. Recently due to availability...

chapter

Efficient training of acoustic models for reverberation-robust medium-vocabulary automatic speech recognition

Armin Sehr, Hendrik Barfuss, Christian Hofmann, Roland Maas, more

2014 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA) > 177 - 181

2014 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA)

A recently proposed concept for training reverberation-robust acoustic models for automatic speech recognition using pairs of clean and reverberant data is extended from word models to tied-state triphone models in this paper. The key idea of the concept, termed ICEWIND, is to use the clean data for the temporal alignment and the reverberant data for the estimation of the emission densities. Experiments...

chapter

Analyzing Sequence Data Based on Conditional Random Fields with Co-training

Leilei Yang, Guiquan Liu, Qi Liu, Lei Zhang, more

2012 Eighth International Conference on Computational Intelligence and Security > 94 - 98

2012 Eighth International Conference on Computational Intelligence and Security (CIS)

Sequence data plays an important role in data analysis applications, such as sequence classification. One important aspect of sequence data analysis is to obtain the labeled sequence data and use a machine learning model to predict the sequence structures. Conditional Random Fields (CRF) is such a machine learning method which is popular used in sequential data analysis. This is because that CRF can...

chapter

A Pointwise Approach for Vietnamese Diacritics Restoration

Tuan Anh Luu, Kazuhide Yamamoto

2012 International Conference on Asian Language Processing > 189 - 192

2012 International Conference on Asian Language Processing (IALP)

The automatic insertion of diacritics in electronic texts is necessary for a number of languages, including French, Romanian, Croatian, Sindhi, Vietnamese, etc. When diacritics are removed from a word and the resulting string of characters is not a word, it is easy to recover the diacritics. However, sometimes the resulting string is also a word, possibly with different grammatical properties or a...

chapter

Using KL-divergence and multilingual information to improve ASR for under-resourced languages

David Imseng, Herve Bourlard, Philip N. Garner

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4869 - 4872

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

Setting out from the point of view that automatic speech recognition (ASR) ought to benefit from data in languages other than the target language, we propose a novel Kullback-Leibler (KL) divergence based method that is able to exploit multilingual information in the form of universal phoneme posterior probabilities conditioned on the acoustics. We formulate a means to train a recognizer on several...

chapter

A HMM-based method for anomaly detection

Fei Wang, Hongliang Zhu, Bin Tian, Yang Xin, more

2011 4th IEEE International Conference on Broadband Network and Multimedia Technology > 276 - 280

2011 4th IEEE International Conference on Broadband Network & Multimedia Technology (IC-BNMT 2011)

Intrusion-detection systems (IDSs) are essential tools for the security of computer systems. Anomaly detection, which uses knowledge about normal behaviors and attempts to detect intrusions by noting significant deviations, has been paid more and more attention. In this paper, we introduce a HMM-based method for anomaly detection. The proposed method is composed of two important stages: off-line training...

chapter

A method of Chinese organization named entities recognition based on statistical word frequency, part of speech and length

Xiying Yao

2011 4th IEEE International Conference on Broadband Network and Multimedia Technology > 637 - 641

2011 4th IEEE International Conference on Broadband Network & Multimedia Technology (IC-BNMT 2011)

We propose a recognition method based on statistics through analysis the grammatical and semantic characteristics of the Chinese organization name. This recognition method includes three elements: frequency, part of speech, word length. We use the data in mature collection as training data; separately calculate a candidate organization name's word frequency, part of speech and word length of the contribution...

chapter

A novel framework for anomaly detection based on hybrid HMM-SVM model

Hongliang Zhu, Yang Xin, Fei Wang

2011 4th IEEE International Conference on Broadband Network and Multimedia Technology > 670 - 674

2011 4th IEEE International Conference on Broadband Network & Multimedia Technology (IC-BNMT 2011)

Intrusion-detection systems (IDSs) are essential tools for the security of computer systems. Anomaly detection, which uses knowledge about normal behaviors and attempts to detect intrusions by noting significant deviations, has been paid more and more attention. In this paper, we introduce a novel framework for anomaly detection. In the proposed method, two widely used statistical learning method,...

chapter

Co-training for Handwritten Word Recognition

Volkmar Frinken, Andreas Fischer, Horst Bunke, Alicia Foornes

2011 International Conference on Document Analysis and Recognition > 314 - 318

2011 International Conference on Document Analysis and Recognition (ICDAR)

To cope with the tremendous variations of writing styles encountered between different individuals, unconstrained automatic handwriting recognition systems need to be trained on large sets of labeled data. Traditionally, the training data has to be labeled manually, which is a laborious and costly process. Semi-supervised learning techniques offer methods to utilize unlabeled data, which can be obtained...

chapter

Prediction of state of user's behavior using Hidden Markov Model in ubiquitous home network

Wonjoon Kang, Dongkyoo Shine, Doingil Shin

2010 IEEE International Conference on Industrial Engineering and Engineering Management > 1752 - 1756

2010 IEEE International Conference on Industrial Engineering & Engineering Management (IE&EM 2010)

In this paper, we used Hidden Markov prediction tools to predict the state of the behavior of users in a ubiquitous home network. The state of the user's behavior presents a change of interest in the action of the user. This paper proposes a weight (WEIGHT) for the level of interest in the behavior and the strength of the relation between the behavior and interest, which is the formulation of the...

chapter

Empirical evaluation of active sampling for CRF-based analysis of pages

Manabu Ohta, Ryohei Inoue, Atsuhiro Takasu

2010 IEEE International Conference on Information Reuse&Integration > 13 - 18

2010 IEEE International Conference on Information Reuse & Integration (IRI 2010)

We propose an automatic method of extracting bibliographies for academic articles scanned with OCR markup. The method uses conditional random fields (CRF) for labeling serially OCR-ed text lines on an article's title page as appropriate names for bibliographic elements. Although we achieved excellent extraction accuracies for some Japanese academic journals, we needed a substantial amount of training...

chapter

Unsupervised Learning of Stroke Tagger for Online Kanji Handwriting Recognition

Mathieu Blondel, Kazuhiro Seki, Kuniaki Uehara

2010 20th International Conference on Pattern Recognition > 1973 - 1976

2010 20th International Conference on Pattern Recognition (ICPR 2010)

Traditionally, HMM-based approaches to online Kanji handwriting recognition have relied on a hand-made dictionary, mapping characters to primitives such as strokes or substrokes. We present an unsupervised way to learn a stroke tagger from data, which we eventually use to automatically generate such a dictionary. In addition to not requiring a prior hand-made dictionary, our approach can improve the...

chapter

Improved AdaBoost Algorithm Using VQMAP for Speaker Identification

Haiyang Wu, Yong Lü, Zhenyang Wu

2010 International Conference on Electrical and Control Engineering > 1176 - 1179

2010 International Conference on Electrical and Control Engineering (ICECE 2010)

Adaptive boosting (AdaBoost) learning method can improve the performance of a base classifier by mining feature information in depth. But it is computationally expensive, and the base classifier without a suitable accuracy will cause over fitting. In this paper an improved Adaboost algorithm using maximum a posteriori vector quantization model (VQMAP) for speaker identification is presented. A suitable...

chapter

Information retrieval methods for automatic speech recognition

Xiaoqiang Xiao, J Droppo, A Acero

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 5550 - 5553

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

In this paper, we use information retrieval (IR) techniques to improve a speech recognition (ASR) system. The potential benefits include improved speed, accuracy, and scalability. Where conventional HMM-based speech recognition systems decode words directly, our IR-based system first decodes subword units. These are then mapped to a target word by the IR system. In this decoupled system, the IR serves...

chapter

Structured Prediction Models for Chord Transcription of Music Audio

A. Weller, D. Ellis, T. Jebara

2009 International Conference on Machine Learning and Applications > 590 - 595

Eighth International Conference on Machine Learning and Applications (ICMLA 2009)

Chord sequences are a compact and useful description of music, representing each beat or measure in terms of a likely distribution over individual notes without specifying the notes exactly. Transcribing music audio into chord sequences is essential for harmonic analysis, and would be an important component in content-based retrieval and indexing, but accuracy rates remain fairly low. In this paper,...

chapter

Chunker for Tamil

V. Dhanalakshmi, P. Padmavathy, K.M. Anand, K.P. Soman, more

2009 International Conference on Advances in Recent Technologies in Communication and Computing > 436 - 438

2009 International Conference on Advances in Recent Technologies in Communication and Computing. ARTCom 2009

This paper presents the chunker for Tamil using Machine learning techniques. Chunking is the task of identifying and segmenting the text into syntactically correlated word groups. The chunking is done by the machine learning techniques, where the linguistical knowledge is automatically extracted from the annotated corpus. We have developed our own tagset for annotating the corpus, which is used for...

chapter

Effect of gaussian densities and amount of training data on grapheme-based acoustic modeling for Arabic

M. Elmahdy, R. Gruhn, W. Minker, S. Abdennadher

2009 International Conference on Natural Language Processing and Knowledge Engineering > 1 - 5

2009 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE)

Grapheme-based acoustic modeling for Arabic is a demanding research area since high phonetic transcription accuracy is not yet solved completely. In this paper, we are studying the use of a pure grapheme-based approach using Gaussian mixture model to implicitly model missing diacritics and investigating the effect of Gaussian densities and amount of training data on speech recognition accuracy. Two...

chapter

Data sampling based ensemble acoustic modelling

Xin Chen, Yunxin Zhao

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 3805 - 3808

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

In this paper, we propose a novel technique of using cross validation (CV) data sampling to construct an ensemble of acoustic models for conversational speech recognition. We further propose using hierarchical Gaussian mixture model (HGMM) and repartition training data to increase the ensemble size and diversity. The proposed methods are found to work well together for ensemble acoustic modeling....

chapter

A novel risk assessment system for port state control inspection

Zhong Gao, Guanming Lu, Mengjue Liu, Meng Cui

2008 IEEE International Conference on Intelligence and Security Informatics > 242 - 244

2008 IEEE International Conference on Intelligence and Security Informatics (ISI 2008)

Port state control (PSC) inspection is the most important mechanism to ensure world marine safe. Recently, some SVM-based risk assessment systems have been presented in the world. They estimate the risk of each candidate ship based on its generic factors and history inspection factors to select high-risk one before conducting on-board PSC inspection. However, how to improve the performance of the...

chapter

Pronunciation Recognition and Assessment for Mandarin Chinese

Cencen Zhong, Zhenjiang Miao

2008 Congress on Image and Signal Processing > 5 > 352 - 356

International Congress on Image and Signal Processing (CISP 2008)

This paper establishes a speaker-independent pronunciation recognition and assessment system with 673 words for mandarin Chinese under the background of a Chinese learning system framework. The recognition part is based on HTK using HMM (Hidden Markov Models) and improved in the aspect of acoustic model. Making use of the recognition results and the log-likelihood obtained from the Viterbi coding,...

Keywords:
TRAINING
ACCURACY
HIDDEN MARKOV MODELS
TRAINING DATA

Publication date

Set your own date range

Keywords

SPEECH RECOGNITION (9)
DATA MODELS (7)
DATA MINING (6)
SPEECH (6)
ACOUSTICS (5)
CLASSIFICATION ALGORITHMS (5)
COMPUTATIONAL MODELING (5)
FEATURE EXTRACTION (5)
HIDDEN MARKOV MODEL (5)
HMM (5)
MACHINE LEARNING (5)
TESTING (5)
LEARNING (ARTIFICIAL INTELLIGENCE) (4)
SIGNAL PROCESSING (4)
SUPPORT VECTOR MACHINES (4)
ARTIFICIAL NEURAL NETWORKS (3)
CLASSIFICATION (3)
COMPUTERS (3)
CONFERENCES (3)
DATABASES (3)
GAIN (3)
GAUSSIAN PROCESSES (3)
HANDWRITING RECOGNITION (3)
MULTIMEDIA COMMUNICATION (3)
MULTIMEDIA SYSTEMS (3)
PATTERN RECOGNITION (3)
ROBUSTNESS (3)
SUPPORT VECTOR MACHINE CLASSIFICATION (3)
VECTORS (3)
VITERBI ALGORITHM (3)
ACOUSTIC MODELING (2)
ALGORITHM DESIGN AND ANALYSIS (2)
ANOMALY DETECTION (2)
ARTIFICIAL INTELLIGENCE (2)
BUILDINGS (2)
CAMERAS (2)
CHARACTER RECOGNITION (2)
CO-TRAINING (2)
COMPLEXITY THEORY (2)
COMPUTER VISION (2)
CONDITIONAL RANDOM FIELDS (2)
CORRELATION (2)
COVARIANCE MATRIX (2)
DECODING (2)
DICTIONARIES (2)
EIGENVALUES AND EIGENFUNCTIONS (2)
ELECTRONIC MAIL (2)
FACE RECOGNITION (2)
IMAGE COLOR ANALYSIS (2)
IMAGE SEGMENTATION (2)
INFORMATION RETRIEVAL (2)
INFORMATION SCIENCE (2)
INTERNET (2)
K-NEAREST NEIGHBOR (2)
LEARNING SYSTEMS (2)
MEDIA (2)
MEL FREQUENCY CEPSTRAL COEFFICIENT (2)
NATURAL LANGUAGE PROCESSING (2)
NOISE (2)
ORGANIZATIONS (2)
PRINCIPAL COMPONENT ANALYSIS (2)
REGISTERS (2)
SHAPE (2)
SOFTWARE (2)
SPEAKER RECOGNITION (2)
SPEECH PROCESSING (2)
SUPPORT VECTOR MACHINE (2)
TELECOMMUNICATIONS (2)
WRITING (2)
ACADEMIC LIBRARIES (1)
ACCELERATION (1)
ACOUSTIC FEATURES (1)
ACOUSTIC MEASUREMENTS (1)
ACOUSTIC SIGNAL PROCESSING (1)
ACTIVE SAMPLING (1)
ADABOOST (1)
ADAPTATION MODEL (1)
ADAPTIVE BOOSTING LEARNING METHOD (1)
AEROSPACE ELECTRONICS (1)
ANALYTICAL MODELS (1)
ANNOTATED CORPUS (1)
ARABIC LANGUAGE (1)
AUDIO CODING (1)
AUTOMATIC DIACRITIC RESTORATION (1)
AUTOMATIC SPEECH RECOGNITION (1)
BAG OF WORDS (1)
BAG-OF-WORDS (1)
BASE CLASSIFIER (1)
BIBLIOGRAPHIC ELEMENTS (1)
BIBLIOGRAPHIES (1)
BIBLIOGRAPHIES EXTRACTION (1)
BIBLIOGRAPHY EXTRACTION (1)
BISMUTH (1)
BLSTM NN (1)
BOOKS (1)
CANDIDATE SHIP (1)
more

INFONA - science communication portal

Search results

Using k-Nearest Neighbor and Speaker Ranking for Phoneme Prediction

Efficient training of acoustic models for reverberation-robust medium-vocabulary automatic speech recognition

Analyzing Sequence Data Based on Conditional Random Fields with Co-training

A Pointwise Approach for Vietnamese Diacritics Restoration

Using KL-divergence and multilingual information to improve ASR for under-resourced languages

A HMM-based method for anomaly detection

A method of Chinese organization named entities recognition based on statistical word frequency, part of speech and length

A novel framework for anomaly detection based on hybrid HMM-SVM model

Co-training for Handwritten Word Recognition

Prediction of state of user's behavior using Hidden Markov Model in ubiquitous home network

Empirical evaluation of active sampling for CRF-based analysis of pages

Unsupervised Learning of Stroke Tagger for Online Kanji Handwriting Recognition

Improved AdaBoost Algorithm Using VQMAP for Speaker Identification

Information retrieval methods for automatic speech recognition

Structured Prediction Models for Chord Transcription of Music Audio

Chunker for Tamil

Effect of gaussian densities and amount of training data on grapheme-based acoustic modeling for Arabic

Data sampling based ensemble acoustic modelling

A novel risk assessment system for port state control inspection

Pronunciation Recognition and Assessment for Mandarin Chinese

Filter options

Publication date

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options