Search results

Items from 1 to 8 out of 8 results

chapter

Improving keyword detection rate using a set of rules to merge HMM-based and SVM-based keyword spotting results

Akram Shokri, Mohammad Hossein Davarpour, Ahmad Akbari

2014 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 1715 - 1718

2014 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

Evaluating the accuracy of HMM-based and SVM-based spotters in detecting keywords and recognizing the true place of keyword occurrence shows that the HMM-based spotter detects the place of occurrence more precisely than the SVM-based spotter. On the other hand, the SVM-based spotter performs much better in detecting

chapter

A Hybrid Method of Chinese Prosodic Word Tagging Based on Keyword Anchor and Hidden Markov Model

Zhou Quan, Deng Pan, Liu Hongjian, Guo Defeng, more

2009 International Conference on Asian Language Processing > 71 - 75

2009 International Conference on Asian Language Processing (IALP 2009)

In this paper, a new method of Chinese prosodic word tagging is presented. This method consists of a rule-based algorithm named ??keyword anchor?? and a statistical algorithm based on hidden Markov model (HMM). For keyword anchor algorithm, an anchor of the prosodic word is defined to help the system to find the whole

chapter

Confidence measure improvement using useful predictor features and support vector machines

Yasser Shekofteh, Jahanshah Kabudian, Mohammad Mohsen Goodarzi, Iman Sarraf Rezaei

20th Iranian Conference on Electrical Engineering (ICEE2012) > 1168 - 1171

2012 20th Iranian Conference on Electrical Engineering (ICEE)

In traditional keyword spotting (KWS) systems, confidence measure (CM) of each keyword is computed from normalized acoustic likelihoods. In addition to likelihood based scores, some keyword dependent features named predictor features such as duration and prosodic features could be defined to improve the performance of

chapter

Spectrographic seam patterns for discriminative word spotting

Shubhranshu Barnwal, Kamal Sahni, Rita Singh, Bhiksha Raj

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4725 - 4728

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

This paper presents a novel method for deriving patterns for classification of speech sounds. In contrast to conventional methods that attempt to capture time-frequency patterns as represented by spectral envelopes or peaks, our method captures patterns of high-energy tracks, or seams, of maximum “whiteness” across frequency in spectrograms. Our hypothesis is that these seams could potentially carry...

chapter

A hybrid phonotactic language identification system with an SVM back-end for simultaneous lecture translation

Michael Heck, Sebastian Stuker, Alex Waibel

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4857 - 4860

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

actual language identification. On our bi-lingual lecture tasks the PPRLM system clearly outperforms the PPR system in various segment length conditions, however at the cost of slower run-time. By using lexical information in the form of keyword spotting, and additional language models we show ways to improve the

chapter

ICSI'S 2005 speaker recognition system

N. Mirghafori, A.O. Hatch, S. Stafford, K. Boakye, more

IEEE Workshop on Automatic Speech Recognition and Understanding, 2005. > 23 - 28

2005 IEEE Workshop on Automatic Speech Recognition and Understanding

This paper describes ICSI's 2005 speaker recognition system, which was one of the top performing systems in the NIST 2005 speaker recognition evaluation. The system is a combination of four sub-systems: 1) a keyword conditional HMM system, 2) an SVM-based lattice phone n-gram system, 3) a sequential nonparametric

chapter

Cross-Media Image Retrieval via Latent Semantic Indexing and Mixed Bagging

Jing Guo, Xianjun Liao

2009 WRI World Congress on Computer Science and Information Engineering > 4 > 187 - 193

2009 WRI World Congress on Computer Science and Information Engineering, CSIE

Semantic image retrieval using text such keywords or captions at different semantic levels has attracted considerable research attention in recent years. Automatic image annotation (AIA) has been proved to be an effective and promising solution to automatically deduce the high-level semantics from low-level visual

chapter

Multi-modal information fusion for news story segmentation in broadcast video

Bailan Feng, Peng Ding, Jiansong Chen, Jinfeng Bai, more

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1417 - 1420

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

fusion (MMIF) strategy. Compared with traditional methods, the proposed scheme extracts a wealth of semantic-level features including anchor person, topic caption, face, silence, acoustic change, audio keywords and textual content. Parallel to this, we make use of a multi-modal information fusion strategy for news story

Filter options

Keywords:
HIDDEN MARKOV MODELS
SUPPORT VECTOR MACHINES

Publication date

Set your own date range

Content availability

Available (7)
None (1)

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options