Search results

Items from 1 to 5 out of 5 results

chapter

Open-vocabulary keyword detection from super-large scale speech database

N. Kanda, H. Sagawa, T. Sumiyoshi, Y. Obuchi

2008 IEEE 10th Workshop on Multimedia Signal Processing > 939 - 944

2008 IEEE 10th Workshop on Multimedia Signal Processing (MMSP)

This paper presents our recent attempt to build a super-large scale spoken-term detection system, which can detect any keyword uttered in a 2,000-hour speech database within a few seconds. Three problems must be solved to achieve such a system. The system must be able to detect out-of-vocabulary (OOV) terms (OOV problem

chapter

Context modeling using RNN for keyword detection

J. Alvarez-Cercadillo, J. Ortega-Garcia, L.A. Hernandez-Gomez

1993 IEEE International Conference on Acoustics, Speech, and Signal Processing > 1 > 569 - 572 vol.1

Proceedings of ICASSP '93

The authors present some experiments that show the capabilities of using recurrent neural networks (RNNs) in conjunction with hidden Markov models (HMMs) in the context of keyword spotting (KWS): the automatic recognition of a small set of keywords as they occur in unconstrained speech and/or noise. KWS is usually

chapter

Keyword Spotting Based on Syllable Confusion Network

Pengyuan Zhang, Jian Shao, Qingwei Zhao, Yonghong Yan

Third International Conference on Natural Computation (ICNC 2007) > 2 > 656 - 659

2007 3rd International Conference on Natural Computation

Keyword spotting has become a very important branch of speech recognition, but the acoustic mismatch between training and testing environments often causes a severe degradation in recognition performance. This paper presents an improved keyword spotting strategy. A fuzzy search algorithm is proposed to extract

chapter

Using textual information from LVCSR transcripts for phonetic-based spoken term detection

C. Dubois, D. Charlet

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 4961 - 4964

ICASSP 2008. IEEE International Conference on Acoustics, Speech and Signal Processing

accurately locate the occurrences of a list of keywords in a broadcast corpus. Textual information from the transcripts and an efficient rescoring scheme are used to improve the performance of the phonetic search. Our experiments show that the proposed method outperforms the baseline textual and phonetic searches by its ability

chapter

Fast Vocabulary-Independent Audio Search Based on Syllable Confusion Network Indexing in Mandarin Spontaneous Speech

Jian Shao, Pengyuan Zhang, Zhaojie Liu, Qingwei Zhao, et al.

2007 Second International Conference on Digital Telecommunications (ICDT'07) > 8

Second International Conference on Digital Telecommunications, ICDT 2007

Experiments carried out on conversational corpora for the keyword spotting task in the Chinese 2005 863 Evaluation show that this method can not only yield highly compact SCN lattices with a syllable graph density (SGD) of 3.83, but also achieve an equal error rate (EER) of 32.45%, which is about a 33% relative reduction when

Filter options

Keywords:
SEARCH PROBLEMS
SPEECH RECOGNITION

INFONA - science communication portal
