Search results

Items from 1 to 13 out of 13 results

chapter

Performance comparison of several techniques to detect keywords in audio streams and audio scene

Marek Bohac

Proceedings ELMAR-2012 > 215 - 218

2012 54th International Symposium ELMAR

This paper is focused on the task of detecting words of interest in an audio scene (a room, a lab or a workshop) or in a continually recorded stream of speech, music and other sounds. The solution of this task is important in many applications, e.g. for command control in houses for handicapped persons, for automating

chapter

Incorporation of happiness into neutral speech by modifying emotive-keywords

G. Anushiya Rachel, S. Sreenidhi, P. Vijayalakshmi, T. Nagarajan

TENCON 2014 - 2014 IEEE Region 10 Conference > 1 - 6

TENCON 2014 - 2014 IEEE Region 10 Conference

emotive-keywords. The happy speech synthesized by the proposed method, when assessed subjectively, yields a mean opinion score of 2.53 out of a possible 3. The synthetic speech is also assessed objectively using a GMM-based emotion recognition system, and all the tested sentences are recognized to be happy.

chapter

Using n-best recognition output for extractive summarization and keyword extraction in meeting speech

Yang Liu, Shasha Xie, Fei Liu

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 5310 - 5313

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

. This paper presents an initial study using n-best recognition hypotheses for two tasks, extractive summarization and keyword extraction. We extend the approach used on 1-best output to n-best hypotheses: MMR (maximum marginal relevance) for summarization and TFIDF (term frequency, inverse document frequency) weighting for

chapter

Template-based Keyword Search with pseudo posteriorgrams

Batuhan Gundogdu, Leda Sari, Gozde Cetinkaya, Murat Saraclar

2016 24th Signal Processing and Communication Application Conference (SIU) > 973 - 976

2016 24th Signal Processing and Communication Application Conference (SIU)

In this work, a template-based search approach is adopted for the Keyword Search (KWS) problem on two of the low-resource languages (Turkish and Swahili). In low-resource languages, the use of Large Vocabulary Continuous Speech Recognition (LVCSR) systems in KWS tasks may perform poorly especially on out-of-vocabulary

chapter

Automatic Pronunciation Transliteration for Chinese-English Mixed Language Keyword Spotting

Shilei Zhang, Zhiwei Shuang, Yong Qin

2010 20th International Conference on Pattern Recognition > 1610 - 1613

2010 20th International Conference on Pattern Recognition (ICPR 2010)

implement the pronunciation conversion of English keywords to Chinese automatically. The efficiency of the proposed method was demonstrated under KWS task on mixed language database.

chapter

An Improved Mandarin Keyword Spotting System Using MCE Training and Context-Enhanced Verification

JiaEn Liang, Meng Meng, XiaoRui Wang, Peng Ding, more

2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings > 1 > I

2006 IEEE International Conference on Acoustics, Speech, and Signal Processing

The task of keyword spotting is to detect a set of keywords in the input continuous speech. The main goal of this work is to develop an improved Mandarin keyword spotting (KWS) system for conversational telephone speech (CTS). In this paper, we propose an efficient online-garbage model based KWS system, which

chapter

Keyword spotting system for Tamil isolated words using Multidimensional MFCC and DTW algorithm

Senthildevi K. A, Chandra E

2015 International Conference on Communications and Signal Processing (ICCSP) > 550 - 554

2015 International Conference on Communications and Signal Processing (ICCSP)

Audio mining is a speaker independent speech processing technique and is related to data mining. Keyword spotting plays an important role in audio mining. Keyword spotting is retrieval of all instances of a given keyword in spoken utterances. It is well suited to data mining tasks that process large amount of speech

chapter

A new keyword spotting approach

H. Bahi, N. Benati

2009 International Conference on Multimedia Computing and Systems > 77 - 80

2009 International Conference on Multimedia Computing and Systems (ICMCS'09)

Keyword spotting is the task of identifying the occurrences of certain desired keywords in an arbitrary speech signal. Keyword spotting has many applications one of them is telephone routing. In particular, we consider a big company which receives thousands of telephone calls daily. We are interested with the

chapter

Spoken term detection from noisy input

G Gosztolya, G Kovacs, L Toth

2011 6th IEEE International Symposium on Applied Computational Intelligence and Informatics (SACI) > 91 - 96

2011 6th IEEE International Symposium on Applied Computational Intelligence and Informatics (SACI)

The aim of the spoken term detection task is to find the occurrence of user-entered keywords in an archive of audio recordings. The kind of techniques that are used usually are vocabulary-independent, using only the acoustic information available. In this scenario, however, we rely exclusively on the acoustic model

chapter

Sub-word modeling of out of vocabulary words in spoken term detection

I. Szoke, L. Burget, J. Cernocky, M. Fapso

2008 IEEE Spoken Language Technology Workshop > 273 - 276

2008 IEEE Workshop on Spoken Language Technology. SLT 2008

This paper deals with comparison of sub-word based methods for spoken term detection (STD) task and phone recognition. The sub-word units are needed for search for out-of-vocabulary words. We compared words, phones and multigrams. The maximal length and pruning of multigrams were investigated first. Then two

chapter

Stressed speech processing: Human vs automatic in non-professional speakers scenario

S Shukla, S R M Prasanna, S Dandapat

2011 National Conference on Communications (NCC) > 1 - 5

2011 National Conference on Communications (NCC)

This study analyzes the effect of stress in human and automatic stressed speech processing tasks for speech collected from non-professional speakers. The database of 33 keywords is collected under five stress conditions, namely, neutral, angry, happy, sad and Lombard from fifteen speakers. The first study is to

chapter

A Word-Dependent Automatic Arabic Speaker Identification System

S.S. Al-Dahri, Y.H. Al-Jassar, Y.A. Alotaibi, M.M. Alsulaiman, more

2008 IEEE International Symposium on Signal Processing and Information Technology > 198 - 202

2008 8th IEEE International Symposium on Signal Processing and Information Technology. ISSPIT 2008

Automatic speaker recognition is one of the difficult tasks in the field of computer speech and speaker recognition. Speaker recognition is a biometric process of automatically recognizing who is speaking on the basis of speaker dependent features of the speech signal. Currently, speaker recognition system is an

chapter

Maximum Entropy Based Normalization Of Word Posteriors For Phonetic And Lvcsr Lattice Search

Peng Yu, Duo Zhang, F. Seide

2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings > 1 > I

2006 IEEE International Conference on Acoustics, Speech, and Signal Processing

In many key word-spotting systems, the word posterior probability is an elementary quantity. In theory, the posterior of a keyword match denotes the probability of the match being correct. However, posteriors estimated on lattices, in particular phoneme lattices, are often off by orders of magnitude. This paper

Filter options

Keywords:
SPEECH PROCESSING
Publication type:
book

Publication date

Set your own date range

Keywords

SPEECH RECOGNITION (10)
SPEECH (9)
HIDDEN MARKOV MODELS (8)
KEYWORD SPOTTING (5)
ACCURACY (3)
DATABASES (3)
LATTICES (3)
TRAINING (3)
VOCABULARY (3)
ACOUSTIC SIGNAL PROCESSING (2)
ACOUSTICS (2)
AUDIO SIGNAL PROCESSING (2)
FEATURE EXTRACTION (2)
HIDDEN MARKOV MODEL (2)
HUMANS (2)
NATURAL LANGUAGES (2)
NIST (2)
SPEAKER RECOGNITION (2)
SPOKEN TERM DETECTION (2)
ACOUSTIC ANALYSIS (1)
ACOUSTIC CHARACTERISATION (1)
ACOUSTIC INFORMATION (1)
ACOUSTIC MODEL (1)
ACTION ITEM DETECTION (1)
ADAPTATION MODEL (1)
ALGORITHM DESIGN AND ANALYSIS (1)
ANALYTICAL MODELS (1)
ARABIC (1)
ARBITRARY SPEECH SIGNAL (1)
ARTIFICIAL NEURAL NETWORKS (1)
AUDIO INFORMATION RETREIVAL (1)
AUDIO MINING (1)
AUDIO RECORDINGS (1)
AUTOMATIC PRONUNCIATION TRANSLITERATION (1)
AUTOMATIC SPEAKER RECOGNITION (1)
AUTOMATIC SPEECH PROCESSING (1)
AUTOMATIC STRESS CLASSIFIER (1)
BIOMETRIC PROCESS (1)
BROWSING (1)
CHINESE-ENGLISH MIXED LANGUAGE KEYWORD SPOTTING (1)
COMPARISON (1)
CONFERENCES (1)
CONTEXT-ENHANCED VERIFICATION (1)
CONTEXT-ENHANCED VERIFICATION METHOD (1)
CONTEXTUAL ANALYSIS (1)
CONVERSATIONAL TELEPHONE SPEECH (1)
COUPLINGS (1)
DATA MINING (1)
DTW ALGORITHM (1)
EQUAL-ERROR-RATE (1)
EXTRACTIVE SUMMARIZATION (1)
FIGURE OF MERIT (1)
FILLER MODEL (1)
FREQUENCY CONVERSION (1)
GLASS (1)
HMM (1)
HUMAN SPEECH PROCESSING (1)
HUMAN STRESS CLASSIFICATION (1)
HUMAN VS AUTOMATIC (1)
INDEXES (1)
INFORMATION RETRIEVAL (1)
KERNEL (1)
KEY WORD-SPOTTING SYSTEMS (1)
KEYWORD EXTRACTION (1)
KEYWORD SEARCH (1)
KEYWORD SPOTTING APPROACH (1)
LARGE-VOCABULARY CONTINUOUS-SPEECH RECOGNITION (1)
LATTICE (1)
LINGUISTICS (1)
LOW-QUALITY MICROPHONE (1)
LOW-RESOURCE LANGUAGES (1)
LVCSR LATTICE SEARCH (1)
MANDARIN KEYWORD SPOTTING SYSTEM (1)
MAXIMUM ENTROPY BASED NORMALIZATION (1)
MAXIMUM ENTROPY METHODS (1)
MAXIMUM MARGINAL RELEVANCE (1)
MEASUREMENT (1)
MEETING SPEECH (1)
MEL FREQUENCY CEPSTRAL COEFFICIENT (1)
MESSAGE AUTHENTICATION (1)
MFCC FEATURE VECTORS (1)
MIXED LANGUAGE (1)
MMR (1)
MULTIGRAM (1)
N-BEST HYPOTHESES (1)
N-BEST RECOGNITION HYPOTHESES (1)
N-BEST RECOGNITION OUTPUT (1)
NIST STD06 DEV-SET CTS DATA (1)
NOISE MEASUREMENT (1)
NOISY INPUT (1)
NON PROFESSIONAL SPEAKERS SCENARIO (1)
ONLINE-GARBAGE MODEL (1)
OOV WORD (1)
OUT-OF-VOCABULARY WORD (1)
PATTERN MATCHING (1)
PERCEPTUAL APPROACH (1)
PHONE (1)
PHONE RECOGNITION (1)
PHONEME BASED HMM (1)
more

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options