Search results

Items from 1 to 7 out of 7 results

chapter

End-to-end ASR-free keyword search from speech

Kartik Audhkhasi, Andrew Rosenberg, Abhinav Sethy, Bhuvana Ramabhadran, et al.

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4840 - 4844

sequence during training. This paper explores the design of an ASR-free end-to-end system for text query-based keyword search (KWS) from speech trained with minimal supervision. Our E2E KWS system consists of three sub-systems. The first sub-system is a recurrent neural network (RNN)-based acoustic auto-encoder trained to

chapter

Score normalization for keyword search

Leda Sari, Murat Saraclar

2016 24th Signal Processing and Communication Application Conference (SIU) > 761 - 764

In this work, keyword search (KWS) is based on a symbolic index that uses posteriorgram representation of the speech data. For each query, sum-to-one normalization or keyword specific thresholding is applied to the search results. The effect of these methods on the proposed KWS system is investigated. Results are

chapter

Audio Clips Content Comparison Using Latent Semantic Indexing

K. Biatov, J. Koehler, D. Schneider

2009 IEEE International Conference on Semantic Computing > 509 - 512

2009 IEEE International Conference on Semantic Computing (ICSC)

This paper describes experiments for audio clips comparison based on spoken context. The spoken content is obtained using automatic speech recognition. The social tags that are available for most of the audio clips are used as keywords. These keywords are mapped to the spoken transcription representing the audio clips

chapter

Semi-supervised training in low-resource ASR and KWS

Florian Metze, Ankur Gandhe, Yajie Miao, Zaid Sheikh, et al.

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4699 - 4703

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In particular for “low resource” Keyword Search (KWS) and Speech-to-Text (STT) tasks, more untranscribed test data may be available than training data. Several approaches have been proposed to make this data useful during system development, even when initial systems have Word Error Rates (WER) above 70

chapter

Speech and Auditory Interfaces for Ubiquitous, Immersive and Personalized Applications

Lei Xie, Wenhuai Zhao, Xiangzeng Zhou, Xiaohai Tian, et al.

2010 7th International Conference on Ubiquitous Intelligence & Computing and 7th International Conference on Autonomic & Trusted Computing > 503 - 505

2010 7th International Conference on Ubiquitous Intelligence & Computing and 7th International Conference on Autonomic & Trusted Computing (UIC/ATC 2010)

prototype system demonstrates our latest development on automatic speech recognition, keyword spotting, personalized text-to-speech synthesis and visual speech synthesis. The second demo exhibits a virtual concert with immersive audio effects. Through our virtual auditory technology, wearing simple earphones, listeners are

chapter

Research on the Embedded Intelligent Information Service Platform Based on ASR

Tang Yujun, Wang Xia, Wang Yongqing

2009 International Conference on Information Engineering and Computer Science > 1 - 4

2009 International Conference on Information Engineering and Computer Science. ICIECS 2009

The paper discusses the overall design scheme of intelligent information service platform based on automatic speech recognition and geographical information system, with the carrier of opening multimedia operation platform. This platform can implement good communication between human and the system through keyword

chapter

Voice enabled multilingual newspaper reading system

Jose Stephen, M. Anjali, V. K. Bhadran

2013 IEEE Global Humanitarian Technology Conference: South Asia Satellite (GHTC-SAS) > 317 - 320

Speech Recognition (ASR), Multilingual Text-to-Speech system with other enhanced features like keyword search facility, Intelligent/Auto customization in accordance with user and paper independent classified headings. The integration of ASR enables user to operate the system in complete hands free mode.

INFONA - science communication portal
