Search results

chapter

The IBM keyword search system for the DARPA RATS program

Lidia Mangu, Hagen Soltau, Hong-Kwang Kuo, George Saon

2013 IEEE Workshop on Automatic Speech Recognition and Understanding > 204 - 209

2013 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU)

The paper describes a state-of-the-art keyword search (KWS) system in which significant improvements are obtained by using Convolutional Neural Network acoustic models, a two-step speech segmentation approach and a simplified ASR architecture optimized for KWS. The system described in this paper had the best

chapter

Order-free spoken term detection

Lidia Mangu, George Saon, Michael Picheny, Brian Kingsbury

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5331 - 5335

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

additional ordering present in a lattice or CN is discarded. TMW lists compactly summarize a large ASR search space. Representing a large search space is critical for STD metrics such as ATWV that heavily penalize misses of rare keywords. Comparisons on the OpenKWS 2014 Tamil limited language pack task [1] show that the new TMW

chapter

Efficient spoken term detection using confusion networks

Lidia Mangu, Brian Kingsbury, Hagen Soltau, Hong-Kwang Kuo, more

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 7844 - 7848

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, we present a fast, vocabulary independent algorithm for spoken term detection (STD) that demonstrates a word-based index is sufficient to achieve good performance for both in-vocabulary (IV) and out-of-vocabulary (OOV) terms. Previous approaches have required that a separate index be built at the sub-word level and then expanded to allow for matching OOV terms. Such a process, while...

chapter

Exploiting diversity for spoken term detection

Lidia Mangu, Hagen Soltau, Hong-Kwang Kuo, Brian Kingsbury, more

2013 IEEE International Conference on Acoustics, Speech and Signal Processing > 8282 - 8286

ICASSP 2013 - 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The paper describes a state-of-the-art spoken term detection system in which significant improvements are obtained by diversifying the ASR engines used for indexing and combining the search results. First, we describe the design factors that, when varied, produce complementary STD systems and show that the performance of the combined system is 3 times better than the best individual component. Next,...

chapter

Towards spoken-document retrieval for the enterprise: Approximate word-lattice indexing with text indexers

F. Seide, Peng Yu, Yu Shi

2007 IEEE Workshop on Automatic Speech Recognition&Understanding (ASRU) > 629 - 634

2007 IEEE Workshop on Automatic Speech Recognition and Understanding

Enterprise-scale search engines are generally designed for linear text. Linear text is suboptimal for audio search, where accuracy can be significantly improved if the search includes alternate recognition candidates, commonly represented as word lattices. We propose two methods to enable text indexers to approximately index lattices with little or no code change: "TMI" (Time-based Merging...

article

Rapid Yet Accurate Speech Indexing Using Dynamic Match Lattice Spotting

Kishan Thambiratnam, Sridha Sridharan

IEEE Transactions on Audio, Speech, and Language Processing > 2007 > 15 > 1 > 346 - 357

The support for typically out-of-vocabulary query terms such as names, acronyms, and foreign words is an important requirement of many speech indexing applications. However, to date many unrestricted vocabulary indexing systems have struggled to provide a balance between good detection rate and fast query speeds. This paper presents a fast and accurate unrestricted vocabulary speech indexing technique...

INFONA - science communication portal

Search results

The IBM keyword search system for the DARPA RATS program

Order-free spoken term detection

Efficient spoken term detection using confusion networks

Exploiting diversity for spoken term detection

Towards spoken-document retrieval for the enterprise: Approximate word-lattice indexing with text indexers

Rapid Yet Accurate Speech Indexing Using Dynamic Match Lattice Spotting

Filter options

Publication date

Publication type

Keywords

INFONA - science communication portal

Search results

The IBM keyword search system for the DARPA RATS program

Order-free spoken term detection

Efficient spoken term detection using confusion networks

Exploiting diversity for spoken term detection

Towards spoken-document retrieval for the enterprise: Approximate word-lattice indexing with text indexers

Rapid Yet Accurate Speech Indexing Using Dynamic Match Lattice Spotting

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options