Search results

Items from 41 to 49 out of 49 results

chapter

A study on sports video classification based on audio analysis and speech recognition

Li Lu, Qingwei Zhao, Yonghong Yan, Kun Liu

2010 International Conference on Audio, Language and Image Processing > 737 - 742

2010 International Conference on Audio, Language and Image Processing (ICALIP)

keywords which are used as features to distinguish different sports. Finally, based on the keyword spotting (KWS) results and specific keywords selected for each kind of sports, a score ranking strategy is designed for conducting classification automatically. For robust KWS in our system, adaptation techniques for acoustic

article

Automatic Sentiment Detection in Naturalistic Audio

Lakshmish Kaushik, Abhijeet Sangwan, John H. L. Hansen

IEEE/ACM Transactions on Audio, Speech, and Language Processing > 2017 > 25 > 8 > 1668 - 1679

challenging problem. Generic methods for sentiment extraction generally use transcripts from a speech recognition system, and process the transcript using text-based sentiment classifiers. In this study, we show that this baseline system is suboptimal for audio sentiment extraction. Alternatively, new architecture using keyword

chapter

Investigating techniques for low resource conversational speech recognition

Antoine Laurent, Thiago Fraga-Silva, Lori Lamel, Jean-Luc Gauvain

2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5975 - 5979

ICASSP 2016 - 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper we investigate various techniques in order to build effective speech to text (STT) and keyword search (KWS) systems for low resource conversational speech. Subword decoding and graphemic mappings were assessed in order to detect out-of-vocabulary keywords. To deal with the limited amount of transcribed

chapter

Radio-browsing for developmental monitoring in Uganda

Raghav Menon, Armin Saeb, Hugh Cameron, William Kibira, more

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5795 - 5799

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

automatic speech recognisers using HMM/GMM, SGMM and DNN/HMM acoustic models as keyword spotters. We present the first results indicating promising performance of the radio-browsing system.

chapter

Voice-activity home care system

Oscal T.-C. Chen, Y. H. Tsai, C. W. Su, P. C. Kuo, more

2016 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI) > 110 - 113

2016 IEEE-EMBS 3rd International Conference on Biomedical and Health Informatics (BHI)

This work proposes a voice-activity home care system which can construct a life log associated with voices at home. Accordingly, the techniques of sound-pressure-level calculation, abnormal sound detection, noise reduction, text-independent speaker recognition and keyword spotting are developed. In abnormal sound

chapter

Improving data selection for low-resource STT and KWS

Thiago Fraga-Silva, Antoine Laurent, Jean-Luc Gauvain, Lori Lamel, more

2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU) > 153 - 159

2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)

This paper extends recent research on training data selection for speech transcription and keyword spotting system development. Selection techniques were explored in the context of the IARPA-Babel Active Learning (AL) task for 6 languages. Different selection criteria were considered with the goal of improving over a

chapter

Speech and Auditory Interfaces for Ubiquitous, Immersive and Personalized Applications

Lei Xie, Wenhuai Zhao, Xiangzeng Zhou, Xiaohai Tian, more

2010 7th International Conference on Ubiquitous Intelligence&Computing and 7th International Conference on Autonomic&Trusted Computing > 503 - 505

2010 7th International Conference on Ubiquitous Intelligence & Computing and 7th International Conference on Autonomic & Trusted Computing (UIC/ATC 2010)

prototype system demonstrates our latest development on automatic speech recognition, keyword spotting, personalized text-to-speech synthesis and visual speech synthesis. The second demo exhibits a virtual concert with immersive audio effects. Through our virtual auditory technology, wearing simple earphones, listeners are

chapter

Hierarchical audio-visual cue integration framework for activity analysis in intelligent meeting rooms

S.T. Shivappa, M.M. Trivedi, B.D. Rao

2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops > 107 - 114

2009 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

recognition using audio and visual cues. The novelty lies in putting together the tasks such that they can provide relevant information to one another. We evaluate the performance of our system and present results for tasks such as keyword spotting and tracking re-identification on real-world meeting scenes collected in our

chapter

Fusing multiple systems into a compact lattice index for chinese spoken term detection

Sha Meng, Peng Yu, Jia Liu, F. Seide

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 4345 - 4348

ICASSP 2008. IEEE International Conference on Acoustic, Speech and Signal Processes

We examine the task of spoken term detection in Chinese spontaneous speech with a lattice-based approach. We first compare lattices generated with different units: word, character, tonal and toneless syllables, and also lattices converted from one unit to another unit. Then we combine lattices from multiple systems into a single lattice. By fully exploiting the redundant information in the combined...

Keywords:
KEYWORD SPOTTING
SPEECH RECOGNITION

Publication date

Set your own date range

INFONA - science communication portal

Search results

A study on sports video classification based on audio analysis and speech recognition

Automatic Sentiment Detection in Naturalistic Audio

Investigating techniques for low resource conversational speech recognition

Radio-browsing for developmental monitoring in Uganda

Voice-activity home care system

Improving data selection for low-resource STT and KWS

Speech and Auditory Interfaces for Ubiquitous, Immersive and Personalized Applications

Hierarchical audio-visual cue integration framework for activity analysis in intelligent meeting rooms

Fusing multiple systems into a compact lattice index for chinese spoken term detection

Filter options

Publication date

Publication type

Keywords

Data set

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Data set

Reporting an error / abuse

Sending the report failed

Accessibility options