Search results

Items from 1 to 8 out of 8 results

chapter

Graphical models for the recognition of Arabic continuous speech based triphones modeling

Elyes Zarrouk, Yassine Benayed, Faiez Gargouri

2015 IEEE/ACIS 16th International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD) > 1 - 6

2015 IEEE/ACIS 16th International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD)

Recent developments in inference and learning in Dynamic Bayesian networks (DBN) allow their use in real-world applications is the first successful application of DBNs to a large scale speech recognition problem. Even if their progress is huge, those models lack a discriminatory ability especially on speech recognition such as the Hidden Markov models (HMM). In this paper, we present the performance...

chapter

Passive versus active: Vocal classification system

Z. Hammal, B. Bozkurt, L. Couvreur, D. Unay, more

2005 13th European Signal Processing Conference > 1 - 4

2005 13th European Signal Processing Conference

Five expressions are commonly considered to characterize human emotional states: Happiness, Surprise, Anger, Sadness and Neutral. Different measures can be extracted from speech signals to characterize these expressions, for example the pitch, the energy, the SPI and the speech rate. Automatic classification of the five expressions based on these features shows a great confusion between Anger, Surprise...

chapter

Prosody based voice forgery detection using SVM

Renjith S., Leena Mary, Anish Babu K.K., Aju Joseph, more

2013 International Conference on Control Communication and Computing (ICCC) > 527 - 530

2013 International Conference on Control Communication and Computing (ICCC)

Speaker recognition has many applications such as access control, person authentication systems, forensics etc. In forensic applications, questioned recording may be received through different channels, noisy conditions and with cases of voice forgery, which make speaker recognition a challenging task. State of the art speaker recognition systems use spectral features which are susceptible to channel...

chapter

Multi-modal feature integration for story boundary detection in broadcast news

Mi-Mi Lu, Lei Xie, Zhong-Hua Fu, Dong-Mei Jiang, more

2010 7th International Symposium on Chinese Spoken Language Processing > 420 - 425

7th International Symposium on Chinese Spoken Language Processing (ISCSLP 2010)

This paper investigates how to integrate multi-modal features for story boundary detection in broadcast news. The detection problem is formulated as a classification task, i.e., classifying each candidate into boundary/non-boundary based on a set of features. We use a diverse collection of features from text, audio and video modalities: lexical features capturing the semantic shifts of news topics...

chapter

A Hierarchical System Design for Language Identification

Haipeng Wang, Xiang Xiao, Xiang Zhang, Jianping Zhang, more

2009 Second International Symposium on Information Science and Engineering > 443 - 446

Second International Symposium on Information Science and Engineering (ISISE 2009)

Token-based approaches have proven quite effective for spoken language identification (LID). Traditionally, Speech utterances are first decoded into token sequences, and then LID tasks are performed on these token sequences by either n-gram language models or support vector machines. In this paper, we propose a hierarchical system design, which utilizes a group of bayesian logistic regression models...

chapter

Combined speech decoders output for phoneme recognition enhancement

K. Abida, F. Karray, W. Abida

2009 3rd International Conference on Signals, Circuits and Systems (SCS) > 1 - 6

2009 3rd International Conference on Signals, Circuits and Systems (SCS 2009)

Phoneme recognition is an essential component of any robust speech decoder and has been tackled by many researchers. Speech feature extraction constitutes the front end module of any speech decoder: it plays an essential role and has a strong impact on the recognition performance. The research community is aggressively searching for more powerful solutions which combine the existing feature extraction...

chapter

A novel strategy for speaker verification based on SVM classification of pairs of speech sequences

K. Daoudi, J. Louradour

2007 9th International Symposium on Signal Processing and Its Applications > 1 - 4

2007 9th International Symposium on Signal Processing and Its Applications (ISSPA)

We introduce a novel strategy for speaker verification based on the conception of a classifier which is independent of the target speaker, as opposed to traditional systems where the classifier is always target dependent. The basic principle is to build a system that decides whether two sequences were pronounced by the same speaker. In our view, this system is aimed to complement traditional ones...

chapter

Mixed Type Audio Classification with Support Vector Machine

Lei Chen, S. Gunduz, M.T. Ozsu

2006 IEEE International Conference on Multimedia and Expo > 781 - 784

2006 IEEE International Conference on Multimedia and Expo

Content-based classification of audio data is an important problem for various applications such as overall analysis of audio-visual streams, boundary detection of video story segment, extraction of speech segments from video, and content-based video retrieval. Though the classification of audio into single type such as music, speech, environmental sound and silence is well studied, classification...

INFONA - science communication portal

Search results

Graphical models for the recognition of Arabic continuous speech based triphones modeling

Passive versus active: Vocal classification system

Prosody based voice forgery detection using SVM

Multi-modal feature integration for story boundary detection in broadcast news

A Hierarchical System Design for Language Identification

Combined speech decoders output for phoneme recognition enhancement

A novel strategy for speaker verification based on SVM classification of pairs of speech sequences

Mixed Type Audio Classification with Support Vector Machine

Filter options

Publication date

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options