Speech/music discrimination for radio broadcasts using a hybrid HMM-Bayesian Network architecture

Aggelos Pikrakis; Theodoros Giannakopoulos; Sergios Theodoridis

Speech/music discrimination for radio broadcasts using a hybrid HMM-Bayesian Network architecture

Pikrakis, Aggelos, Giannakopoulos, Theodoros, Theodoridis, Sergios

Source

2006 14th European Signal Processing Conference > 2006 > 1 - 5

Abstract

This paper presents a speech/music discrimination scheme for radio recordings using a hybrid architecture based on a combination of a Variable Duration Hidden Markov Model (VDHMM) and a Bayesian Network (BN). The proposed scheme models speech and music as states in a VDHMM. A modified Viterbi algorithm for the computation of the observations' probabilities at each state is proposed. This is achieved by embedding a BN, that outputs to the HMM the required probability values. The proposed system has been tested on audio recordings from a variety of radio stations and has exhibited an overall performance close to 95%.