An Overview of Speech/Music Discrimination Techniques in the Context of Audio Recordings

Aggelos Pikrakis; Theodoros Giannakopoulos; Sergios Theodoridis

doi:10.1007/978-3-540-78502-6_4

An Overview of Speech/Music Discrimination Techniques in the Context of Audio Recordings

Aggelos Pikrakis, Theodoros Giannakopoulos, Sergios Theodoridis

Source

Studies in Computational Intelligence > Multimedia Services in Intelligent Environments > 81-102

Abstract

Summary

Speech/music discrimination of audio recordings refers to the problem of segmenting an audio stream and labeling each segment as either speech or music. This chapter provides an overview of methods that have been proposed in the field during the past decade and also presents in more detail a methodology that treats the problem as a posterior probability maximization task. Given that feature extraction is of primary importance to all methods, a study of feature extraction schemes is first provided. The existing methods are then broadly classified to categories depending on the underlying design philosophy. Finally, a performance study is given by presenting the datasets and accompanying assumptions that each method has adopted.

Identifiers

series ISSN :	1860-949X
series e-ISSN :	1860-9503
book ISBN :	978-3-540-78491-3
book e-ISBN :	978-3-540-78502-6
DOI	10.1007/978-3-540-78502-6_4

Authors

Aggelos Pikrakis

University of Athens, Department of Informatics and Telecommunications, Athens, Greece

Theodoros Giannakopoulos

University of Athens, Department of Informatics and Telecommunications, Athens, Greece

Sergios Theodoridis

University of Athens, Department of Informatics and Telecommunications, Athens, Greece

Additional information

Data set: Springer

Publisher

Springer Berlin Heidelberg

chapter

Read online
Download
Add to read later
Add to collection
Add to followed
Share

Export to bibliography


Assign to other user
	×
Wrong email address

INFONA - science communication portal

An Overview of Speech/Music Discrimination Techniques in the Context of Audio Recordings $("#expandableTitles").expandable();

Source

Abstract

Identifiers

Authors

User assignment

Assignment remove confirmation

You're going to remove this assignment. Are you sure?

Aggelos Pikrakis

Theodoros Giannakopoulos

Sergios Theodoridis

Additional information

Publisher

Share

Export to bibliography

Reporting an error / abuse

Sending the report failed

Accessibility options

An Overview of Speech/Music Discrimination Techniques in the Context of Audio Recordings