Singing is the act of producing musical sound with the voice. It is an integral part of Indian culture. From the ancient mythological era through the various medieval periods, different genres and varieties of singers have emerged. The art of singing in India also varies from region to region and from one gharana to another. It can be practised as religious devotion, as a source of pleasure or ritual, or as a part...
The perception of emotion is critical for social interactions. Nonlinguistic signals such as those in the human voice and musical instruments are used for communicating emotion. Using an adaptation paradigm, this study examines the extent to which common mental mechanisms are applied for emotion processing of instrumental and vocal sounds. In two experiments we show that prolonged exposure to affective...
This paper addresses the problem of classifying continuous, general-purpose audio data for content-based retrieval. It presents a scheme for classifying audio data, with segmentation performed on the same data so that processing is faster. The audio data can be classified into eight categories: simple speech, noise, silence, music, single speech with music, double speech with...
This paper presents a comparison of five commonly used methods for fundamental frequency detection in speech signals, and specifically in vocal and melodic instrument signals. The efficiency of each method is verified on a known set of musical notes performed on a bass clarinet. The highest efficiency in fundamental frequency detection was reached by AutoCorrelation (ACF) and Modified AutoCorrelation (MACF)...
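The autocorrelation approach named in this abstract can be illustrated with a minimal sketch; the test tone, sample rate, and search range below are illustrative assumptions, not the paper's experimental setup:

```python
import numpy as np

def acf_f0(signal, sr, fmin=50.0, fmax=1000.0):
    """Estimate the fundamental frequency of a frame via the autocorrelation function (ACF)."""
    x = signal - np.mean(signal)
    # Autocorrelation; index 0 of the slice corresponds to lag 0
    acf = np.correlate(x, x, mode="full")[len(x) - 1:]
    # Search for the strongest peak within the plausible lag range
    lag_min = int(sr / fmax)
    lag_max = int(sr / fmin)
    lag = lag_min + np.argmax(acf[lag_min:lag_max])
    return sr / lag

sr = 16000
t = np.arange(int(0.05 * sr)) / sr
tone = np.sin(2 * np.pi * 220.0 * t)   # 50 ms test tone at 220 Hz
print(acf_f0(tone, sr))                # ≈ 220 Hz
```

The lag of the strongest ACF peak within the allowed range is taken as the period; restricting the search to [sr/fmax, sr/fmin] avoids the trivial peak at lag 0 and sub-octave errors.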
This work explores the use of Empirical Mode Decomposition (EMD) for discriminating speech regions from music in audio recordings. The different frequency scales or Intrinsic Mode Functions (IMFs) obtained from EMD of the audio signal are found to contain discriminatory evidence for distinguishing the speech regions from the music regions of the audio signal. Different statistical measures like mean,...
In order to automatically extract the main melody contours from polyphonic music, especially songs with a vocal melody, we present an effective approach based on a Bayesian framework. Drawing on various information from the music signals, we use a pitch evolution model describing how the pitch contour changes, and an acoustic model representing the acoustic characteristics under a hypothesized pitch,...
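Combining a pitch-evolution (transition) model with a per-frame acoustic model is the classic setup for Viterbi decoding of a pitch contour. A minimal sketch under that assumption; the discretized state space, toy matrices, and function names here are illustrative, not the paper's actual models:

```python
import numpy as np

def viterbi_pitch(obs_loglik, trans_loglik):
    """Decode the most likely pitch-state sequence.

    obs_loglik:   (T, S) acoustic log-likelihood of each pitch state per frame
    trans_loglik: (S, S) pitch-evolution log-probabilities (row: from, col: to)
    """
    T, S = obs_loglik.shape
    score = obs_loglik[0].copy()
    back = np.zeros((T, S), dtype=int)
    for t in range(1, T):
        cand = score[:, None] + trans_loglik        # (from, to)
        back[t] = np.argmax(cand, axis=0)
        score = cand[back[t], np.arange(S)] + obs_loglik[t]
    path = [int(np.argmax(score))]
    for t in range(T - 1, 0, -1):
        path.append(int(back[t, path[-1]]))
    return path[::-1]

# Toy example: 3 pitch states, smooth transitions preferred
trans = np.log(np.array([[0.8, 0.1, 0.1],
                         [0.1, 0.8, 0.1],
                         [0.1, 0.1, 0.8]]))
obs = np.log(np.array([[0.8, 0.1, 0.1],
                       [0.8, 0.1, 0.1],
                       [0.1, 0.8, 0.1],
                       [0.1, 0.8, 0.1]]))
print(viterbi_pitch(obs, trans))  # → [0, 0, 1, 1]
```

The transition matrix penalizes large jumps, so the decoded contour stays smooth even when individual frames are ambiguous.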
There are various kinds of sound signal analysis methods. Sinusoidal modeling, one of these methods, is based on the idea that any sound signal can be expressed as the sum of sinusoidal components whose instantaneous frequency and amplitude vary continuously with time. Sinusoidal modeling is known to be a good model for sound signals, but it has been applied to data which had only...
In this paper, an approach is presented that identifies music samples which are difficult for current state-of-the-art beat trackers. In order to estimate this difficulty even for examples without ground truth, a method motivated by selective sampling is applied. This method assigns a degree of difficulty to a sample based on the mutual disagreement between the output of various beat tracking systems...
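The mutual-disagreement idea in this abstract can be sketched with a toy committee: each tracker outputs beat times, pairwise agreement is the fraction of one tracker's beats matched by another within a tolerance, and difficulty is one minus the mean agreement. The beat times, the ±70 ms tolerance, and this particular agreement score are illustrative stand-ins for the paper's actual systems and measures:

```python
import numpy as np

def pairwise_agreement(beats_a, beats_b, tol=0.07):
    """Fraction of beats in `beats_a` matched by a beat in `beats_b` within `tol` seconds."""
    hits = sum(np.min(np.abs(np.asarray(beats_b) - t)) <= tol for t in beats_a)
    return hits / len(beats_a)

def difficulty(beat_outputs, tol=0.07):
    """1 minus the mean pairwise agreement across all ordered tracker pairs."""
    scores = []
    n = len(beat_outputs)
    for i in range(n):
        for j in range(n):
            if i != j:
                scores.append(pairwise_agreement(beat_outputs[i], beat_outputs[j], tol))
    return 1.0 - float(np.mean(scores))

same = [0.0, 0.5, 1.0, 1.5]
offbeat = [0.25, 0.75, 1.25, 1.75]
print(difficulty([same, same]))          # → 0.0 (trackers agree: easy sample)
print(difficulty([same, offbeat]))       # → 1.0 (trackers disagree: hard sample)
```

Samples with high difficulty can then be flagged for manual annotation, in the spirit of selective sampling.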
In this paper, we propose a semi-supervised algorithm based on sparse non-negative matrix factorization (NMF) to improve separation of speech from background music in monaural signals. In our approach, fixed speech basis vectors are obtained from training data whereas music bases are estimated on-the-fly to cope with spectral variability while preserving small NMF dimensionality for decreased computation...
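The fixed-speech-basis idea can be sketched with plain Euclidean multiplicative NMF updates. This is a simplified stand-in for the paper's sparse NMF: the dimensions are arbitrary, the sparsity penalty is omitted, and the update rule is the textbook Euclidean one:

```python
import numpy as np

def semi_supervised_nmf(V, W_speech, k_music=4, n_iter=200, eps=1e-9):
    """Factorize a magnitude spectrogram V ≈ [W_speech | W_music] H,
    with W_speech held fixed (trained offline) and W_music learned on the fly."""
    rng = np.random.default_rng(0)
    n_freq, n_frames = V.shape
    k_s = W_speech.shape[1]
    W = np.hstack([W_speech, rng.random((n_freq, k_music))])
    H = rng.random((k_s + k_music, n_frames))
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + eps)   # multiplicative update for activations
        W *= (V @ H.T) / (W @ H @ H.T + eps)   # multiplicative update for bases
        W[:, :k_s] = W_speech                  # keep speech bases fixed
    speech = W[:, :k_s] @ H[:k_s]              # speech-only reconstruction
    music = W[:, k_s:] @ H[k_s:]
    return speech, music

# Synthetic check: a mixture built from a known speech basis plus a music part
rng = np.random.default_rng(1)
Ws = rng.random((20, 4))
V = Ws @ rng.random((4, 30)) + rng.random((20, 4)) @ rng.random((4, 30))
speech, music = semi_supervised_nmf(V, Ws)
err = np.linalg.norm(speech + music - V) / np.linalg.norm(V)
print(err < 0.25)  # → True: the two parts jointly reconstruct the mixture
```

Masking the mixture spectrogram with `speech / (speech + music)` would then give a soft separation filter.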
Audio classification serves as the fundamental step towards applications like content-based audio retrieval. In this work, we have tried to exploit the inherent difference in the composition of speech and music signals. A music signal has richer frequency components than a speech signal. The energy distributions of speech and music signals also reflect a pattern that can be used to differentiate...
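The "richer frequency components" observation is commonly captured by features such as the spectral centroid, which a harmonically rich signal pushes upward. A small sketch; the synthetic signals and parameters below are illustrations, not the paper's feature set:

```python
import numpy as np

def spectral_centroid(frame, sr):
    """Magnitude-weighted mean frequency of a frame's spectrum."""
    mag = np.abs(np.fft.rfft(frame))
    freqs = np.fft.rfftfreq(len(frame), 1.0 / sr)
    return float(np.sum(freqs * mag) / (np.sum(mag) + 1e-12))

sr, n = 8000, 2000                        # 0.25 s frame; integer cycle counts avoid leakage
t = np.arange(n) / sr
plain = np.sin(2 * np.pi * 200 * t)       # single low partial (speech-like toy signal)
rich = sum(np.sin(2 * np.pi * f * t) for f in (200, 1200, 2400))  # harmonically rich
print(spectral_centroid(plain, sr) < spectral_centroid(rich, sr))  # → True
```

Thresholding such features per frame, or feeding them to a simple classifier, is a common baseline for speech/music discrimination.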
We present novel fast multi-pass decoding strategies for recognizing large sets of named entities on a low-resource embedded device, thereby enabling retrieval of MP3 music using spoken queries that contain partial segments of whole music titles and artist names. After acoustic-phonetic decoding in the first-stage processing, we incorporate word boundary information together with a phonetic confusion matrix into the next-stage partial...
The historic acoustic-phonetic collection (HAPS) of the Dresden University of Technology [47] preserves historic material from more than 100 years of experimental phonetics in Germany and more than 50 years of speech technology in Dresden. The latter began with the development of a channel vocoder in the 1950s, which was the starting point for continuous investigations in speech analysis and synthesis...
In this paper we propose an approach for the problem of single channel source separation of speech and music signals. Our approach is based on representing each source's power spectral density using dictionaries and nonlinearly projecting the mixture signal spectrum onto the combined span of the dictionary entries. We encourage sparsity and continuity of the dictionary coefficients using penalty terms...
A prerequisite for identifying the singers in popular music recordings is to reduce the interference of background accompaniment when trying to characterize the singer's voice. This study proposes a background music removal approach for singer identification (SID) that exploits the underlying relationships between solo voices and their accompanied versions in the cepstrum. The relationships are characterized...
The goal of the Interactive Music Archive Access System (IMAAS) project was to develop a system that allows an end-user to easily extract rhythmic, melodic, and harmonic musical metadata descriptors from audio, and to interact with the archive contents in a manner not typically supported by archive access systems. To this end, the IMAAS...
Techniques for using a microphone array to determine a sound source's location (the localization problem) have been studied for many years. A popular method is the so-called MUSIC (Multiple Signal Classification) algorithm. A second type of method tries to solve both the sound separation and localization problems in one setting; its use for localization is less well known. In this study,...
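MUSIC localizes sources by projecting candidate steering vectors onto the noise subspace of the array covariance matrix and looking for peaks where that projection vanishes. A compact sketch for a uniform linear array; the geometry, half-wavelength spacing, and simulated source are illustrative assumptions:

```python
import numpy as np

def music_spectrum(X, n_sources, angles_deg, d=0.5):
    """MUSIC pseudospectrum for a uniform linear array.

    X: (n_mics, n_snapshots) complex snapshots
    d: sensor spacing in wavelengths
    """
    n_mics = X.shape[0]
    R = X @ X.conj().T / X.shape[1]            # sample covariance
    _, vecs = np.linalg.eigh(R)                # eigenvalues in ascending order
    En = vecs[:, : n_mics - n_sources]         # noise-subspace eigenvectors
    out = []
    for theta in np.deg2rad(angles_deg):
        a = np.exp(-2j * np.pi * d * np.arange(n_mics) * np.sin(theta))
        out.append(1.0 / np.real(a.conj() @ En @ En.conj().T @ a))
    return np.array(out)

# Simulate one narrowband source at 20 degrees on an 8-microphone array
rng = np.random.default_rng(0)
n_mics, n_snap, true_deg = 8, 200, 20.0
a_true = np.exp(-2j * np.pi * 0.5 * np.arange(n_mics) * np.sin(np.deg2rad(true_deg)))
s = rng.standard_normal(n_snap) + 1j * rng.standard_normal(n_snap)
noise = 0.05 * (rng.standard_normal((n_mics, n_snap))
                + 1j * rng.standard_normal((n_mics, n_snap)))
X = np.outer(a_true, s) + noise
angles = np.arange(-90, 91)
print(angles[np.argmax(music_spectrum(X, 1, angles))])  # peak near 20
```

Steering vectors belonging to true source directions are (nearly) orthogonal to the noise subspace, so the pseudospectrum spikes at those angles.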
A higher-order model to determine audibility of audio signals is presented. Previous models have been energy based (second order) and adequate only for stationary, narrow-band signals. Music, speech and other audio signals are nonstationary and wideband so traditional energy models poorly predict the audibility of these sounds. The predictions from the higher-order model are compared to actual subjective...
Extracting a singing voice from its music accompaniment can significantly facilitate certain applications of Music Information Retrieval including singer identification and singing melody extraction. In this paper, we present a hybrid approach for this purpose, which combines properties of the Azimuth Discrimination and Resynthesis (ADRess) method with Independent Component Analysis (ICA). Our proposed...
The extraction of local tempo and beat information from audio recordings constitutes a challenging task, particularly for music that reveals significant tempo variations. Furthermore, the existence of various pulse levels such as measure, tactus, and tatum often makes the determination of absolute tempo problematic. In this paper, we present a robust mid-level representation that encodes local tempo...
Expressing the similarity between musical streams is a challenging task, as it involves understanding many factors that are most often blended into one information channel: the audio stream. Consequently, separating the musical audio stream into its main melody and its accompaniment may prove useful for rooting the similarity computation in a more robust and expressive representation...