Search results for: Zhiyao Duan

Items from 1 to 5 out of 5 results

chapter

IMINET: Convolutional semi-siamese networks for sound search by vocal imitation

Yichi Zhang, Zhiyao Duan

2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) > 304 - 308

2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

Searching sounds by text labels is often difficult, as text labels cannot always provide sufficient information for the sound content. Previously we proposed an unsupervised system called IMISOUND for sound search by vocal imitation. In this paper, we further propose a Convolutional Semi-Siamese Network (CSN) called IMINET. IMINET uses two towers of Convolutional Neural Networks (CNN) to extract features...

chapter

Visually informed multi-pitch analysis of string ensembles

Karthik Dinesh, Bochen Li, Xinzhao Liu, Zhiyao Duan, more

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 3021 - 3025

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Multi-pitch analysis of polyphonic music requires estimating concurrent pitches (estimation) and organizing them into temporal streams according to their sound sources (streaming). This is challenging for approaches based on audio alone due to the polyphonic nature of the audio signals. Video of the performance, when available, can be useful to alleviate some of the difficulties. In this paper, we...

chapter

Deep ranking: Triplet MatchNet for music metric learning

Rui Lu, Kailun Wu, Zhiyao Duan, Changshui Zhang

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 121 - 125

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Metric learning for music is an important problem for many music information retrieval (MIR) applications such as music generation, analysis, retrieval, classification and recommendation. Traditional music metrics are mostly defined on linear transformations of handcrafted audio features, and may be improper in many situations given the large variety of music styles and instrumentations. In this paper,...

chapter

Retrieving sounds by vocal imitation recognition

Yichi Zhang, Zhiyao Duan

2015 IEEE 25th International Workshop on Machine Learning for Signal Processing (MLSP) > 1 - 6

2015 IEEE 25th International Workshop on Machine Learning for Signal Processing (MLSP)

Vocal imitation is widely used in human communication. In this paper, we propose an approach to automatically recognize the concept of a vocal imitation, and then retrieve sounds of this concept. Because different acoustic aspects (e.g., pitch, loudness, timbre) are emphasized in imitating different sounds, a key challenge in vocal imitation recognition is to extract appropriate features. Hand-crafted...

chapter

Audio tonality mode classification without tonic annotations

Zhiyao Duan, Lie Lu, Changshui Zhang

2008 IEEE International Conference on Multimedia and Expo > 1361 - 1364

2008 IEEE International Conference on Multimedia and Expo (ICME)

Traditional tonality mode (major or minor) classification or audio key finding algorithms often rely on tonic annotations (key names) of the training songs. However, unlike classical music whose keys are usually explicitly labeled in their titles, the keys of numerous popular music are hard to obtain. In contrast, it is much easier to only label the mode for each song. With only modes labeled, traditional...

Filter options

Keywords:
FEATURE EXTRACTION

Publication date

Set your own date range

Keywords

SUPPORT VECTOR MACHINES (3)
TRAINING (3)
CONVOLUTION (2)
MEASUREMENT (2)
METRIC LEARNING (2)
MUSIC (2)
VOCAL IMITATION (2)
ACCURACY (1)
AUDIO CODING (1)
AUDIO KEY FINDING (1)
AUDIO TONALITY MODE CLASSIFICATION (1)
AUDIO-VISUAL ANALYSIS (1)
AUTOMATIC FEATURE LEARNING (1)
CONFERENCES (1)
CONSTRAINED CLUSTERING (1)
CONVOLUTIONAL NEURAL NETWORKS (1)
CONVOLUTIONAL SIAMESE NETWORK (1)
CORRELATION (1)
DEEP LEARNING (1)
ESTIMATION (1)
HIDDEN MARKOV MODELS (1)
INFORMATION RETRIEVAL (1)
INSTRUMENTS (1)
INTEGRATED OPTICS (1)
MODE LEARNING (1)
MULTI-CLASS CLASSIFICATION (1)
MULTI-PITCH ESTIMATION (1)
MULTIPLE PROFILE CORRELATION (1)
MUSIC INFORMATION RETRIEVAL (1)
MUSIC SIMILARITY (1)
OPTICAL IMAGING (1)
POLES AND TOWERS (1)
PRINCIPAL COMPONENT ANALYSIS (1)
SEMANTICS (1)
SILICON CARBIDE (1)
SINGLE PROFILE CORRELATION (1)
SOUND RETRIEVAL (1)
SOURCE SEPARATION (1)
SPECTROGRAM (1)
STACKED AUTO-ENCODER (1)
STREAMING (1)
STREAMING MEDIA (1)
SUPPORT VECTOR MACHINE (1)
SUPPORT VECTOR MACHINE CLASSIFICATION (1)
SVM CLASSIFIER (1)
SYNTHESIZERS (1)
TIMBRE (1)
TONALITY CLASSIFICATION (1)
more

INFONA - science communication portal

Search results for: Zhiyao Duan

IMINET: Convolutional semi-siamese networks for sound search by vocal imitation

Visually informed multi-pitch analysis of string ensembles

Deep ranking: Triplet MatchNet for music metric learning

Retrieving sounds by vocal imitation recognition

Audio tonality mode classification without tonic annotations

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options