Yuma Koizumi

chapter

Optimizing acoustic feature extractor for anomalous sound detection based on Neyman-Pearson lemma

Yuma Koizumi, Shoichiro Saito, Hisashi Uematsu, Noboru Harada

2017 25th European Signal Processing Conference (EUSIPCO), pp. 698-702

We propose a method for optimizing an acoustic feature extractor for anomalous sound detection (ASD). Most ASD systems adopt outlier-detection techniques because it is difficult to collect a massive amount of anomalous sound data. To improve the performance of such outlier-detection-based ASD, it is essential to extract a set of efficient acoustic features that is suitable for identifying anomalous...

article

Informative Acoustic Feature Selection to Maximize Mutual Information for Collecting Target Sources

Yuma Koizumi, Kenta Niwa, Yusuke Hioka, Kazunori Kobayashi, et al.

IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 25, no. 4, pp. 768-779, 2017

An informative acoustic-feature-selection method for collecting target sources in noisy environments is proposed. Wiener filtering is a powerful framework for sound-source enhancement. For Wiener-filter estimation, statistical-mapping functions, such as deep neural network based or Gaussian mixture model based mappings, have been used. In this framework, it is essential to find informative acoustic...

chapter

DNN-based source enhancement self-optimized by reinforcement learning using sound quality measurements

Yuma Koizumi, Kenta Niwa, Yusuke Hioka, Kazunori Kobayashi, et al.

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 81-85

We investigated whether a deep neural network (DNN)-based source enhancement function can be self-optimized by reinforcement learning (RL). The use of a DNN is a powerful approach to describing the relationship between two sets of variables and can be useful for source enhancement function design. By training the DNN using a huge amount of training data, the sound quality of output signals is improved...

chapter

Supervised source enhancement composed of nonnegative auto-encoders and complementarity subtraction

Kenta Niwa, Yuma Koizumi, Tomoko Kawase, Kazunori Kobayashi, et al.

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 266-270

A method for constructing deep neural networks (DNNs) for accurate supervised source enhancement is proposed. Attempts were made in previous studies to estimate the power spectral densities (PSDs) of sound sources, which are used to estimate Wiener filters for source enhancement, from the output of multiple beamformings using DNNs. Although performance improved, it was not possible to guarantee accurate...

chapter

On relationships between amplitude and phase of short-time Fourier transform

Suehiro Shimauchi, Shinya Kudo, Yuma Koizumi, Ken'ichi Furuya

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 676-680

The relationships between the amplitude and phase of the short-time Fourier transform (STFT) are investigated. By choosing the Gaussian window for the STFT, we reveal that the group delay and instantaneous frequency of each signal segment, both of which are derived from the phase by definition, can also be explicitly linked with the amplitude. As a result, the amplitude and phase can also be linked...

chapter

Pinpoint extraction of distant sound source based on DNN mapping from multiple beamforming outputs to prior SNR

Kenta Niwa, Yuma Koizumi, Tomoko Kawase, Kazunori Kobayashi, et al.

2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 435-439

We propose a method for estimating the prior signal-to-noise ratio (SNR), which is used for calculating the Wiener filter for distant sound source extraction, from output signals of beamforming using statistical mapping based on the deep neural network (DNN). Since informative features to estimate the prior SNR are included in multiple beamforming outputs, the SNR can be accurately estimated by this...

chapter

Integrated approach of feature extraction and sound source enhancement based on maximization of mutual information

Yuma Koizumi, Kenta Niwa, Yusuke Hioka, Kazunori Kobayashi, et al.

2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 186-190

We investigated informative acoustic feature extraction based on dimension reduction for collecting target sources on a noisy sports field. Although a Wiener filter is often used for sound source enhancement, it is difficult to accurately design the Wiener filter by simply using spatial cues because the noise on a sports field (e.g., cheering from spectators) arrives from the same direction as that...

chapter

Informative acoustic feature selection on microphone array wiener filtering for collecting target source on sports ground

Yuma Koizumi, Kenta Niwa, Yusuke Hioka, Kazunori Kobayashi, et al.

2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), pp. 1-5

We propose a Wiener filter design method for collecting target sources on a noisy sports field. Because the noise on a sports field, e.g., cheering from the audience, arrives from the same direction as that of the targeted source, it is difficult to accurately design a Wiener filter by simply using spatial cues. This study focused on a combination of spatial cues and acoustic feature modeling. The...

chapter

Effective approach to character input for novice BCI users

Yuma Koizumi, Yuki Ijichi, Hisaya Tanaka, Ayumi Otera, et al.

2015 10th Asia-Pacific Symposium on Information and Telecommunication Technologies (APSITT), pp. 1-3

A brain-computer interface (BCI) character input experiment focusing on participants’ BCI intelligibility was performed. In theory, a BCI can be operated by anyone if cognitive activity is possible. However, individual differences clearly occur in practice. Therefore, we supposed that this difference was related to BCI intelligibility. In a previous study, BCI experts and BCI novice users were compared...

chapter

Intra-note segmentation via sticky HMM with DP emission

Yuma Koizumi, Katunobu Itou

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 2144-2148

This paper presents an intra-note segmentation method for monophonic recordings based on acoustic feature variation; each musical note is separated into onset, steady, and offset states. The task of intra-note segmentation from audio signals is to detect change points in the acoustic features. In the proposed method, a Markov process is assumed for the state transitions, and the time-varying acoustic feature is represented...

INFONA - science communication portal

Search results for: Yuma Koizumi
