Search results

chapter

Improved cepstra minimum-mean-square-error noise reduction algorithm for robust speech recognition

Jinyu Li, Yan Huang, Yifan Gong

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4865 - 4869

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In the era of deep learning, although beam-forming multi-channel signal processing is still very helpful, it was reported that single-channel robust front-ends usually cannot benefit deep learning models because the layer-by-layer structure of deep learning models provides a feature extraction strategy that automatically derives powerful noise-resistant features from primitive raw data for senone...

chapter

On time-frequency mask estimation for MVDR beamforming with application in robust speech recognition

Xiong Xiao, Shengkui Zhao, Douglas L. Jones, Eng Siong Chng, more

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 3246 - 3250

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Acoustic beamforming has played a key role in the robust automatic speech recognition (ASR) applications. Accurate estimates of the speech and noise spatial covariance matrices (SCM) are crucial for successfully applying the minimum variance distortionless response (MVDR) beamforming. Reliable estimation of time-frequency (TF) masks can improve the estimation of the SCMs and significantly improve...

chapter

Channel estimation for crosstalk cancellation in wireless acoustic networks

Gema Pinero, Patrick A. Naylor

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 586 - 590

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper we deal with the estimation of the room impulse response (RIR) between each loudspeaker and each microphone of a wireless acoustic network of two nodes when used to implement a crosstalk canceller. The nodes of the network are commercial devices connected via standard wireless links, presenting low computational requirements and non-ideal synchronization between them. Moreover, the nodes...

chapter

Clock drift estimation and compensation for asynchronous impulse response measurements

Hannes Gamper

2017 Hands-free Speech Communications and Microphone Arrays (HSCMA) > 186 - 190

2017 Hands-free Speech Communications and Microphone Arrays (HSCMA)

The impulse response (IR) of an acoustic environment or audio device can be measured by recording its response to a known test signal. Ideally, the same digital clock should be used for playback and recording to ensure synchronous digital-to-analog and analog-to-digital conversion. When measuring the acoustic performance of a hardware device, be it for audio input to a device microphone or audio output...

chapter

Direction of arrival (DOA) estimation of a wideband acoustic source in multipath environment using spatial sparsity

Kandarp K. Patel, Mark L. Fowler

2017 IEEE Sensors Applications Symposium (SAS) > 1 - 5

2017 IEEE Sensors Applications Symposium (SAS)

This paper proposes a novel method for direction-of-arrival (DOA) estimation in azimuth based on spatial sparsity for a wideband acoustic signal using a uniform linear array. The performance of this method is compared with classic subspace based methods such as Root-MUSIC and ESPRIT. In the presence of a multipath reflection, the proposed spatial sparsity based method performs significantly better...

chapter

Microphone array signal processing for robot audition

Heinrich W. Lollmann, AlastairH. Moore, Patrick A. Naylor, Boaz Rafaely, more

2017 Hands-free Speech Communications and Microphone Arrays (HSCMA) > 51 - 55

2017 Hands-free Speech Communications and Microphone Arrays (HSCMA)

Robot audition for humanoid robots interacting naturally with humans in an unconstrained real-world environment is a hitherto unsolved challenge. The recorded microphone signals are usually distorted by background and interfering noise sources (speakers) as well as room reverberation. In addition, the movements of a robot and its actuators cause ego-noise which degrades the recorded signals significantly...

chapter

A Diffusion Strategy for the Multichannel Active Noise Control System in Distributed Network

Ju-Man Song, PooGyeon Park

2016 International Conference on Computational Science and Computational Intelligence (CSCI) > 659 - 664

2016 International Conference on Computational Science and Computational Intelligence (CSCI)

This paper introduces a diffusion strategy for the multichannel active noise control. The diffusion strategy is designed to reduce the computational complexity by distributes computations to all nodes of multichannel active noise control system. Thus, the multichannel filtered-x normalized least mean square algorithm, which is the simplest way for real active noise control environments is used as...

chapter

A landmark-based approach to automatic voice onset time estimation in stop-vowel sequences

Stephan R. Kuberski, Stephen J. Tobin, Adamantios I. Gafos

2016 IEEE Global Conference on Signal and Information Processing (GlobalSIP) > 60 - 64

2016 IEEE Global Conference on Signal and Information Processing (GlobalSIP)

In the field of phonetics, voice onset time (VOT) is a major parameter of human speech defining linguistic contrasts in voicing. In this article, a landmark-based method of automatic VOT estimation in acoustic signals is presented. The proposed technique is based on a combination of two landmark detection procedures for release burst onset and glottal activity detection. Robust release burst detection...

chapter

Improved prediction of the accent gap between speakers of English for individual-based clustering of World Englishes

Fumiya Shiozawa, Daisuke Saito, Nobuaki Minematsu

2016 IEEE Spoken Language Technology Workshop (SLT) > 129 - 135

2016 IEEE Spoken Language Technology Workshop (SLT)

The term of “World Englishes” describes the current state of English and one of their main characteristics is a large diversity of pronunciation, called accents. In our previous studies, we developed several techniques to realize effective clustering and visualization of the diversity. For this aim, the accent gap between two speakers has to be quantified independently of extra-linguistic factors...

chapter

Acoustic probing to estimate freshness of tomato

Hidetomo Kataoka, Takashi Ijiri, Jeremy White, Akira Hirabayashi

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 5

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

The freshness of vegetables attracts significant interest, because consumers will determine the way of cooking based on the maturity of the vegetable or select better vegetables in supermarkets based on the freshness information. This paper focuses on tomatoes, and reports our preliminary studies on acoustic probing techniques to estimate their storage term. We hit an acoustic probe that sweeps audible...

chapter

Voice conversion to emotional speech based on three-layered model in dimensional approach and parameterization of dynamic features in prosody

Yawen Xue, Yasuhiro Hamada, Masato Akagi

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 6

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

This paper proposes a system to convert neutral speech to emotional with controlled intensity of emotions. Most of previous researches considering synthesis of emotional voices used statistical or concatenative methods that can synthesize emotions in categorical emotional states such as joy, angry, sad, etc. While humans sometimes enhance or relieve emotional states and intensity during daily life,...

chapter

Performance estimation of spontaneous speech recognition using non-reference acoustic features

Ling Guo, Takeshi Yamada, Shoji Makino

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 4

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

To ensure a satisfactory QoE (Quality of Experience), it is essential to establish a method that can be used to efficiently investigate recognition performance for spontaneous speech. By using this method, it is allowed to monitor the recognition performance in providing speech recognition services. It can be also used as a reliability measure in speech dialogue systems. Previously, methods for estimating...

chapter

A new method for the estimation of time difference of arrival for localization of partial discharge sources using acoustic detection technique

R. Ghosh, B. Chatterjee, S. Dalai

2016 IEEE 7th Power India International Conference (PIICON) > 1 - 5

2016 IEEE 7th Power India International Conference (PIICON)

In an acoustic partial discharge (PD) detection system, estimation of time difference of arrival (TDOA) between acoustic signals arriving at a sensor array is an important criterion for accurate localization of PD sources inside a transformer. The localization accuracy can be improved by improving the accuracy of estimation of TDOA between sensors. The estimation of TDOA is a challenging task because...

chapter

Spatial aliasing suppression for pooled angular spectrum using a widely spaced microphone array

Zhi-yong Xu, Zhao Zhao

2016 IEEE 13th International Conference on Signal Processing (ICSP) > 443 - 447

2016 IEEE 13th International Conference on Signal Processing (ICSP)

A time-frequency pooled angular spectrum capable of suppressing spatial aliasing effectively is studied for a widely spaced microphone array to estimate multi-source directions of arrival (DOA) in a reverberant environment based on diffuse noise model and time-frequency sparsity of acoustic signals. By using constant false-alarm rate (CFAR) detection technique, only the high-valued elements very likely...

chapter

Variable step-size matrix for the improved multiband-structured subband adaptive filter

Yan Zhenhai, Yang Feiran, Yang Jun

2016 IEEE 13th International Conference on Signal Processing (ICSP) > 362 - 366

2016 IEEE 13th International Conference on Signal Processing (ICSP)

The improved multiband-structured subband adaptive filter (IMSAF) algorithm could enhance the convergence performance of multiband-structured subband adaptive filter algorithms and affine projection. However, the original IMSAF algorithm with a fixed step-size factor have to compromise between convergence rate and steady-state misalignment. A new IMSAF algorithm with variable step-size matrix (VSM)...

chapter

Non-field-of-view sound source localization using diffraction and reflection signals

Kuya Takami, Hangxin Liu, Tomonari Furukawa, Makoto Kumon, more

2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) > 157 - 162

2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

This paper describes a non-field-of-view (NFOV) localization approach for a mobile robot in an unknown environment based on an acoustic signal combined with the geometrical information from an optical sensor. The approach estimates the location of a target through the mobile robot's sensor observation frame, which consists of a combination of diffraction and reflection acoustic signals and a 3-D environment...

chapter

Position estimation of sound source on ground by multirotor helicopter with microphone array

Kai Washizaki, Mizuho Wakabayashi, Makoto Kumon

2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) > 1980 - 1985

2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

Multirotor helicopters are expected to be utilized various tasks including rescue missions and surveillance. For those missions, sensors are equipped with helicopters in order to recognize the environment, and auditory information is one of such information that can be utilized to find the target sound source even if it is occluded by objects. One of the difficulty comes from the fact that the noise...

chapter

Reduction of ultrasonic indoor localization infrastructure based on the use of graph information

D. Gualda, J. Urena, J.C. Garcia, J. Alcala, more

2016 International Conference on Indoor Positioning and Indoor Navigation (IPIN) > 1 - 6

2016 International Conference on Indoor Positioning and Indoor Navigation (IPIN)

This paper presents a constrained navigation on a Metric Description Graph (MDG) based on the use of a H-Infinity Filter (H-∞) including the restriction on the graph as a fictitious observation. The main goal is to reduce the number of the required ultrasonic beacons for covering an extensive indoor area. This reduction of the localization infrastructure involves an increment of the error in the estimation...

chapter

A new eigenvector-based 3D wideband acoustic DOA estimator

Ralph E. Hudson, Kung Yao

2016 IEEE International Symposium on Phased Array Systems and Technology (PAST) > 1 - 4

2016 IEEE International Symposium on Phased Array Systems and Technology (PAST)

Many multiple narrow-band source detection and DOA methods have been proposed. In the past, we have used an Approximate Maximum-Likelihood (AML) method needing considerable computational complexity for the detection and 3D DOA estimation of multiple broad-band sources. Now, we propose a novel eigen system-based array detection and DOA estimation of multiple broad-band sources with significantly reduced...

chapter

Lightning imaging with thunder using broadband direction-of-arrival estimation technique

Zhang Han, Zhao Chun, Gu Shanqiang, Feng Wanxing, more

2016 33rd International Conference on Lightning Protection (ICLP) > 1 - 7

2016 33rd International Conference on Lightning Protection (ICLP)

A single-station-based three-dimensional (3D) acoustic lighting mapping system comprising a microphone array has been developed and used for lightning observations, in which a new broadband direction-of-arrival (DOA) estimation techniques namely incoherent signal-subspace method are proposed for thunder signals in the far-field. Two cloud-to-ground (CG) flashes with highly branch channels recorded...

INFONA - science communication portal

Search results

Improved cepstra minimum-mean-square-error noise reduction algorithm for robust speech recognition

On time-frequency mask estimation for MVDR beamforming with application in robust speech recognition

Channel estimation for crosstalk cancellation in wireless acoustic networks

Clock drift estimation and compensation for asynchronous impulse response measurements

Direction of arrival (DOA) estimation of a wideband acoustic source in multipath environment using spatial sparsity

Microphone array signal processing for robot audition

A Diffusion Strategy for the Multichannel Active Noise Control System in Distributed Network

A landmark-based approach to automatic voice onset time estimation in stop-vowel sequences

Improved prediction of the accent gap between speakers of English for individual-based clustering of World Englishes

Acoustic probing to estimate freshness of tomato

Voice conversion to emotional speech based on three-layered model in dimensional approach and parameterization of dynamic features in prosody

Performance estimation of spontaneous speech recognition using non-reference acoustic features

A new method for the estimation of time difference of arrival for localization of partial discharge sources using acoustic detection technique

Spatial aliasing suppression for pooled angular spectrum using a widely spaced microphone array

Variable step-size matrix for the improved multiband-structured subband adaptive filter

Non-field-of-view sound source localization using diffraction and reflection signals

Position estimation of sound source on ground by multirotor helicopter with microphone array

Reduction of ultrasonic indoor localization infrastructure based on the use of graph information

A new eigenvector-based 3D wideband acoustic DOA estimator

Lightning imaging with thunder using broadband direction-of-arrival estimation technique

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options