Search results for: M. Fujimoto

Items from 1 to 6 out of 6 results

chapter

Real-time meeting recognition and understanding using distant microphones and omni-directional camera

T Hori, S Araki, T Yoshioka, M Fujimoto, et al.

2010 IEEE Spoken Language Technology Workshop > 424 - 429

2010 IEEE Spoken Language Technology Workshop (SLT 2010)

This paper presents our newly developed real-time meeting analyzer for monitoring conversations in an ongoing group meeting. The goal of the system is to automatically recognize “who is speaking what” in an online manner for meeting assistance. Our system continuously captures the utterances and the face pose of each speaker using a distant microphone array and an omni-directional camera at the center...

chapter

Online meeting recognizer with multichannel speaker diarization

S Araki, T Hori, M Fujimoto, S Watanabe, et al.

2010 Conference Record of the Forty Fourth Asilomar Conference on Signals, Systems and Computers > 1697 - 1701

2010 44th Asilomar Conference on Signals, Systems and Computers

We present our newly developed real-time conversation analyzer for group meetings. The goal of the system is to estimate automatically “who speaks when and what” in an online manner. In our system, “who speaks when” information is first obtained by estimating the directions of arrival (DOAs) of signals. Then, “who speaks what” is estimated with our automatic speech recognition (ASR) system, after...

chapter

A voice activity detection based on the adaptive integration of multiple speech features and a signal decision scheme

M. Fujimoto, K. Ishizuka, T. Nakatani

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 4441 - 4444

ICASSP 2008. IEEE International Conference on Acoustics, Speech and Signal Processing

This paper addresses the problem of voice activity detection (VAD) in noisy environments. The VAD method proposed in this paper integrates multiple speech features and a signal decision scheme, namely the speech periodic to aperiodic component ratio and a switching Kalman filter. The integration is carried out by using the weighted sum of likelihoods outputted from each VAD (stream). The stream weight...

chapter

Sequential Non-Stationary Noise Tracking Using Particle Filtering with Switching Dynamical System

M. Fujimoto, S. Nakamura

2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings > 1 > I

2006 IEEE International Conference on Acoustics, Speech, and Signal Processing

This paper addresses a speech recognition problem in non-stationary noise environments: the estimation of noise sequences. To solve this problem, we present a particle filter-based sequential noise estimation method for the front-end processing of speech recognition. In the proposed method, the particle filter is defined by a dynamical system based on Polyak averaging and feedback. We also introduce...

chapter

Hands-free speech recognition and communication on PDAs using microphone array technology

W. Herbordt, T. Horiuchi, M. Fujimoto, T. Jitsuhiro, et al.

IEEE Workshop on Automatic Speech Recognition and Understanding, 2005. > 302 - 307

2005 IEEE Workshop on Automatic Speech Recognition and Understanding

In this paper, a personal digital assistant (PDA) for hands-free speech recognition and communication with a microphone array mounted on the PDA is presented. An outlier-robust generalized sidelobe canceller (RGSC) and a minimum mean-squared error (MMSE) estimator for log Mel-spectral energy coefficients using a Gaussian mixture model (GMM) for clean speech are implemented in real-time and evaluated...

chapter

Particle filtering and Polyak averaging-based non-stationary noise tracking for ASR in noise

M. Fujimoto, S. Nakamura

IEEE Workshop on Automatic Speech Recognition and Understanding, 2005. > 337 - 342

2005 IEEE Workshop on Automatic Speech Recognition and Understanding

This paper addresses a speech recognition problem in non-stationary noise environments: the estimation of noise sequences. To solve this problem, we present a particle filter-based sequential noise estimation method for front-end processing of speech recognition in noise. In the proposed method, a noise sequence is estimated in three stages: a sequential importance sampling step, a residual resampling...

Filter options

Keywords:
SPEECH RECOGNITION

INFONA - science communication portal
