Most recognition methods, which have been shown to be highly efficient under noise-free conditions, fail dramatically at S/N ratios around or below 10 dB. One consequence of these high noise levels is that most Begin-End Point Detectors fail to properly separate the speech segments from the noise segments. As a result, the speech recognition mechanisms have no clear boundary at which to start the processing...
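The endpoint-detection failure described above is easiest to see with a classic short-time-energy detector. The sketch below is a minimal illustration of that family of detectors, not the paper's method; the frame length, threshold, and toy signal are all assumptions chosen for clarity. In clean audio it recovers the speech boundaries exactly, whereas strong noise lifts silence frames above the threshold and the boundaries are lost.

```python
import numpy as np

def endpoint_detect(signal, frame_len=160, threshold_db=-30.0):
    """Energy-based begin/end point detection: mark frames whose
    short-time energy exceeds a threshold relative to the loudest frame."""
    n_frames = len(signal) // frame_len
    frames = signal[:n_frames * frame_len].reshape(n_frames, frame_len)
    energy = np.sum(frames ** 2, axis=1)
    # dB relative to the loudest frame; the floor avoids log(0) on silence
    energy_db = 10.0 * np.log10(np.maximum(energy, 1e-12) / energy.max())
    active = np.flatnonzero(energy_db > threshold_db)
    if active.size == 0:
        return None
    # begin/end as sample indices of the first and last active frame
    return active[0] * frame_len, (active[-1] + 1) * frame_len

# Toy example: 0.5 s silence, 1 s tone, 0.5 s silence at a 16 kHz rate
fs = 16000
t = np.arange(fs)
speech = 0.5 * np.sin(2 * np.pi * 440 * t / fs)
signal = np.concatenate([np.zeros(fs // 2), speech, np.zeros(fs // 2)])
begin, end = endpoint_detect(signal)
```

Adding broadband noise to `signal` at around 10 dB S/N pushes the silence frames toward the threshold, which is exactly the failure mode the abstract describes.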
We propose a spatial diffuseness feature for deep neural network (DNN)-based automatic speech recognition to improve recognition accuracy in reverberant and noisy environments. The feature is computed in real-time from multiple microphone signals without requiring knowledge or estimation of the direction of arrival, and represents the relative amount of diffuse noise in each time and frequency bin...
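The intuition behind a diffuseness feature can be illustrated with the inter-channel coherence of two microphone signals: a single directional source is highly coherent across microphones, while a diffuse field is not. The sketch below is a simplified long-term proxy, not the paper's feature (which is computed per time-frequency bin in real time); the FFT size, hop, and test signals are assumptions.

```python
import numpy as np

def diffuseness(x1, x2, n_fft=256, hop=128):
    """Coherence-based diffuseness proxy per frequency bin:
    1 - magnitude-squared coherence, estimated over all frames.
    A directional source gives values near 0; uncorrelated
    (diffuse-like) noise gives values near 1."""
    win = np.hanning(n_fft)
    starts = range(0, len(x1) - n_fft + 1, hop)
    X1 = np.array([np.fft.rfft(win * x1[i:i + n_fft]) for i in starts])
    X2 = np.array([np.fft.rfft(win * x2[i:i + n_fft]) for i in starts])
    s12 = np.mean(X1 * np.conj(X2), axis=0)   # cross-spectrum
    s11 = np.mean(np.abs(X1) ** 2, axis=0)    # auto-spectra
    s22 = np.mean(np.abs(X2) ** 2, axis=0)
    msc = np.abs(s12) ** 2 / (s11 * s22 + 1e-12)
    return 1.0 - msc

# Toy check: the same signal at both mics is fully coherent (directional),
# while two independent noise signals mimic a diffuse field.
rng = np.random.default_rng(0)
src = rng.standard_normal(16384)
coherent = diffuseness(src, src)
diffuse = diffuseness(rng.standard_normal(16384), rng.standard_normal(16384))
```

Note that this estimate needs no direction-of-arrival information, which matches the property the abstract emphasizes.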
Recent smartphones often have more than one microphone in order to perform noise reduction. Although research on speech enhancement is already exploiting this feature, robust speech recognition is still not benefiting from it. In this paper we propose two feature enhancement methods developed especially for the case of a smartphone with a dual microphone operating in an adverse acoustic environment...
In this paper, we introduce a newly-created corpus of whispered speech simultaneously recorded via a close-talking microphone and a non-audible murmur (NAM) microphone in both clean and noisy conditions. To benchmark the corpus, which has been freely released recently, experiments on automatic recognition of continuous whispered speech were conducted. When training and test conditions are matched,...
Noise generated by the motion of a robot degrades the quality of the desired sounds recorded by robot-embedded microphones. On top of that, a moving robot is also vulnerable to its loud fan noise, whose direction changes relative to the moving limbs on which the microphones are mounted. To tackle the non-stationary ego-motion noise and the direction changes of the fan noise, we propose an...
We describe a speech system for commanding robots in human-occupied outdoor military supply depots. To operate in such environments, the robots must be as easy to interact with as are humans, i.e. they must reliably understand ordinary spoken instructions, such as orders to move supplies, as well as commands and warnings, spoken or shouted from distances of tens of meters. These design goals preclude...
This paper describes a DSP integration of sound source localization (SSL) and a multi-channel Wiener filter (MWF). To develop a robot audition system, we integrated the SSL and MWF modules into a DSP system. SSL is a module that perceives the direction of a human user's call: it measures the time delay of arrival among microphones and estimates the direction of the sound source. It also post-processes the resulting...
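The time-delay-of-arrival step mentioned in the abstract can be sketched for the simplest case of two microphones: the delay is taken from the peak of the cross-correlation and converted to a far-field direction via sin(θ) = cτ/d. This is a generic illustration, not the paper's DSP implementation; the sampling rate, microphone spacing, and test signal are hypothetical.

```python
import numpy as np

def tdoa_samples(x1, x2):
    """Estimate the time delay of arrival (in samples) of x2 relative
    to x1 from the peak of the full cross-correlation."""
    corr = np.correlate(x2, x1, mode="full")
    return int(np.argmax(corr)) - (len(x1) - 1)

def doa_degrees(delay_samples, fs, mic_distance, c=343.0):
    """Far-field direction from a two-microphone delay:
    sin(theta) = c * tau / d, clipped to the valid range."""
    tau = delay_samples / fs
    s = np.clip(c * tau / mic_distance, -1.0, 1.0)
    return float(np.degrees(np.arcsin(s)))

# Toy example: white noise arriving 5 samples later at mic 2
rng = np.random.default_rng(0)
fs, d = 16000, 0.2          # 16 kHz, 20 cm spacing (assumed values)
src = rng.standard_normal(4096)
x1 = src
x2 = np.concatenate([np.zeros(5), src[:-5]])
delay = tdoa_samples(x1, x2)
angle = doa_degrees(delay, fs, d)
```

Real systems typically use the generalized cross-correlation with phase transform (GCC-PHAT) instead of the raw correlation, which is more robust in reverberation.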
The pioneering work on the `separation of speech from a mixture of acoustic sources' dates back to as early as the 1970s. Since then, two main approaches, the traditional approach using signal-processing techniques and the computational auditory scene analysis (CASA) approach using auditory-modeling methods, have been pursued concurrently by researchers seeking a solution to the problem of what is known as...
We present an overview of the data collection and transcription efforts for the COnversational Speech In Noisy Environments (COSINE) corpus. The corpus is a set of multi-party conversations recorded in real world environments with background noise that can be used to train noise-robust speech recognition systems. We explain the motivation for creating such a corpus and describe the resulting audio...
In this paper, we propose an acoustic-based head orientation estimation method using a microphone array mounted on a wheelchair, and apply it to a novel interface for controlling a powered wheelchair. The proposed interface does not require disabled people to wear any microphones or utter recognizable voice commands. By mounting the microphone array system on the wheelchair, our system can easily...
This paper explores the problems of speech recognition in a (sometimes) noisy environment. An adaptive acoustic beamformer based on the Griffiths-Jim method is proposed, together with a "hot-spot" within which speech is accepted and outside of which it is rejected; this geometrically defined boundary will be shown to give a certain amount of noise immunity and to improve the signal-to-noise ratio for the second stage,...
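A minimal two-channel version of the Griffiths-Jim structure (the generalized sidelobe canceller) can illustrate the adaptive beamformer: a fixed beamformer (channel average) passes broadside speech, a blocking matrix (channel difference) yields a noise-only reference, and an LMS filter subtracts the noise correlated with that reference. All signals and parameter values below are toy assumptions, not the paper's configuration.

```python
import numpy as np

def griffiths_jim_gsc(x1, x2, n_taps=16, mu=0.01):
    """Two-channel Griffiths-Jim generalized sidelobe canceller.
    The fixed beamformer (average) keeps broadside speech; the
    blocking branch (difference) cancels it, leaving a noise
    reference that an LMS filter subtracts from the main path."""
    d = 0.5 * (x1 + x2)            # fixed beamformer output
    b = x1 - x2                    # blocking-matrix (noise) output
    w = np.zeros(n_taps)
    buf = np.zeros(n_taps)
    out = np.zeros_like(d)
    for n in range(len(d)):
        buf = np.roll(buf, 1)
        buf[0] = b[n]
        e = d[n] - w @ buf         # enhanced sample = main - noise estimate
        w += mu * e * buf          # LMS weight update
        out[n] = e
    return out

# Toy example: broadside speech (identical at both mics) plus a noise
# component picked up by mic 1 only, so the difference channel is noise.
rng = np.random.default_rng(1)
N = 8000
speech = np.sin(2 * np.pi * 200 * np.arange(N) / 8000)
noise = rng.standard_normal(N)
x1 = speech + noise
x2 = speech.copy()
out = griffiths_jim_gsc(x1, x2)
```

Because broadside speech is identical on both channels, the blocking branch contains no speech by construction; this is also the structure's known weakness, since steering errors leak speech into the noise reference.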
In normal human communication, people face the speaker when listening and usually pay attention to the speaker's face. Therefore, in robot audition, the recognition of the front talker is critical for smooth interactions. This paper presents an enhanced speech detection method for a humanoid robot that can separate and recognize speech signals originating from the front even in noisy home environments...