Search results

Items from 1 to 20 out of 24 results

chapter

A Research on Mongolian Standard Speech Testing System Based on Comparisons of Language Features

Mongh Jaya

2010 International Conference on E-Product E-Service and E-Entertainment > 1 - 3

2010 International Conference on E-Product E-Service and E-Entertainment (ICEEE 2010)

After subjects take part in the Mongolian Standard Speech Test, they have some divergent opinions about their tested results. In order to eliminate these divergences of the test results, a new window is opened to employ a kind of software to assist the testing of the Mongolian Standard Speech based on the comparisons of language features. Specifically speaking, after the subject's speech data are...

chapter

Translating the sign of dumb person using ARM processor

C Nijusekar, A Brindhu Kumari

2010 INTERNATIONAL CONFERENCE ON COMMUNICATION CONTROL AND COMPUTING TECHNOLOGIES > 508 - 513

2010 International Conference on Communication Control and Computing Technologies

The goal of the blabbering voice-to-Speech Translation research is to enable real-time, interpersonal communication via natural spoken language for people who do not share a common language. The Multilingual Automatic blabbering voice-to-Speech Translator (MASTOR) system is the first Speech-to-Speech system that allows for bidirectional (blabbering voice Tamil) free-form speech input and output. The...

chapter

An adaptive noise cancellation scheme using particle swarm optimization algorithm

Upal Mahbub, Celia Shahnaz, Shaikh Anowarul Fattah

2010 INTERNATIONAL CONFERENCE ON COMMUNICATION CONTROL AND COMPUTING TECHNOLOGIES > 683 - 686

2010 International Conference on Communication Control and Computing Technologies

This paper deals with the problem of noise cancellation of speech signals in an acoustic environment. In this regard, generally, different adaptive filter algorithms are employed, many of them may lack the flexibility of controlling the convergence rate, range of variation of filter coefficients, and consistency in error within tolerance limit. In order to achieve these desirable attributes as well...

chapter

Isolated Malayalam digit recogntion using Support Vector Machines

Cini Kurian, A Firoz Shah, Kannan Balakrishnan

2010 INTERNATIONAL CONFERENCE ON COMMUNICATION CONTROL AND COMPUTING TECHNOLOGIES > 692 - 695

2010 International Conference on Communication Control and Computing Technologies

Voice is the natural communication system used by all beings, human beings in particular. Understanding and recognizing human uttered voice for various applications is the core technology of "information" age. Automatic speech recognition has wide spread applications in real life situations. Here speech recognition of Malayalam isolated digit is created by using Mel Frequency Cepstral Coefficients...

chapter

Realtime speech processing for automated cue-generation

Ibrahim Patel, Y Srinivas Rao

2010 INTERNATIONAL CONFERENCE ON COMMUNICATION CONTROL AND COMPUTING TECHNOLOGIES > 592 - 596

2010 International Conference on Communication Control and Computing Technologies

Here have been great efforts made in the development of automated Instrumentation system for speech recognition (AISR) to provide a two-way communication between deaf and vocal people. This system performance achievable with the output of current real-time speech recognition systems would be extremely poor relative to normal speech reception. An alternate application of AISR technology to aid the...

chapter

Real-Time Blind Source Separation of Speech Signals with Adaptive Particle Swarm Optimization

Taohua Luo

2010 International Symposium on Intelligence Information Processing and Trusted Computing > 246 - 249

2010 International Symposium on Intelligence Information Processing and Trusted Computing (IPTC 2010)

To solve the problems of slow convergence and low computational precision of blind source separation(BSS) based on traditional particle swarm optimization(PSO), a novel approach-based adaptive particle swarm optimization for real-time blind source separation is proposed, in which the observations are linear convolutive mixtures of statistically independent speech sources. It combines the independent...

chapter

Research on key parameters of speech denoising algorithm based on wavelet packet transform

Ligang Du, Ru Xu, Fang Xu, Deqing Wang, more

2010 3rd International Conference on Computer Science and Information Technology > 6 > 551 - 556

2010 3rd IEEE International Conference on Computer Science and Information Technology (ICCSIT 2010)

Wavelet packet transform is an efficient method in speech denoising processing. In this paper, we research on various wavelet packet basis, decomposition layers, values of the threshold and threshold functions which are key parameters in wavelet packet denoising. Furthermore, we adopt three methods to evaluate the effects of denoised speech, including signal-noise-ratio(SNR), wavelet spectrum distortion...

chapter

A voice activity detection system based on FPGA

Junhee Jung, Seunghun Jin, Dongkyun Kim, Hyung Soon Kim, more

ICCAS 2010 > 2304 - 2308

2010 International Conference on Control, Automation and Systems (ICCAS 2010)

In this paper, we present a FPGA-based voice activity detection system. DoV (Degree of Voicing) and QSNR (Quantile Signal-to-Noise Ratio) are used as parameters of the VAD algorithm of the proposed system. All VAD system functions are implemented using a dedicated parallel architecture, including signal capturing, DoV processing module and QSNR processing module. The system uses several DPRAMs (Dual...

chapter

Speaker Identification Based on Robust AM-FM Features

M.S. Deshpande, R.S. Holambe

2009 Second International Conference on Emerging Trends in Engineering&Technology > 880 - 884

2009 2nd International Conference on Emerging Trends in Engineering and Technology (ICETET 2009)

Linear source-filter models have been widely used by researchers as a front-end for speaker identification systems. It uses the cepstral features derived from the power spectrum of the speech signal. But it is also well known that a significant part of the acoustic information cannot be modeled by the linear source-filter model, and thus, the need for nonlinear features becomes apparent. In this paper,...

chapter

A support vector machine classifier of emotion from voice and facial expression data

S. Das, A. Halder, P. Bhowmik, A. Chakraborty, more

2009 World Congress on Nature&Biologically Inspired Computing (NaBIC) > 1010 - 1015

2009 World Congress on Nature & Biologically Inspired Computing (NaBIC 2009)

The paper provides a novel approach to emotion recognition from facial expression and voice of subjects. The subjects are asked to manifest their emotional exposure in both facial expression and voice, while uttering a given sentence. Facial features including mouth-opening, eye-opening, eyebrow-constriction, and voice features including, first three formants: F₁, F₂, and F₃, and respective powers...

chapter

Design and development of a frame based MT system for English-to-ISL

K. Anuja, S. Suryapriya, S.M. Idicula

2009 World Congress on Nature&Biologically Inspired Computing (NaBIC) > 1382 - 1387

2009 World Congress on Nature & Biologically Inspired Computing (NaBIC 2009)

This paper presents the design and development of a frame based approach for speech to sign language machine translation system in the domain of railways and banking. This work aims to utilize the capability of Artificial intelligence for the improvement of physically challenged, deaf-mute people. Our work concentrates on the sign language used by the deaf community of Indian subcontinent which is...

chapter

Time-frequency feature extraction from spectrograms and wavelet packets with application to automatic stress and emotion classification in speech

Ling He, M. Lech, N.C. Maddage, N.B. Allen

2009 7th International Conference on Information, Communications and Signal Processing (ICICS) > 1 - 5

2009 7th International Conference on Information, Communications & Signal Processing (ICICS)

Three new methods of feature extraction based on time-frequency analysis of speech are presented and compared. In the first approach, speech spectrograms were passed through a bank of 12 log-Gabor filters and the outputs are averaged. In the second approach, the spectrograms were sub-divided into ERB frequency bands and the average energy for each band is calculated. In the third approach, wavelet...

chapter

A non-uniform subband approach to speech-based cognitive load classification

Phu Ngoc Le, E. Ambikairajah, E.H.C. Choi, J. Epps

2009 7th International Conference on Information, Communications and Signal Processing (ICICS) > 1 - 5

2009 7th International Conference on Information, Communications & Signal Processing (ICICS)

Speech has recently been recognized as an attractive method for the measurement of cognitive load. Current speech-based cognitive load measurement systems utilize acoustic features derived from auditory-motivated frequency scales. This paper aims to investigate the distribution of speech information specific to cognitive load discrimination as a function of frequency. We found that this distribution...

chapter

An instantaneous amplitude model based speech coder

Cong Yu, Gang Li, Chaogeng Huang

2009 7th International Conference on Information, Communications and Signal Processing (ICICS) > 1 - 5

2009 7th International Conference on Information, Communications & Signal Processing (ICICS)

In this paper, a new algorithm for speech coding is proposed. This algorithm is based a revised sinusoidal model, in which each component is represented with two instantaneous amplitudes and a frequency. This model avoids the difficulty in estimating the highly nonlinear phases and allows one to optimize the amplitudes once the frequencies are estimated. Simulations indicate that the proposed model...

chapter

Estimation of instants of significant excitation from speech signal using temporal phase periodicity

N. Sripriya, P. Vijayalakshmi, C.A. Kumar, T. Nagarajan

TENCON 2009 - 2009 IEEE Region 10 Conference > 1 - 4

TENCON 2009. 2009 IEEE Region 10 Conference

Voiced speech is produced by excitation of the vocal tract system with the quasiperiodic vibrations of the vocal folds at the glottis. These excitations have become significantly stronger when the vocal folds are fully opened or about to be closed. In this work, the focus is on estimating these instants of significant excitation using temporal phase periodicity present in the speech signal. Assuming...

chapter

Selective pole modification-based technique for the analysis and detection of hypernasality

P. Vijayalakshmi, T. Nagarajan, V. Jayanthan Ra

TENCON 2009 - 2009 IEEE Region 10 Conference > 1 - 5

TENCON 2009. 2009 IEEE Region 10 Conference

Inadequate velopharyngeal closure, due to structural or neurological problems, allows air to pass through the nasal cavity leading to introduction of inappropriate nasal resonances during speech production resulting in hypernasal speech. Our previous work on the acoustic analysis of hypernasal speech using group delay function for the detection of hypernasality showed stable effects of vowel nasalization...

chapter

Recent trends and challenges in speech-separation systems research — A tutorial review

K.S. Ananthakrishnan, K. Dogancay

TENCON 2009 - 2009 IEEE Region 10 Conference > 1 - 6

TENCON 2009. 2009 IEEE Region 10 Conference

The pioneering work on the `separation of speech from mixture of acoustic sources' dates back to as early as 70s and since then, two main approaches namely traditional approach using signal-processing techniques and computational auditory scene analysis (CASA) approach using auditory-modeling methods have been concurrently attempted by researchers to find solution to the problem of what is known as...

chapter

Development of Chinese whispered database for speaker verification

Chenghui Gong, Heming Zhao, Yanlei Wang, Wang Min, more

2009 Asia Pacific Conference on Postgraduate Research in Microelectronics&Electronics (PrimeAsia) > 197 - 200

2009 Asia Pacific Conference on Postgraduate Research in Microelectronics & Electronics (PrimeAsia)

A database for speaker verification of Chinese whispered speech is established. It is based on the assumption that whispers are easily affected by the environmental and speakers' emotional factors. The manuscript for the corpus considers the structure of Chinese syllables, including all the categories of the initials, finals and tones. 8 typical channels are applied to collect the speech, mainly stated...

chapter

On the use of stress information in speech for speaker recognition

M.L. Narayana, S.K. Kopparapu

TENCON 2009 - 2009 IEEE Region 10 Conference > 1 - 4

TENCON 2009. 2009 IEEE Region 10 Conference

The performance of a speaker recognition system decreases when the speaker is under stress or emotion. In this paper we explore and identify a mechanism that enables use of inherent stress-in-speech or speaking style information present in speech of a person as additional cues for speaker recognition. We quantify the the inherent stress present in the speech of a speaker mainly using 3 features, namely,...

chapter

On the use of cepstral coefficients and Multilayer Perceptron Networks for vocal fold edema diagnosis

J.V.M.L. Marinus, J.M. Fechine, H.M. Gomes, S.C. Costa

2009 9th International Conference on Information Technology and Applications in Biomedicine > 1 - 4

2009 9th International Conference on Information Technology and Applications in Biomedicine (ITAB 2009)

Laryngeal diseases affect many professionals who use their voices as the main working tool, such as teachers, singers, radio and TV presenters, among others. Advanced diagnosis techniques of these diseases are typically invasive, causing much discomfort to the patient. In recent years techniques of digital voice processing have been investigated to obtain non-invasive systems to aid the diagnosis...

Content availability:
None
Keywords:
SPEECH

Publication date

Set your own date range

Keywords

SPEECH RECOGNITION (11)
FEATURE EXTRACTION (8)
ACOUSTICS (4)
NATURAL LANGUAGE PROCESSING (4)
NOISE (4)
SPEAKER RECOGNITION (4)
ACOUSTIC SIGNAL PROCESSING (3)
ARTIFICIAL NEURAL NETWORKS (3)
BANDWIDTH (3)
EMOTION RECOGNITION (3)
HIDDEN MARKOV MODELS (3)
MEL FREQUENCY CEPSTRAL COEFFICIENT (3)
PATTERN CLASSIFICATION (3)
TRAINING (3)
ACCURACY (2)
ADAPTIVE SIGNAL PROCESSING (2)
ALGORITHM DESIGN AND ANALYSIS (2)
ANIMATION (2)
CEPSTRAL ANALYSIS (2)
DATA MINING (2)
DATABASES (2)
DISCRETE FOURIER TRANSFORMS (2)
FACE RECOGNITION (2)
FILTER BANK (2)
FILTERING THEORY (2)
FREQUENCY ESTIMATION (2)
GABOR FILTERS (2)
GAUSSIAN PROCESSES (2)
HANDICAPPED AIDS (2)
LANGUAGE TRANSLATION (2)
MEDICAL SIGNAL PROCESSING (2)
PARTICLE SWARM OPTIMISATION (2)
QUANTIZATION (2)
SIGNAL DENOISING (2)
SIGNAL TO NOISE RATIO (2)
SOFTWARE (2)
SPEECH CODING (2)
SPEECH SIGNAL (2)
SPEECH SIGNALS (2)
SPEECH SYNTHESIS (2)
STRESS (2)
SUPPORT VECTOR MACHINE (2)
SUPPORT VECTOR MACHINES (2)
TESTING (2)
3D ANIMATION (1)
3D VIRTUAL HUMAN (1)
ACOUSTIC CI SIMULATIONS (1)
ACOUSTIC FEATURE EXTRACTION (1)
ACOUSTIC FEATURES (1)
ACOUSTIC FILTERS (1)
ACOUSTIC SOURCES (1)
ACOUSTIC SPEECH SIGNAL (1)
ADAPTIVE (1)
ADAPTIVE FILTER (1)
ADAPTIVE FILTER ALGORITHMS (1)
ADAPTIVE FILTERS (1)
ADAPTIVE NOISE CANCELLATION SCHEME (1)
ADAPTIVE PARTICLE SWARM OPTIMIZATION (1)
ADAPTIVE-FILTERING (1)
AISR (1)
AMPLITUDE (1)
AMPLITUDE MODULATION (1)
ANALYSIS WINDOW SAMPLE (1)
ARM PROCESSOR (1)
ARTICULATORY FEATURES (1)
ARTICULATORY TRAJECTORIES (1)
ARTIFICIAL INTELLIGENCE (1)
AUDIO DATABASE (1)
AUDIO DATABASES (1)
AUDITORY MODEL BASED CI SYSTEM (1)
AUDITORY SYSTEM (1)
AUDITORY-MODEL (1)
AUDITORY-MODELING METHODS (1)
AUTOMATED CUE GENERATION (1)
AUTOMATED INSTRUMENTATION SYSTEM (1)
AUTOMATIC SPEECH RECOGNITION (1)
AVERAGE MAGNITUDE DIFFERENCE FUNCTION (1)
AVERAGE MAGNITUDE DIFFERENT FUNCTION-BASED PITCH FEATURE EXTRACTOR (1)
AWARDS ACTIVITIES (1)
BAND PASS FILTERS (1)
BIOCOMMUNICATIONS (1)
BIT RATE (1)
BLIND SOURCE SEPARATION (1)
CASA (1)
CEPSTRAL COEFFICIENT (1)
CEPSTRAL COEFFICIENTS (1)
CHINESE SYLLABLE (1)
CHINESE WHISPERED DATABASE (1)
CLASSIFICATION (1)
CLASSIFICATION RATE (1)
CLASSIFICATION TASKS (1)
CLASSIFIER (1)
CLOSE-SET SPEAKER IDENTIFICATION (1)
CO-CHANNEL (1)
COCHLEAR IMPLANT MODELS (1)
COCHLEAR IMPLANTS (1)
COCKTAIL PARTY EFFECT (1)
COCKTAIL-PARTY EFFFECT (1)
more

INFONA - science communication portal

Search results

A Research on Mongolian Standard Speech Testing System Based on Comparisons of Language Features

Translating the sign of dumb person using ARM processor

An adaptive noise cancellation scheme using particle swarm optimization algorithm

Isolated Malayalam digit recogntion using Support Vector Machines

Realtime speech processing for automated cue-generation

Real-Time Blind Source Separation of Speech Signals with Adaptive Particle Swarm Optimization

Research on key parameters of speech denoising algorithm based on wavelet packet transform

A voice activity detection system based on FPGA

Speaker Identification Based on Robust AM-FM Features

A support vector machine classifier of emotion from voice and facial expression data

Design and development of a frame based MT system for English-to-ISL

Time-frequency feature extraction from spectrograms and wavelet packets with application to automatic stress and emotion classification in speech

A non-uniform subband approach to speech-based cognitive load classification

An instantaneous amplitude model based speech coder

Estimation of instants of significant excitation from speech signal using temporal phase periodicity

Selective pole modification-based technique for the analysis and detection of hypernasality

Recent trends and challenges in speech-separation systems research — A tutorial review

Development of Chinese whispered database for speaker verification

On the use of stress information in speech for speaker recognition

On the use of cepstral coefficients and Multilayer Perceptron Networks for vocal fold edema diagnosis

Filter options

Publication date

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options