2008 2nd International Conference on Signal Processing and Communication Systems. ICSPCS'2008

chapter

Voice activity detection using AdaBoost with multi-frame information

T. Usukura, W. Mitsuhashi

2008 2nd International Conference on Signal Processing and Communication Systems > 1 - 8

A noise-robust scheme for voice activity detection (VAD) that employs a combination of intra- and inter-frame acoustic features is presented in this paper. Full-band energy and mel-frequency cepstrum coefficients (MFCCs) are calculated as intra-frame features, whereas the integrated bispectrum is estimated as an inter-frame feature. The parameters combined by intra- and inter-frame features are sorted...

chapter

A study of phonetic feature representations for SVM-based speaker verification

E. Merkley, B. Baker, R. Vogt, S. Sridharan

2008 2nd International Conference on Signal Processing and Communication Systems > 1 - 5

We investigate an alternative formulation of phonetic feature representations for SVM-based speaker verification. The new features are based on conditional likelihood representations rather than the joint-likelihood or bag-of-ngram calculations traditionally used. Conditional likelihoods are shown to be a more natural method of modelling phonetic information, and improve upon conventional joint likelihoods...

chapter

Speech Endpoint Detection Using Gradient Based Edge Detection Techniques

H. Ghaemmaghami, R. Vogt, S. Sridharan, M. Mason

2008 2nd International Conference on Signal Processing and Communication Systems > 1 - 8

This paper proposes a novel method for speech endpoint detection. The developed method utilises gradient-based edge detection algorithms, used in image processing, to detect boundaries of continuous speech in noisy conditions. It is simple and has low computational complexity. The accuracy of the proposed method was evaluated and compared to the ITU-T G.729 Annex-B voice activity detection (VAD) algorithm...

chapter

A visual front-end for a continuous pose-invariant lipreading system

P. Lucey, S. Sridharan

2008 2nd International Conference on Signal Processing and Communication Systems > 1 - 6

Having an audio-visual automatic speech recognition (AVASR) system which can recognise what a speaker says regardless of head position (i.e. left profile, front, right profile etc.) would be most useful, as it enables this technology to be used in a host of realistic applications such as mobile phone and in-vehicle speech recognition. A major hurdle in achieving this goal is in developing a visual...

chapter

FPGA implementation of spectral subtraction for in-car speech enhancement and recognition

J. Whittington, K. Deo, T. Kleinschmidt, M. Mason

2008 2nd International Conference on Signal Processing and Communication Systems > 1 - 8

The use of speech recognition in noisy environments requires the use of speech enhancement algorithms in order to improve recognition performance. Deploying these enhancement techniques requires significant engineering to ensure algorithms are realisable in electronic hardware. This paper describes the design decisions and process to port the popular spectral subtraction algorithm to a Virtex-4 field-programmable...

chapter

A pattern recognition system for environmental sound classification based on MFCCs and neural networks

F. Beritelli, R. Grasso

2008 2nd International Conference on Signal Processing and Communication Systems > 1 - 4

The paper proposes a study of a background noise classifier based on a pattern recognition approach using a neural network. The signals submitted to the neural network are characterised by means of a set of 12 MFCC (Mel frequency cepstral coefficient) parameters typically present in the front end of a mobile terminal. The performance of the classifier, evaluated in terms of percent misclassification,...

chapter

Adjusted training of HMM models for Slovak speech recognition system

J. Kacur

2008 2nd International Conference on Signal Processing and Communication Systems > 1 - 5

This article discusses the relevant training aspects for building robust and accurate HMM models for a large-vocabulary recognition system in Slovak. The MASPER training procedure was adopted as the basis for building the HMM models and applied to the Slovak MOBILDAT database.
