This paper addresses the problem of automatically recognizing emotional states from speech recordings, especially those emotions indicating that a person's life or physical integrity is at risk. The paper compares the performance of two different systems: one fed with speech signals recorded directly from people (whole spectrum) and another in which the speech signals are recorded...
To ensure a satisfactory QoE (Quality of Experience) and facilitate system design in speech recognition services, it is essential to establish a method that can be used to efficiently investigate recognition performance in different noise environments. Previously, we proposed a performance estimation method using the PESQ (Perceptual Evaluation of Speech Quality) as a spectral distortion measure....
This paper reports on the application of the dimensional emotion model in automatic emotional speech recognition. Using the perceptron rule in combination with acoustic features, an approach to speech-based emotion recognition is introduced, which can classify the utterance with respect to the valence-arousal (V-A) dimensions of its emotional content. The mapping of 5 discrete emotion classes onto...
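The abstract above names the perceptron rule but not the paper's features or training setup. The sketch below is an illustrative assumption only: one perceptron per valence-arousal dimension, trained with the classic perceptron update, so that the pair of output signs places an utterance in a quadrant of the V-A plane. The toy 2-D "acoustic" features and labels are invented for the example.

```python
# Hypothetical sketch of perceptron-rule classification on one V-A dimension.
# The features, labels, and hyperparameters are illustrative, not the paper's.

def train_perceptron(samples, labels, epochs=20, lr=0.1):
    """Classic perceptron rule: w += lr * (y - yhat) * x, labels in {-1, +1}."""
    w = [0.0] * (len(samples[0]) + 1)  # last entry acts as the bias weight
    for _ in range(epochs):
        for x, y in zip(samples, labels):
            xb = list(x) + [1.0]  # append constant input for the bias
            yhat = 1 if sum(wi * xi for wi, xi in zip(w, xb)) > 0 else -1
            if yhat != y:
                w = [wi + lr * (y - yhat) * xi for wi, xi in zip(w, xb)]
    return w

def predict(w, x):
    """Sign of the linear score: +1 (e.g. high valence) or -1 (low valence)."""
    xb = list(x) + [1.0]
    return 1 if sum(wi * xi for wi, xi in zip(w, xb)) > 0 else -1

# Toy 2-D feature vectors; +1 = high valence, -1 = low valence (made up).
X = [(1.0, 0.2), (0.8, 0.1), (-0.9, -0.3), (-1.1, 0.0)]
y = [1, 1, -1, -1]
w_val = train_perceptron(X, y)
```

A second perceptron trained on arousal labels would give the other coordinate; the (valence sign, arousal sign) pair then selects one of the four V-A quadrants, to which discrete emotion classes can be mapped.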
Research in the speech recognition area has made considerable progress, aided by the tremendous growth of technology. Speech rate is one of the important factors affecting speech recognition accuracy. In the present work, training is performed on different speech rates (Normal, Slow and Fast), and testing is also done on different rates of speech. The error rate will increase when the major...
Accent in speech is defined as a distinctive mode of pronunciation that is unique to a geographical region. In a similar way, we define accent in handwriting as distinctive writing characteristics that are unique to a group of people sharing a common native script. In other words, we postulate that a group of people with a common native script will share certain traits in their handwriting that can...
All of the previous syllable-based Automatic Speech Recognizers (ASRs) for the Amharic language were built by training a separate acoustic model for each of the 196 distinctly pronounced Consonant-Vowel (CV) syllables. In this paper, we demonstrate that a smaller number of acoustic models is sufficient to build a syllable-based, speaker-independent, continuous Amharic ASR. It is built for weather...
A phoneme recognition system based on Discrete Wavelet Transforms (DWTs) and Support Vector Machines (SVMs) is designed for multi-speaker continuous speech environments. Phonemes are divided into frames, and DWTs are applied to obtain fixed-dimensional feature vectors. For the multiclass SVM, the one-against-one method with the RBF kernel was implemented. To further improve the accuracies obtained,...
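The abstract does not state which wavelet family or feature design the paper uses. As a hedged illustration of how a DWT can turn variable-length frames into fixed-dimensional vectors, the sketch below assumes a Haar wavelet and uses the energy of each detail band plus the final approximation energy; the SVM stage would typically be handled by a library and is omitted.

```python
# Illustrative only: Haar DWT band-energy features; the paper's actual
# wavelet family and feature vector design may differ.

def haar_dwt(signal):
    """One level of the Haar DWT: pairwise averages (approximation)
    and pairwise differences (detail)."""
    approx = [(signal[i] + signal[i + 1]) / 2 for i in range(0, len(signal) - 1, 2)]
    detail = [(signal[i] - signal[i + 1]) / 2 for i in range(0, len(signal) - 1, 2)]
    return approx, detail

def dwt_features(frame, levels=3):
    """Fixed-dimensional vector: energy of each detail band at every
    decomposition level, plus the energy of the final approximation."""
    feats = []
    a = frame
    for _ in range(levels):
        a, d = haar_dwt(a)
        feats.append(sum(x * x for x in d))
    feats.append(sum(x * x for x in a))
    return feats
```

However long the input frame is (as long as it supports the chosen number of levels), the output always has `levels + 1` entries, which is what makes it usable as an SVM input.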
This paper presents the design of a digital hardware implementation based on Support Vector Machines (SVMs) for the task of multi-speaker phoneme recognition. The one-against-one multiclass SVM method with the Radial Basis Function (RBF) kernel was considered. Furthermore, a priority scheme was included in the architecture in order to forecast the three most likely phonemes. The designed system...
The aim of a speech training aid is to enhance the language abilities of hearing-impaired children. The traditional approach, using photographs and teachers' gestures, has been shown to be inefficient. This study proposes a Speech Training Aid System (STAS) based on C# and Flash technology to instruct hearing-impaired children on how to improve their language abilities. An experimental...
Most emergency phones are activated by a button press. However, pressing the button requires the person needing help to physically touch the phone, which is often infeasible in practice: 1) it is hard for someone unfamiliar with the environment to locate an emergency phone quickly, and 2) even if the person knows where the phone is, she/he has to run a certain distance to reach it. Therefore,...
This paper proposes a novel approach to enhance the speech features in noise robustness for speech recognition. In the proposed approach, the speech feature time sequence is first converted into the modulation spectral domain via discrete Fourier transform (DFT). The magnitude part of the modulation spectrum is decomposed into overlapped non-uniform sub-band segments, and then each sub-band segment...
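The abstract's first two steps (a DFT over a feature's time trajectory, then splitting the magnitude spectrum into overlapped, non-uniform sub-bands) can be sketched as below. This is a minimal assumption-laden illustration: the feature trajectory, the DFT size, and the sub-band edges are all made up, and the paper's subsequent per-band processing is not shown.

```python
# Illustrative sketch: modulation spectrum of one feature trajectory and
# non-uniform, overlapping sub-band slicing. All parameters are invented.
import cmath

def modulation_spectrum(feature_seq):
    """Magnitude of the DFT of one feature coefficient's trajectory
    across frames (bins 0 .. N/2)."""
    N = len(feature_seq)
    return [abs(sum(x * cmath.exp(-2j * cmath.pi * k * n / N)
                    for n, x in enumerate(feature_seq)))
            for k in range(N // 2 + 1)]

def subband_segments(magnitude, edges):
    """Slice the magnitude spectrum into segments given (lo, hi) bin edges;
    overlapping or non-uniform edges are allowed."""
    return [magnitude[lo:hi] for lo, hi in edges]

mag = modulation_spectrum([1.0, 1.0, 1.0, 1.0])  # constant trajectory: DC only
```

A constant trajectory concentrates all energy in the DC bin, which is why modulation-domain methods often treat the low-frequency bands separately from the rest.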
Current speech-controlled human computer interaction is purely based on spoken information. For a successful interaction, additional information such as the individual skills, preferences and actual affective state of the user are often mandatory. The most challenging of these additional inputs is the affective state, since affective cues are in general expressed very sparsely. The problem can be...
In this study, we used an improved version of the classical KNN algorithm, which assigns to each parameter of the feature vectors a weight according to its performance in the classification process. We obtained emotion recognition rates of around 65–67% for the Romanian language, on the SROL database, which are comparable with the results for other languages, with non-professional...
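The per-feature weighting idea described above can be sketched as a weighted Euclidean distance inside KNN. This is a generic illustration, not the paper's implementation: how the weights are learned from per-feature classification performance is not specified in the abstract, so they are simply passed in here.

```python
# Generic weighted-KNN sketch; the weight-learning step from the abstract
# is assumed to happen elsewhere and the weights are taken as given.
import math
from collections import Counter

def weighted_knn(train, labels, weights, query, k=3):
    """Classify `query` by majority vote among the k nearest training
    vectors under a per-feature weighted Euclidean distance."""
    dists = [(math.sqrt(sum(w * (a - b) ** 2
                            for w, a, b in zip(weights, x, query))), y)
             for x, y in zip(train, labels)]
    dists.sort(key=lambda t: t[0])
    votes = Counter(y for _, y in dists[:k])
    return votes.most_common(1)[0][0]
```

Setting a feature's weight near zero effectively removes it from the distance metric, which is how poorly performing parameters are down-weighted.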
In this paper, we propose a lyrics-based classification approach. It estimates the mood of a song using only the intro and refrain parts of its lyrics. In general, the intro part creates the specific atmosphere of a song, and the chorus part is the strongest part of the song. The proposed method detects important features significantly associated with the mood of songs from both parts. By calculating the...
Recently, speaker recognition systems have attracted great interest from researchers for both software and hardware solutions. Different technologies have been adopted to implement speaker recognition systems that perform with optimal response time and acceptable accuracy. Research is in progress to provide highly durable and precise recognition systems that can be embedded into critical implementation...
This paper presents a novel approach to phonetic-based language identification (LID). Motivated by the assumption underlying phonotactic LID that accounting for permissible phone sequences supports the process of distinguishing one language from another, this paper presents a novel approach based on the automatic identification of phone sequences of different lengths unique to a language, which are...
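The core idea above (finding phone sequences of different lengths that occur in one language but no other) can be illustrated with a simple n-gram set difference. This is a hedged sketch under assumed inputs: phone transcriptions are given as lists of phone symbols, and the n-gram range is arbitrary; the paper's actual identification pipeline is not reproduced here.

```python
# Illustrative sketch: per-language phone n-grams that appear in no other
# language's corpus. The toy corpora and n-gram range are assumptions.

def phone_ngrams(phones, n):
    """All contiguous phone sequences of length n in one utterance."""
    return {tuple(phones[i:i + n]) for i in range(len(phones) - n + 1)}

def unique_sequences(corpora, n_range=(2, 4)):
    """Map each language to the set of its phone n-grams (lengths in
    n_range, inclusive) that occur in no other language's corpus."""
    grams = {lang: set() for lang in corpora}
    for lang, utterances in corpora.items():
        for utt in utterances:
            for n in range(n_range[0], n_range[1] + 1):
                grams[lang] |= phone_ngrams(utt, n)
    return {lang: g - set().union(*(grams[o] for o in grams if o != lang))
            for lang, g in grams.items()}
```

Sequences shared by several languages drop out of every language's unique set, so only the discriminative phonotactic patterns remain as LID evidence.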
The speech of cleft palate (CP) patients has typical characteristics: hypernasality and low speech intelligibility are its primary features. In this work, an algorithm for the automatic evaluation of different levels of hypernasality and speech intelligibility in CP speech is proposed, in order to provide an objective tool for speech therapists. To identify different levels of hypernasality,...
Accent is a special trait of human speech that can convey some information about a speaker's background. At the same time, it is one of the factors that most profoundly affect the intelligibility and performance of automatic speech recognition systems (ASRs) if not handled carefully. Normally, an accent recognizer in the preceding stage offers a subsystem training or adaptation strategy to improve ASRs. Formant analysis...
To overcome the problems of conventional speech recognition (e.g. noise interference and private-data loss), many researchers have investigated alternative approaches. Electromyography (EMG) signals from the muscles producing speech have been used to replace the voiced signal. Similarly, we aim to develop EMG-based speech recognition for the Thai language. Tone is an important characteristic of this...
Isolated speech recognition systems (ISRS) implemented using microprocessors, digital signal processors, and FPGAs have been reported in the literature. In this paper, the study and implementation of an ISRS using the Cypress Programmable System on Chip (PSoC) are presented. For the implementation, a PSoC 5 containing the ARM Cortex-M3 CPU is used. Recognition performance is studied using three...