Automatic voiceprint recognition, based on the human speech signal, serves many salient practical applications. A number of studies have been undertaken on the basis of normal speech. This research aims to develop an automatic voiceprint recognition system based on emotional speech signals in the Indonesian language. The study is limited to four different people with speeches of four distinctive emotional...
For speech recognition systems, an improved multi-base neural network speech recognition model is proposed to address the long learning time and slow convergence rate of deep neural networks. However, the improved model introduces a large number of parameters during training, causing it to overfit on the test set, which degrades its generalization ability and the...
Emotional decoding ability has repeatedly been shown to be impaired in alcoholic patients. The present study aims to extend previous findings on emotion deficits by examining auditory stimuli recognition ability in alcoholism. Twenty-six alcohol-dependent patients, abstinent from alcohol for at least four weeks, were compared to 26 controls matched for sex, age and socioeconomic level. Subjects...
This paper presents a review of a few notable speech recognition models reported in the last decade. Firstly, the models are categorized into sparse models, learning models and domain-specific models. Subsequently, the characteristics of the models are examined in terms of speech constraints, algorithmic constraints and performance constraints. The performance of these models reported in...
This paper suggests an MFCC-based analysis technique for audio signals, applied to speech classification. The proposed work uses multi-resolution (wavelet) analysis and spectral-analysis-based features for feature extraction. The approach combines a number of features, such as Mel Frequency Cepstral Coefficients (MFCC) and FFT coefficients, with wavelet-based features. In addition, accuracy...
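As a rough illustration of the MFCC pipeline this abstract refers to (not the authors' implementation), the following NumPy sketch computes MFCC features from a raw waveform. The frame length, hop size, filterbank size and coefficient count are conventional assumed defaults, not values taken from the paper:

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mfcc(signal, sr=16000, n_fft=512, n_mels=26, n_ceps=13):
    """Return an (n_frames, n_ceps) matrix of MFCC features."""
    # Frame into 25 ms windows with a 10 ms hop, Hamming-windowed
    frame_len, hop = int(0.025 * sr), int(0.010 * sr)
    n_frames = 1 + max(0, (len(signal) - frame_len) // hop)
    frames = np.stack([signal[i * hop : i * hop + frame_len]
                       for i in range(n_frames)])
    frames = frames * np.hamming(frame_len)
    # Power spectrum via FFT
    spec = np.abs(np.fft.rfft(frames, n_fft)) ** 2
    # Triangular mel-spaced filterbank
    mel_pts = np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_mels + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_pts) / sr).astype(int)
    fbank = np.zeros((n_mels, n_fft // 2 + 1))
    for m in range(1, n_mels + 1):
        l, c, r = bins[m - 1], bins[m], bins[m + 1]
        for k in range(l, c):
            fbank[m - 1, k] = (k - l) / max(c - l, 1)
        for k in range(c, r):
            fbank[m - 1, k] = (r - k) / max(r - c, 1)
    log_energy = np.log(spec @ fbank.T + 1e-10)
    # DCT-II to decorrelate log filterbank energies; keep first n_ceps
    n = np.arange(n_mels)
    dct = np.cos(np.pi * np.outer(np.arange(n_ceps), 2 * n + 1) / (2 * n_mels))
    return log_energy @ dct.T
```

One second of 16 kHz audio yields 98 frames of 13 coefficients; the same log-filterbank stage could be swapped for wavelet subband energies, which is roughly where the paper's wavelet-based features would enter.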
This paper introduces a fully automated Arabic phone recognition system based on a 15-point Enhanced Wavelet Packets Best Tree Encoding (EWPBTE) speech feature. WPBTE is enhanced by adding an energy component; the method, implemented in Matlab, improves recognizer accuracy by 65 %, which is the main contribution of this paper. EWPBTE...
Automatic recognition of emotions in speech has attracted the attention of the research community in recent years. Some of its most relevant proposed applications are in call centers. In these scenarios the speech is distorted by compression algorithms, and the effects of such distortion on the performance of automatic emotion recognition systems must be assessed. In this study these effects...
The paper deals with the problem of improving speech recognition by combining the outputs of several different recognizers. We present results obtained by experimenting with different classification methods suitable for combining the outputs of different speech recognizers. The evaluated methods are: k-Nearest Neighbors (KNN), Linear Discriminant Analysis (LDA), Quadratic Discriminant...
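The simplest baseline for combining recognizer outputs, against which classifier-based combiners like those above are usually compared, is a weighted majority vote over the per-recognizer hypotheses. A minimal sketch (the function name and weighting scheme are illustrative assumptions, not from the paper):

```python
from collections import Counter

def combine_hypotheses(hypotheses, weights=None):
    """Weighted majority vote over word hypotheses from several recognizers.

    hypotheses: one recognized word per recognizer, e.g. ["seven", "seven", "eleven"]
    weights:    optional per-recognizer reliability weights (default: equal)
    """
    weights = weights or [1.0] * len(hypotheses)
    votes = Counter()
    for hyp, w in zip(hypotheses, weights):
        votes[hyp] += w
    # Return the hypothesis with the largest accumulated weight
    return votes.most_common(1)[0][0]
```

A trained combiner (KNN, LDA, etc.) generalizes this by learning, from recognizer confidence scores, when to trust which recognizer instead of using fixed weights.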
This paper deals with live subtitling of TV ice-hockey commentaries using automatic speech recognition technology. Two methods are presented - a direct transcription of the TV program and a re-speaking approach. Practical issues emerging from the real subtitling system are introduced and solutions are proposed. Acoustic and language modelling are described, as well as modifications of the existing live...
In low-resource Automatic Speech Recognition (ASR), one usually resorts to the Statistical Machine Translation (SMT) technique to learn transform rules for refining the grapheme lexicon. In doing so, we face two challenges. One is to generate grapheme sequences from the training data as targets, which are paired with the original transcripts to train SMT models; the other is to effectively prune the learned...
This paper introduces a new back-end classifier for a speech recognition system that is based on artificial life (ALife). The ALife species being used for classification purposes are called wains, which were developed using the Créatúr framework. The speech recognition task used in the evaluation of the new classifier is that of isolated digit recognition. Performance of the proposed back-end classifier...
There are many popular algorithms for recognizing the human voice. A good algorithm not only yields high recognition accuracy but is also robust to noise. Several experiments are conducted in this research to verify the performance of a neuro-fuzzy system in recognizing the human voice. Eight Thai words, recorded in different environments, syllables and pronunciations, are used as a data set...
This paper presents experiments on feature selection for emotional speech classification. There are 152 features used in this experiment. Minimum redundancy maximum relevance (mRMR) is applied as the feature selection method. The experiments are constructed from two corpora: Interactive Emotional Dyadic Motion Capture (IEMOCAP) and Emotional Tagged Corpus on Lakorn (EMOLA), which...
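The mRMR criterion named above greedily picks features that are maximally relevant to the label while minimally redundant with features already chosen. The sketch below illustrates that greedy loop; note it uses absolute Pearson correlation as a simple stand-in for the mutual information used in standard mRMR, so it is an assumption-laden illustration, not the paper's method:

```python
import numpy as np

def mrmr_select(X, y, k):
    """Greedy mRMR-style selection of k column indices from X.

    Relevance  = |corr(feature, y)|
    Redundancy = mean |corr(feature, already-selected features)|
    Score      = relevance - redundancy  (maximized at each step)
    """
    def corr(a, b):
        a, b = a - a.mean(), b - b.mean()
        denom = np.sqrt((a @ a) * (b @ b))
        return abs(a @ b) / denom if denom > 0 else 0.0

    n_feat = X.shape[1]
    relevance = np.array([corr(X[:, j], y) for j in range(n_feat)])
    selected = [int(np.argmax(relevance))]  # start with the most relevant
    while len(selected) < k:
        best, best_score = None, -np.inf
        for j in range(n_feat):
            if j in selected:
                continue
            redundancy = np.mean([corr(X[:, j], X[:, s]) for s in selected])
            score = relevance[j] - redundancy
            if score > best_score:
                best, best_score = j, score
        selected.append(best)
    return selected
```

The key behavior is that an exact duplicate of an already-selected feature scores poorly despite high relevance, so the 152-feature set is pruned toward complementary features.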
Vietnamese is a syllable-based tonal language in which the tone used in syllable pronunciation carries important information about the meaning. In this paper, we investigate several approaches to incorporating tone into an acoustic model. We propose 3 basic strategies: a) phoneme-based, b) vowel-based, and c) rhyme-based. Each can be modified so that we obtain 15 different schemes that...
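The three strategies differ only in which acoustic-model units carry the tone tag. A toy sketch of that distinction (the function, the simplified vowel inventory, and the tag format are all illustrative assumptions, not the paper's unit definitions):

```python
def tone_units(syllable_phonemes, tone, scheme="vowel"):
    """Attach a tone tag to a syllable's phonemes under three schemes:
    'phoneme' tags every phoneme, 'vowel' tags only vowels,
    'rhyme' tags everything from the first vowel onward (the rhyme)."""
    VOWELS = {"a", "e", "i", "o", "u"}  # simplified inventory, illustration only
    if scheme == "phoneme":
        return [f"{p}_{tone}" for p in syllable_phonemes]
    if scheme == "vowel":
        return [f"{p}_{tone}" if p in VOWELS else p for p in syllable_phonemes]
    # rhyme-based: find the first vowel and tag it plus the coda
    idx = next((i for i, p in enumerate(syllable_phonemes) if p in VOWELS),
               len(syllable_phonemes))
    return syllable_phonemes[:idx] + [f"{p}_{tone}" for p in syllable_phonemes[idx:]]
```

Tagging more units multiplies the phone inventory (and hence model size and data sparsity), which is the trade-off the 15 schemes presumably explore.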
In this paper, a microcontroller-based automatic door opening system is developed. The system is built as a speech recognition circuit in which a programmable (i.e., trained) voice is used as the reference for identifying authorized and unauthorized persons. On the software side, a MATLAB GUI has been used to record the authorized voice and to synthesize the recorded...
People who have lost their walking and moving ability need to use a wheelchair. In cases of losing complete control of the upper and lower limbs, intelligent solutions are required to ensure the autonomy and independence of those patients. The intelligent application must be designed carefully to use the available self-controlled electrical and physical activity of the patient's body like sound, Electromyogram...
In speech development research, it is important to know how speech acoustic features vary as a function of age, and at what age the variability and magnitude of acoustic features start to exhibit adult-like patterns. During the first few years of life, a child's speech changes from the cries and babbles of an infant to the adult-like words and phrases of a young child. A number of acoustic studies observed...
Speech recognition is widely applied in speech-to-text and speech-to-emotion tasks, to make gadgets and computers easier to use or to help people with hearing disabilities. Feature extraction is one of the significant steps determining the performance of speech recognition, so proper selection is essential. In this paper, we analyze feature extraction methods that can perform well for Indonesian speech...
An energy-efficient speech extraction (SE) processor is proposed for robust speech recognition in head-mounted display (HMD) systems. Speech extraction is essential for robust speech recognition in noisy environments. For low-latency speech extraction, FastSE is proposed to overcome the 50x more complex cICA-based selection process, resulting in <2 ms SE latency. Moreover, a reinforced FastSE...
Parallel Phoneme Recognition followed by Language Modelling (PPRLM) systems currently provide state-of-the-art language identification performance on conversational telephone speech. In this paper an innovative method for tonal and non-tonal language pre-classification using prosodic information is reported. Our motivation is to improve recognition accuracy and reduce CPU run-time while...