Wyniki wyszukiwania

Pozycje od 1 do 20 spośród 970 wyników

Poprzednia

Następna

rozdział

Speech based emotion recognition in Tamil and Telugu using LPCC and hurst parameters — A comparitive study using KNN and ANN classifiers

S. Renjith, K. G. Manju

2017 International Conference on Circuit ,Power and Computing Technologies (ICCPCT) > 1 - 6

2017 International Conference on Circuit ,Power and Computing Technologies (ICCPCT)

Speech based emotion recognition finds numerous applications in automated speech services such as interactive voice recognition systems. It has great implications in investigative application like lie detectors, in medical applications such as diagnosis of mental depression etc. This work aims at developing a feature based emotion recognition system. The speech recordings with the emotions-anger,...

rozdział

Determining the voiceprint recognition on the basis of emotional speech signal: Indonesia language

Kanyadian Idananta, Kristianus Oktriono

2017 3rd International Conference on Information Management (ICIM) > 388 - 392

2017 3rd International Conference on Information Management (ICIM)

Automatic voiceprint recognition, posited on human speech signal, serves many salient practical applications. A number of studies are undertaken on the basis of normal speech. This research intends to develop automatic voiceprint recognition system on the basis of emotion speech signal in Indonesia language. The study is limited to four different people with speeches of four distinctive emotional...

rozdział

Automatic speech recognition models: A characteristic and performance review

U. G. Patil, S. D. Shirbahadurkar, A. N. Paithane

2016 International Conference on Computing Communication Control and automation (ICCUBEA) > 1 - 7

2016 International Conference on Computing Communication Control and automation (ICCUBEA)

This paper presents a review on few notable speech recognition models that are reported in the last decade. Firstly, the models are categorized into sparse models, learning models and domain - specific models. Subsequently, the characteristics of the models have been observed using speech constraints, algorithmic constraints and performance constraints. The performance of these models reported in...

rozdział

Noise robust speech recognition system using Mel cepstral and genetic algorithm

Garg Mamta, Arora Ajat Shatru, Gupta Savita

2016 International Conference on Electrical, Electronics, and Optimization Techniques (ICEEOT) > 3151 - 3155

2016 International Conference on Electrical, Electronics, and Optimization Techniques (ICEEOT)

This paper suggested a technique based on MFCC analysis for audio signals with speech classification application. The proposed work used multi-resolution (wavelet) analysis and spectral analysis based features for feature extraction. The proposed approach uses a no. of features like Mel Frequency Cepstral Coefficient (MFCC), and FFT Coefficients combined with wavelet based features. In addition, accuracy...

rozdział

Automatic speech annotation based on enhanced wavelet Packets Best Tree Encoding (EWPBTE) feature

Mohamed Hassan Mohamed, Ashraf Mohamed Ali Hassan, N.M. Hussein Hassan

2016 International Conference on Electrical, Electronics, and Optimization Techniques (ICEEOT) > 2611 - 2616

2016 International Conference on Electrical, Electronics, and Optimization Techniques (ICEEOT)

This paper aimed at introducing a completely automated Arabic phone recognition system based on Enhanced Wavelet Packets Best Tree Encoding (EWPBTE) 15-point speech feature. The process of enhancing of WPBTE is provided by adding energy component to WPBTE, which is implemented in Matlab software and makes an enhancement of 65 % to recognizer accuracy which is the most contribution in this paper. EWPBTE...

rozdział

Automatic emotion recognition in compressed speech using acoustic and non-linear features

N. Garcia, J.C Vasquez-Correa, J.D Arias-Londono, J.F Vargas-Bonilla, więcej

2015 20th Symposium on Signal Processing, Images and Computer Vision (STSIVA) > 1 - 7

2015 20th Symposium on Signal Processing, Images and Computer Vision (STSIVA)

Automatic recognition of emotions in speech has attracted the attention of the research community in recent years. Some of the most relevant proposed applications of it are in call-centers. In these scenarios the speech is distorted by compression algorithms. The effects of such distortion on the performance of systems for automatic recognition of emotions must be assessed. In this study these effects...

rozdział

Noise impact assessment on the accuracy of the determination of speaker’s gender by using method of the cumulant coefficients

Kostiantyn Pylypenko, Arkadiy Prodeus

2015 XI International Conference on Perspective Technologies and Methods in MEMS Design (MEMSTECH) > 102 - 106

2015 XI International Conference on Perspective Technologies and Methods in MEMS Design (MEMSTECH

A new method of classification of a speaker’s gender based on cumulant coefficients is proposed. The effect of an additive noise and measurement error of classification signs on accuracy of classification is analyzed. The expediency of construction of an adaptive system of classification operating with considering of masking of a speech signal by noise is shown. Comparison of the proposed method of...

rozdział

Evaluation of methods to combine different speech recognizers

Tomas Rasymas, Vytautas Rudzionis

2015 Federated Conference on Computer Science and Information Systems (FedCSIS) > 1043 - 1047

2015 Federated Conference on Computer Science and Information Systems (FedCSIS)

The paper deals with the problem of improving speech recognition by combining outputs of several different recognizers. We are presenting our results obtained by experimenting with different classification methods which are suitable to combine outputs of different speech recognizers. Methods which were evaluated are: k-Nearest neighbors (KNN), Linear Discriminant Analysis (LDA), Quadratic Discriminant...

rozdział

Part-of-speech labeling for Reuters database

R. Cretulescu, A. David, D. Morariu, L. Vintan

2015 19th International Conference on System Theory, Control and Computing (ICSTCC) > 117 - 122

2015 19th International Conference on System Theory, Control and Computing (ICSTCC)

Even if the Vector Space Model used for document representation in information retrieval systems integrates a small quantity of knowledge it continues to be used due to its computational cost, speed execution and simplicity. We try to improve this document representation by adding some syntactic information such as the parts of speech. In this paper, we have evaluated three different tagging algorithms...

rozdział

Dynamic feature selection for detecting Parkinson's disease through voice signal

Meilin Su, Keh-Shih Chuang

2015 IEEE MTT-S 2015 International Microwave Workshop Series on RF and Wireless Technologies for Biomedical and Healthcare Applications (IMWS-BIO) > 148 - 149

2015 IEEE MTT-S 2015 International Microwave Workshop Series on RF and Wireless Technologies for Biomedical and Healthcare Applications (IMWS-BIO)

Parkinson's disease (PD) is a disorder of the central nervous system and about 89% of the people with PD suffering from speech and voice disorders. In this paper, we adopted a dynamic feature selection based on fuzzy entropy measures for speech pattern classification of Parkinson's diseases. To investigate the effect of feature selection, Linear Discriminant Analysis (LDA) was applied to distinguish...

rozdział

A learning-based approach for Romanian syllabification and stress assignment

Diana Balc, Anamaria Beleiu, Rodica Potolea, Camelia Lemnaru

2015 IEEE International Conference on Intelligent Computer Communication and Processing (ICCP) > 37 - 42

2015 IEEE International Conference on Intelligent Computer Communication and Processing (ICCP)

This paper tackles the Romanian syllabification and stress assignment problems, and proposes an efficient machine learning based solution. We show that by designing the appropriate feature sets for each specific problem, learning algorithms achieve satisfactory accuracy rates for both problems (∼92% for syllabification, ∼85% for stress assignment), even for relatively small training set sizes. We...

rozdział

On the use of EMD for automatic newborn cry segmentation

Lina Abou-Abbas, Leila Montazeri, Christian Gargour, Chakib Tadj

2015 International Conference on Advances in Biomedical Engineering (ICABME) > 262 - 265

2015 International Conference on Advances in Biomedical Engineering (ICABME)

Cry segmentation is an essential preprocessing step in any infant crying diagnosis system. Besides crying sounds consisting of expiration phases followed by short periods of inspiration episodes, each recording of newborn cries also includes silence sections as well as other sounds such as speech of caregivers, noise and sound of medical equipments. This paper is devoted to a newly developed Empirical...

rozdział

A novel feature selection based on Tibetan grammar for Tibetan text classification

Tao Jiang, Hongzhi Yu

2015 6th IEEE International Conference on Software Engineering and Service Science (ICSESS) > 445 - 448

2015 6th IEEE International Conference on Software Engineering and Service Science (ICSESS)

Feature selection is a strategy that aims at making text classifiers more efficient and accurate. In this paper, we proposed a novel feature selection method based on Tibetan grammar for Tibetan classification. Tibetan language express grammatical meaning through the function words and word order, and the function word has large proportions. By analyzing the Tibetan grammar and distribution of part...

rozdział

Towards live subtitling of TV ice-hockey commentary

Ales Prazak, Josef V. Psutka, Josef Psutka, Zdenek Loose

2013 International Conference on Signal Processing and Multimedia Applications (SIGMAP) > 151 - 155

2013 International Conference on Signal Processing and Multimedia Applications (SIGMAP)

This paper deals with live subtitling of TV ice-hockey commentaries using automatic speech recognition technology. Two methods are presented - a direct transcription of a TV program and a re-speaking approach. Practical issues emerging from the real subtitling system are introduced and their solutions are proposed. Acoustic and language modelling is described as well as modifications of existing live...

rozdział

Glottal pathology discrimination using ANN and SVM

Ashwini Visave, Pramod Kachare, Amutha Jeyakumar, Alice Cheeran, więcej

2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 1377 - 1381

2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

Use of modern technological advances in real-time biomedical analysis is very crucial. Current work focuses on glottal pathology discrimination based on non-invasive speech analysis techniques. Primary set back in developing such method is irregular performance depreciation of several state of the art acoustic features. To excuse such problems, we have used glottal to noise excitation ratio, which...

rozdział

A hybrid Parts Of Speech tagger for Malayalam language

Anisha Aziz T, Sunitha C

2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 1502 - 1507

2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

Parts of speech tagging is an important research topic in Natural Language Processing research are. Since it is one among the first steps of any natural language processing (NLP) techniques such as machine translation, if any error happens for tagging the same will repeat in the whole NLP process. So far works had been done on POS tagging based on SVM, MBLP, HMM, Ngram. All of these methods were not...

rozdział

Reducing morpho-phonetic confusion in sub-word based Uyghur ASR

Mijit Ablimit, Askar Hamdulla, Akbar Pattar

2015 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP) > 348 - 352

2015 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP)

Sub-word units like morphemes are selected as the lexicon for highly inflectional languages, as they can provide better coverage and a smaller vocabulary size. However, short units shrink the context of statistical models, prone to morpho-phonetic changes, and not always outperform the word based model. When sequence of units are merged or split, unit boundaries are phonetically harmonized in the...

rozdział

Automatic evaluation of resonance and articulation disorders in cleft palate speech

Ling He, Jie Tan, HuaQing Hao, Ming Tang, więcej

2015 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP) > 358 - 362

2015 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP)

The evaluation of cleft palate (CP) speech is a critical clinical treatment. The most typical characteristics of CP speech include hypernasality and consonant misarticulation. Currently, the evaluation of CP speech is carried out by experienced speech therapists. It strongly depends on their clinical experience and subjective judgment. This work aims to propose an automatic evaluation system of resonance...

rozdział

On statistical machine translation method for lexicon refinement in speech recognition

Haihua Xu, Xiong Xiao, Eng-Siong Chng, Haizhou Li

2015 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP) > 25 - 29

2015 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP)

In low resource Automatic Speech Recognition (ASR), one usually resorts to the Statistical Machine Translation (SMT) technique to learn transform rules to refine grapheme lexicon. To do this, we face two challenges. One is to generate grapheme sequences from the training data as the targets, which is paired with the original transcripts to train SMT models; the other is to effectively prune the learned...

rozdział

Sentiment analysis from product reviews using SentiWordNet as lexical resource

Alexandra Cernian, Valentin Sgarciu, Bogdan Martin

2015 7th International Conference on Electronics, Computers and Artificial Intelligence (ECAI) > WE-15 - WE-18

2015 7th International Conference on Electronics, Computers and Artificial Intelligence (ECAI)

In the current social, technological and economic context, customers make their decisions based mostly on the opinion of other consumers. On the other side, companies need quick feedback from their customers in order to adapt to their needs in real time. The effective connection between these two aspects relies on opinion mining tools, which automatically process consumers' reviews and opinions about...

Poprzednia

Następna

Opcje filtrowania

Słowa kluczowe:
ACCURACY
SPEECH

Data publikacji

Ustaw własny zakres dat

Dostępność treści

Dostępna (958)
Brak (12)

Słowa kluczowe

SPEECH RECOGNITION (465)
FEATURE EXTRACTION (332)
HIDDEN MARKOV MODELS (261)
TRAINING (241)
SPEECH PROCESSING (186)
ACOUSTICS (159)
DATABASES (139)
MEL FREQUENCY CEPSTRAL COEFFICIENT (132)
SUPPORT VECTOR MACHINES (119)
NOISE (98)
SPEAKER RECOGNITION (89)
DATA MINING (85)
NATURAL LANGUAGE PROCESSING (83)
EMOTION RECOGNITION (76)
ARTIFICIAL NEURAL NETWORKS (62)
ESTIMATION (60)
CLASSIFICATION ALGORITHMS (57)
SIGNAL TO NOISE RATIO (53)
AUTOMATIC SPEECH RECOGNITION (51)
COMPUTATIONAL MODELING (48)
VECTORS (48)
CORRELATION (45)
NOISE MEASUREMENT (44)
HUMANS (40)
CEPSTRAL ANALYSIS (39)
EDUCATIONAL INSTITUTIONS (39)
ALGORITHM DESIGN AND ANALYSIS (38)
MATHEMATICAL MODEL (38)
PATTERN CLASSIFICATION (38)
SIGNAL PROCESSING (38)
SPEAKER IDENTIFICATION (37)
LEARNING (ARTIFICIAL INTELLIGENCE) (36)
ROBUSTNESS (36)
TAGGING (36)
TESTING (36)
DECODING (35)
SPEECH CODING (35)
TRAINING DATA (35)
COMPUTERS (34)
GAUSSIAN PROCESSES (34)
MFCC (34)
ADAPTATION MODEL (33)
DATA MODELS (33)
CONFERENCES (32)
CONTEXT (32)
SPEECH SYNTHESIS (32)
HIDDEN MARKOV MODEL (31)
SPEECH ENHANCEMENT (31)
KERNEL (29)
MICROPHONES (29)
VISUALIZATION (29)
DICTIONARIES (27)
INDEXES (27)
SUPPORT VECTOR MACHINE (27)
TRANSFORMS (27)
EQUATIONS (25)
SIGNAL PROCESSING ALGORITHMS (25)
TEXT ANALYSIS (25)
AUDIO SIGNAL PROCESSING (24)
GMM (24)
SVM (24)
VOCABULARY (24)
CLASSIFICATION (23)
ENTROPY (23)
GAUSSIAN MIXTURE MODEL (23)
MACHINE LEARNING (23)
PRINCIPAL COMPONENT ANALYSIS (23)
STATISTICAL ANALYSIS (23)
ACOUSTIC SIGNAL PROCESSING (22)
OPTIMIZATION (22)
PROBABILITY (22)
SPEECH ANALYSIS (21)
ANALYTICAL MODELS (20)
COMPLEXITY THEORY (20)
SIGNAL CLASSIFICATION (20)
TIME FREQUENCY ANALYSIS (20)
DECISION TREES (19)
DELAY (19)
HMM (19)
MAXIMUM LIKELIHOOD ESTIMATION (19)
PATTERN RECOGNITION (19)
SEMANTICS (19)
SUPPORT VECTOR MACHINE CLASSIFICATION (19)
ELECTRONIC MAIL (18)
ERROR ANALYSIS (18)
LABELING (18)
REAL TIME SYSTEMS (18)
ROBOTS (18)
ADAPTATION MODELS (17)
FACE (17)
FILTERING (17)
HARMONIC ANALYSIS (17)
MUSIC (17)
NEURAL NETWORKS (17)
STRESS (17)
DETECTORS (16)
INFORMATION RETRIEVAL (16)
NATURAL LANGUAGES (16)
więcej

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania

Dodaj adresata

Anulowanie wysłania wiadomości

Czy na pewno chcesz anulować wysłanie wiadomości?

Wyślij wiadomość

Opcje filtrowania

Data publikacji

Ustawianie zakresu dat

Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.

Dostępność treści

Słowa kluczowe

Zgłaszanie błędu / nadużycia

Nieudane wysłanie zgłoszenia

Ułatwienia dostępu