Search results

Items 21 to 40 of 654 results

chapter

A new look at the automatic mapping between Arabic distinctive phonetic features and acoustic cues

Yousef Alotaibi, Yasser Seddiq, Ali Meftah, Sid-Ahmed Selouani, et al.

2017 40th International Conference on Telecommunications and Signal Processing (TSP) > 368 - 371

In this paper, the multidimensional phonological feature structure of Arabic is investigated. Our goal is to assess the performance of statistical and connectionist approaches in performing the complex mappings between distinctive phonetic features (DPF) and associated acoustic cues. The present study explores the mapping between 29 phonological voicing, place, and manner features and Mel-frequency...

chapter

Throat microphone speech recognition using mfcc

Amritha Vijayan, Bipil Mary Mathai, Karthik Valsalan, Riyanka Raji Johnson, et al.

2017 International Conference on Networks & Advances in Computational Technologies (NetACT) > 392 - 395

The Throat Microphone (TM) is a non-acoustic device, relying on the vibrations of vocal folds rather than the audible sound produced. Correctly capturing vocal fold vibrations is difficult due to poor signal representation capabilities. The system recognizes the TM vibrations and produces the corresponding speech sound. This is done by extracting features from the spectrum of the TM vibrations and...

chapter

Speaker-independent speech emotion recognition based on random forest feature selection algorithm

Wei-Hua Cao, Jian-Ping Xu, Zhen-Tao Liu

2017 36th Chinese Control Conference (CCC) > 10995 - 10998

Feature selection is a crucial step in the development of a system for identifying emotions in speech. How to select high-correlation features is an open question. This paper focuses on a feature selection method that aims to extract the most effective acoustic features to improve the performance of emotion recognition. Emotional feature selection of speaker-independent speech based on Random Forest...

chapter

Delay based optimisation of an integrated online call recording speaker diarisation and identification system

Aleksandar Melov, Branislav Gerazov, Zoran Ivanovski

IEEE EUROCON 2017 - 17th International Conference on Smart Technologies > 307 - 311

The design of speaker diarisation and recognition systems is a mature research area and their deployment in the real world has gained momentum. There are still a number of parameters of these systems that have to be tuned and optimised for the application scenario at hand. An online call recording diarisation system is designed with integrated speaker identification of the call-centre operators. The...

chapter

Recognition of positive and negative emotions for Romanian language

Silvia Monica Feraru, Marius Dan Zbancioc

2017 E-Health and Bioengineering Conference (EHB) > 725 - 728

The paper presents the recognition of positive and negative emotions for the Romanian language. The main purpose of this study is to highlight how emotions are recognized when the goal is not to identify the expressed emotion precisely, but only the emotion in general: positive, negative or neutral. This can be useful for a human-machine interface. The positive emotions were recognized with an...

chapter

Text independent gender identification in noisy environmental conditions

Seema Khanum, A Firos

2017 International Conference on Computing, Communication and Automation (ICCCA) > 63 - 66

This paper proposes a competent system that is not only text independent in identifying the gender of a speaker but can also work efficiently in noisy environmental conditions in real time. The noisy environmental conditions are places where noise signals are generated at different SNRs (Signal to Noise Ratios), such as a train station, restaurant, exhibition hall, airport, and so on. The algorithms...

chapter

Speaker identification: A way to reduce call-sign confusion events

Sara Sekkate, Mohammed Khalil, Abdellah Adib

2017 International Conference on Advanced Technologies for Signal and Image Processing (ATSIP) > 1 - 6

This paper examines the development of a speaker identification system (SIS) for future aeronautical communication systems. SIS promises to improve flight safety by reducing the incidence of call-sign confusion events. However, the practical development of such a system faces many challenges, especially related to the signal corruption by the channel noise. Due to the dynamic motion of aircraft, the...

chapter

Automatic detection of early stages of Parkinson's disease through acoustic voice analysis with mel-frequency cepstral coefficients

Laetitia Jeancolas, Habib Benali, Badr-Eddine Benkelfat, Graziella Mangone, et al.

2017 International Conference on Advanced Technologies for Signal and Image Processing (ATSIP) > 1 - 6

Vocal impairments are one of the earliest disrupted modalities in Parkinson's disease (PD). Most of the studies whose aim was to detect Parkinson's disease through acoustic analysis use global parameters. In the meantime, in speaker and speech recognition, analyses are carried out by short-term parameters, and more precisely by Mel-Frequency Cepstral Coefficients (MFCC), combined with Gaussian Mixture...

chapter

Isolated Iqlab checking rules based on speech recognition system

Bilal Yousfi, Akram M. Zeki, Aminah Haji

2017 8th International Conference on Information Technology (ICIT) > 619 - 624

The learning and teaching of the Qur'an is the most important science for Muslims. Teachers and learners in this area should know the tajweed rules when reading the Qur'an. Previous systems have made numerous efforts to develop feasible guiding techniques for the act of Tajweed. However, liking the major control variables of the practices of Tajweed in...

chapter

Small-footprint convolutional neural network for spoofing detection

Heinrich Dinkel, Yanmin Qian, Kai Yu

2017 International Joint Conference on Neural Networks (IJCNN) > 3086 - 3091

Although recent progress in speaker verification has engendered powerful models, malicious attacks in the form of spoofed speech are generally not coped with. In previous attempts, deep neural networks were used to extract high dimensional features which were later classified using an independent classifier. Even though the results of this approach are promising, this architecture's disadvantage is its...

chapter

A SVM based speech to text converter for Turkish language

Burak Tombaloğlu, Hamit Erdem

2017 25th Signal Processing and Communications Applications Conference (SIU) > 1 - 4

In the proposed speech to text conversion, a Support Vector Machines (SVM) based Turkish speech to text converter system has been developed. In the recognition system, Mel Frequency Cepstral Coefficients (MFCC) have been applied to extract features of Turkish speech and an SVM based classifier has been used to classify the phonemes. The morphological structure of Turkish, a language based on phonemes, has...

chapter

Vocal folds pathologies classification using Naïve Bayes Networks

Mohamed Dahmani, Mhania Guerti

2017 6th International Conference on Systems and Control (ICSC) > 426 - 432

In this study, the Naïve Bayes Network (NBN) classifier is used for automatic vocal folds pathologies detection and classification. The proposed method is based on the extraction of acoustic parameters such as Mel Frequency Cepstral Coefficient (MFCC), jitter, shimmer and fundamental frequency, which are used as inputs to the NBN classifier to discriminate between three different groups: speakers with normal...

chapter

A speaker identification performance comparison based on the classifier, the computation time and the number of MFCC

Zubeyir Ozcan, Temel Kayikcioglu

2017 25th Signal Processing and Communications Applications Conference (SIU) > 1 - 4

Speaker identification is a field whose use is growing rapidly in security systems and forensic sciences. Depending on the task, online or offline applications are presented. How accurate and how fast these applications are, and how demanding their computation is, remains an important problem. In this study, the accuracy and the speed of the classifiers that can be used on speaker identification and the...

chapter

A comparative study on feature dependency of the Manipuri language based phonetic engine

Sushanta Kabir Dutta, Salam Nandakishor, L Joyprakash Singh

2017 2nd International Conference on Communication Systems, Computing and IT Applications (CSCITA) > 5 - 10

This paper presents a study on how the performance of a Phonetic Engine (PE) varies with the different sets of spectral features selected for it. An exclusive study is carried out with a PE developed in the Manipuri language. Here, we built the PE using phonetic transcriptions and modeling of each phonetic unit by a Hidden Markov Model (HMM). The symbols of the International Phonetic Alphabet (IPA) (revised in...

chapter

Text dependent voice recognition system using MFCC and VQ for security applications

Ashwin Nair Anil Kumar, Senthil Arumugam Muthukumaraswamy

2017 International conference of Electronics, Communication and Aerospace Technology (ICECA) > 2 > 130 - 136

This paper presents the implementation of a practical voice recognition system using MATLAB (R2014b) to secure a given user's system so that only the user may access it. Voice recognition systems have two phases, training and testing. During the training phase, the characteristic features of the speaker are extracted from the speech signal and stored in a database. In the testing phase, the stored...

chapter

Emotion recognition from speech using MFCC and DWT for security system

Sonali T. Saste, S. M. Jagdale

2017 International conference of Electronics, Communication and Aerospace Technology (ICECA) > 1 > 701 - 704

In recent years, emotion recognition from speech has become an area of growing interest in human-computer interaction. Many researchers have worked on emotion recognition from speech with different systems. This paper attempts language-independent emotion recognition from speech. An emotional speech samples database is used for feature extraction. For feature extraction MFCC and...

chapter

Sequential parameterizing affine projection (SPAP) windowing length for acoustic echo cancellation on speech accents identification

Noraziahtulhidayu Kamarudin, S. A. R Al-Haddad, Asem Khmag, Shaiful Jahari Hashim, et al.

2017 Electric Electronics, Computer Science, Biomedical Engineerings' Meeting (EBBT) > 1 - 5

Echo cancellation has always been among the preprocessing steps applied before the signals are converted to feature vectors and passed to pattern classification. This is always the correct flow of speech identification. Therefore, in order to get the best cleaned signal, adaptive echo cancellation is used to remove the echo and also the noise, which deteriorate the signals and the final results during the classification process...

chapter

Feature fusion techniques based training MLP for speaker identification system

Najiya M. Omar, M.E. El-Hawary

2017 IEEE 30th Canadian Conference on Electrical and Computer Engineering (CCECE) > 1 - 6

This paper aims to compare the Linear Predictive Cepstral Coefficients (LPCC) method, the Mel-frequency Cepstral Coefficient (MFCC) method, their concatenation (LPCC-MFCC), and a new proposed feature fusion approach based on method involving this concatenation with the respective averages normalization; Linear predictive and Mel-frequency Cepstral Coefficients (LMACC) through applying a multi-layer...

chapter

Home automation using spoken Pashto digits recognition

Shibli Nisar, Muhammad Asadullah

2017 International Conference on Innovations in Electrical Engineering and Computational Technologies (ICIEECT) > 1 - 4

Home automation provides convenience, comfort, energy savings, safety and security to people. Most home automation systems today are based on English speech recognition. In northern Pakistan, where the majority of people speak Pashto and the literacy rate is very low, most people are therefore deprived of the use of home automation. The aim of this research is to provide a friendly...

chapter

Visual acuity test for isolated words using speech recognition

Saud Khan, Khalil Ullah

2017 International Conference on Innovations in Electrical Engineering and Computational Technologies (ICIEECT) > 1 - 6

Visual acuity tests are performed by doctors to assess a patient's visual acuity. Health practitioners carry out this test manually on a daily basis. The proposed technique aims to make accurate vision testing easy anywhere, without planning a visit to a practitioner. In this interactive method, a user utters isolated words as a guess input to the system from a table of selected words. The system...

Filter options

Dataset:
ieee
Keywords:
FEATURE EXTRACTION
MEL FREQUENCY CEPSTRAL COEFFICIENT
SPEECH
Publication type:
book

