Search results

Items from 1 to 20 out of 46 results

chapter

Classification of heart sounds using linear prediction coefficients and mel-frequency cepstral coefficients as acoustic features

Pedro Narvaez, Katerine Vera, Nhikolas Bedoya, Winston S. Percybrooks

2017 IEEE Colombian Conference on Communications and Computing (COLCOM) > 1 - 6

2017 IEEE Colombian Conference on Communications and Computing (COLCOM)

This article presents a method that uses Linear Prediction Coefficients (LPC) and Mel-Frequency Cepstral Coefficients (MFCC) as features to classify normal and abnormal cardiac sounds. Three different feature vectors were tested: LPC-only, MFCC-only and LPC + MFCC. Different experiments were made with three classifiers: Support Vector Machine (SVM), K-Nearest Neighbor (KNN) and Random Forests, using...

chapter

A speaker identification performance comparison based on the classifier, the computation time and the number of MFCC

Zubeyir Ozcan, Temel Kayikcioglu

2017 25th Signal Processing and Communications Applications Conference (SIU) > 1 - 4

2017 25th Signal Processing and Communications Applications Conference (SIU)

Speaker identification is a field of which usage grows faster in security systems and forensic sciences. Depending on the tasks, online or offline applications are presented. It is an important problem that how much they are accurate, how much they are fast or how hard is its computation. In this study, the accuracy and the speed of the classifiers that can be used on speaker identification and the...

chapter

A comparative study on feature dependency of the Manipuri language based phonetic engine

Sushanta Kabir Dutta, Salam Nandakishor, L Joyprakash Singh

2017 2nd International Conference on Communication Systems, Computing and IT Applications (CSCITA) > 5 - 10

2017 2nd International Conference on Communication Systems, Computing and IT Applications (CSCITA)

This paper presents a study on how the performance of Phonetic engine(PE) varies with different set of spectral features selected for it. An exclusive study is carried out with a PE developed in the Manipuri language. Here, we built the PE using phonetic transcriptions and modeling of each phonetic unit by Hidden Markov Model (HMM). The symbols of International Phonetic Alphabet (IPA) (revised in...

chapter

Speaker identification evaluation based on the speech biometric and i-vector model using the TIMIT and NTIMIT databases

Musab T. S. Al-Kaltakchi, Wai L. Woo, Satnam S. Dlay, Jonathon A. Chambers

2017 5th International Workshop on Biometrics and Forensics (IWBF) > 1 - 6

2017 5th International Workshop on Biometrics and Forensics (IWBF)

Physiological and behavioural human characteristics are exploited in biometrics and performance metrics are used to measure some characteristic of an individual. The measure might lead to a one-to-one match, which is called authentication or one-from-N, and a match represents identification. In this paper, we exploit a speech biometric I-vector with low and fixed dimension of 100 to identify speakers...

chapter

Unsupervised birdcall activity detection using source and system features

Anshul Thakur, Padmanabhan Rajan

2017 Twenty-third National Conference on Communications (NCC) > 1 - 6

2017 Twenty-third National Conference on Communications (NCC)

In this paper, we describe an unsupervised method to segment birdcalls from the background in bioacoustic recordings. The method utilizes information derived from both source features as well as system features. Three types of source features are extracted from the linear prediction residual signal, and Mel frequency cepstral coefficients are extracted from the system features. The source features...

chapter

Robust Automatic Speech Recognition system based on using adaptive time-frequency masking

Ahmed Mostafa Gouda, Mohamed Tamazin, Mohamed Khedr

2016 11th International Conference on Computer Engineering & Systems (ICCES) > 181 - 186

2016 11th International Conference on Computer Engineering & Systems (ICCES)

The Automatic Speech Recognition (ASR) systems suffer from many types of noises in different environments. Nowadays, developing robust ASR system is an attractive research topic due to the high demands in many commercial applications. In this paper, the Mel-Frequency Cepstral Coefficients (MFCC) is modified to robust the noise, where the spectrogram is used as time-frequency analysis tool. The proposed...

chapter

Isolated word Automatic Speech Recognition (ASR) System using MFCC, DTW & KNN

Muhammad Atif Imtiaz, Gulistan Raja

2016 Asia Pacific Conference on Multimedia and Broadcasting (APMediaCast) > 106 - 110

2016 Asia Pacific Conference on Multimedia and Broadcasting (APMediaCast)

Automatic Speech Recognition (ASR) System is defined as transformation of acoustic speech signals to string of words. This paper presents an approach of ASR system based on isolated word structure using Mel-Frequency Cepstral Coefficients (MFCC's), Dynamic Time Wrapping (DTW) and K-Nearest Neighbor (KNN) techniques. The Mel-Frequency scale used to capture the significant characteristics of the speech...

chapter

Frequency Domain Linear Prediction-based robust text-dependent speaker identification

M. A. Islam

2016 International Conference on Innovations in Science, Engineering and Technology (ICISET) > 1 - 4

2016 International Conference on Innovations in Science, Engineering and Technology (ICISET)

Speaker identification is a biometric technique of determining an unknown speaker's identity among a number of speakers using distinguish latent information of uttered speech. Crime investigation, security control, telephone banking and trading, and information reservation are some applications of this technique. Frequency Domain Linear Prediction (FDLP) is a time-frequency-based feature has been...

chapter

Evaluating the usage of short-time energy on voice biometrics system for cerebral palsy

Syifaun Nafisah, Oyas Wahyunggoro, Lukito Edi Nugroho

2016 8th International Conference on Information Technology and Electrical Engineering (ICITEE) > 1 - 6

2016 8th International Conference on Information Technology and Electrical Engineering (ICITEE)

This study was performed to evaluate the feasibility of short-time energy as an input vector features that will be used as a key of recognition in the voice biometric system to recognize the Cerebral Palsy (CP). To retrieve the characteristics of the voice, Mel-Frequencies Cepstral Coefficients (MFCC) was used as feature extraction algorithm, while Neuro Fuzzy was used as the classifier algorithm...

chapter

Performance comparison of MFCC based bangla ASR system in presence and absence of third differential coefficients

Sudipto Debnath, Fatema-E-Jannat, Susmita Saha, Mohammad Tarik Aziz, more

2016 3rd International Conference on Electrical Engineering and Information Communication Technology (ICEEICT) > 1 - 6

2016 3rd International Conference on Electrical Engineering and Information Communication Technology (ICEEICT)

Present Mel Frequency Cepstral Coefficient (MFCC) based Bangla Automatic Speech Recognition (ASR) systems are mostly implemented with delta and acceleration coefficients. With delta and acceleration coefficients of MFCC and the log energy, a vector set of 39 dimensions is obtained per 10ms. In this paper, our objective is to observe the effect of third differential coefficients on the performance...

chapter

A new speech corpus in Spanish for speaker verification

N. Garcia, T. Arias-Vergara, J. R. Orozco-Arroyave, J. F. Vargas-Bonilla

2016 XXI Symposium on Signal Processing, Images and Artificial Vision (STSIVA) > 1 - 7

2016 XXI Symposium on Signal Processing, Images and Artificial Vision (STSIVA)

In this paper we present a new database with speech recordings in Spanish. The database contains recordings of 54 native Spanish speakers. It is appropriate to be used in the development and testing of better Speaker Verification systems. The recording procedure, equipments and speech tasks are detailed. Experiments using the GMM-UBM speaker verification methodology were performed. The methodology...

chapter

MFCC based noise reduction in ASR using Kalman filtering

Anuradha P Nair, Shoba Krishnan, Zia Saquib

2016 Conference on Advances in Signal Processing (CASP) > 474 - 478

2016 Conference on Advances in Signal Processing (CASP)

Speech enhancement using Kalman filter is an extensively researched area. The vast majority of work done in this area uses linear predictive coding (LPC) for modeling speech signal. A few important studies have revealed the superiority of Mel Frequency Cepstral Coefficients (MFCC) over LPC for speech recognition. With this paper, the shortcomings of speech enhancement using LPC with Kalman filters...

chapter

A personalized music recommender service based on Fuzzy Inference System

Md. Saidur Rahman, Md. Saifor Rahman, Shahnewaz Ul Islam Chowdhury, Ashfaq Mahmood, more

2016 IEEE/ACIS 15th International Conference on Computer and Information Science (ICIS) > 1 - 6

2016 IEEE/ACIS 15th International Conference on Computer and Information Science (ICIS)

In this paper, we are proposing a personalized music recommender service based on Mamdani Fuzzy Interference System (M-FIS). Collection of playlist is used for gathering users' choice and mood while listening to songs. Similarity between audio files is calculated based on Mel Frequency Cepstral Coefficients (MFCC). We have developed a recommender model based on M-FIS with the aforementioned similarities...

chapter

Speaker identification and verification of noisy speech using multitaper MFCC and Gaussian Mixture models

K. V. Veena, Dominic Mathew

2015 International Conference on Power, Instrumentation, Control and Computing (PICC) > 1 - 4

2015 International Conference on Power, Instrumentation, Control and Computing (PICC)

The two major applications of speaker recognition applications are speaker verification and speaker identification. But in most of the cases the signal is corrupted with background interferences such as noise and echo. This paper proposes the method of speaker recognition and identification after the noise separation. Support Vector Machine(SVM) classification based signal separation is adopted here...

chapter

Feature extraction with convolutional restricted boltzmann machine for audio classification

Min Li, Zhenjiang Miao, Cong Ma

2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR) > 791 - 795

2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR)

Feature extraction is a crucial part for a large number of audio tasks. Researchers have extracted audio features in multiple ways, among which some most recent methods are based on the hidden layer of a trained neutral network. In this paper, we present a system which can automatically extract features from unlabeled audio data, and then the features of extracted from the system are used for audio...

chapter

Popular song summarization using chorus section detection from audio signal

Sheng Gao, Haizhou Li

2015 IEEE 17th International Workshop on Multimedia Signal Processing (MMSP) > 1 - 6

2015 IEEE 17th International Workshop on Multimedia Signal Processing (MMSP)

Music signal is a one-dimensional temporal sequence. It thus incurs difficulty for the listeners to quickly capturing the mostly attracting parts in popular songs, unless the listeners play the song until the ending. In order to improve the listening experience, music summarization, a tool to summarize the song using the most attractive sections, is needed. In the paper, a system and method is presented...

chapter

FPGA-based real-time MFCC extraction for automatic audio indexing on FM broadcast data

Guy Wassi, Sylvain Iloga, Olivier Romain, Bertrand Granado

2015 Conference on Design and Architectures for Signal and Image Processing (DASIP) > 1 - 6

2015 Conference on Design and Architectures for Signal and Image Processing (DASIP)

This paper presents an FPGA-based real-time acoustic features extraction method based on MFCC (Mel-Frequency Cepstral Coefficients). The proposed system enables automatic audio indexing of broadcast data from the European standard Frequency Modulation (FM) radio band. Using modelbased design approach that reduces overall design time, we successfully implemented it on Virtex 6 FPGA clocked at more...

chapter

A novel approach in channel independent speaker verification system for Malayalam database using GMM-SVM frame work

Gayathri S, Anish Babu K K

2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 1530 - 1534

2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

Speaker verification deals with the task of confirming the identity of a claim using a hypothesized speaker model and a speaker model database. This work concentrates on a speaker verification system by combining GMM and SVM. The feature vectors used for modelling are Mel Frequency Cepstral Coefficients (MFCC). The database is collected through different recording equipments which is considered as...

chapter

A modified MFCC feature extraction technique For robust speaker recognition

Diksha Sharma, Israj Ali

2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 1052 - 1057

2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

In Speaker Recognition (SR) system, feature extraction is one of the crucial steps where the particular speaker related information are extracted. The state of the art algorithm for this purpose is Mel Frequency Cepstral Coefficient (MFCC), and its complementary feature, Inverted Mel Frequency Cepstral Coefficient (IMFCC). MFCC is based on mel scale and IMFCC is based on inverted mel (imel) scale...

chapter

The effect of DC coefficient on mMFCC and mIMFCC for robust speaker recognition

Diksha Sharma, Israj Ali

2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 313 - 317

2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

In Speaker Recognition (SR) system, feature extraction is one of the crucial steps where the particular speaker related information is extracted. The state of the art algorithm for this purpose is Mel Frequency Cepstral Coefficient (MFCC), and its complementary feature, Inverted Mel Frequency Cepstral Coefficient (IMFCC). MFCC is based on mel scale and IMFCC is based on inverted mel (imel) scale....

Data set:
ieee
Keywords:
FEATURE EXTRACTION
MEL FREQUENCY CEPSTRAL COEFFICIENT
MATHEMATICAL MODEL

Publication date

Set your own date range

Publication type

book (45)
article (1)

Keywords

SPEECH (35)
MFCC (18)
SPEECH RECOGNITION (18)
DATABASES (15)
SPEAKER RECOGNITION (12)
TRAINING (11)
HIDDEN MARKOV MODELS (9)
EQUATIONS (8)
ACCURACY (7)
SPEECH PROCESSING (7)
SIGNAL PROCESSING (6)
CEPSTRAL ANALYSIS (5)
COMPUTATIONAL MODELING (5)
DATA MINING (5)
GMM (5)
ROBUSTNESS (5)
SPEAKER VERIFICATION (5)
SUPPORT VECTOR MACHINES (5)
ACOUSTICS (4)
CORRELATION (4)
FILTER BANKS (4)
IMFCC (4)
NOISE MEASUREMENT (4)
SIGNAL TO NOISE RATIO (4)
SUPPORT VECTOR MACHINE CLASSIFICATION (4)
TESTING (4)
ADAPTATION MODELS (3)
ALGORITHM DESIGN AND ANALYSIS (3)
ANALYTICAL MODELS (3)
CLASSIFICATION ALGORITHMS (3)
COMPUTERS (3)
CONFERENCES (3)
FREQUENCY DOMAIN ANALYSIS (3)
FUSION (3)
HARMONIC ANALYSIS (3)
LABORATORIES (3)
MAXIMUM LIKELIHOOD DETECTION (3)
MEASUREMENT (3)
MUSIC (3)
NEURAL NETWORK (3)
NOISE (3)
PATTERN RECOGNITION (3)
SPEAKER IDENTIFICATION (3)
SVM (3)
TRAINING DATA (3)
VECTORS (3)
ADDITIVE NOISE (2)
ARTIFICIAL NEURAL NETWORKS (2)
ASR (2)
AUDITORY SYSTEM (2)
BIOLOGICAL NEURAL NETWORKS (2)
CEPSTRAL MEAN NORMALIZATION (2)
CLASSIFICATION (2)
COMPLEXITY THEORY (2)
COMPUTER ARCHITECTURE (2)
DTW (2)
EDUCATIONAL INSTITUTIONS (2)
ELECTRONIC MAIL (2)
EMOTION RECOGNITION (2)
FAST FOURIER TRANSFORMS (2)
FEATURE SELECTION (2)
FILTER BANK (2)
FILTERING ALGORITHMS (2)
FILTERING THEORY (2)
FOURIER TRANSFORMS (2)
FREQUENCY CONVERSION (2)
FREQUENCY MODULATION (2)
FUSION TECHNIQUE (2)
GMM-UBM (2)
MACHINE LEARNING (2)
MATLAB (2)
MEL-FREQUENCY CEPSTRAL COEFFICIENT (2)
MICROPHONES (2)
MIMFCC (2)
MMFCC (2)
MUSIC INFORMATION RETRIEVAL (2)
NOISE REDUCTION (2)
POLYNOMIALS (2)
PRESSES (2)
PRODUCTION (2)
PSYCHOLOGY (2)
RECOGNITION (2)
RHYTHM (2)
SEARCH METHODS (2)
SPEECH ENHANCEMENT (2)
STRESS (2)
TRANSFORMS (2)
TRIANGULAR FILTER (2)
WAVELET TRANSFORMS (2)
ACCELERATION (1)
ACOUSTIC DISTORTION (1)
ACOUSTIC MODEL (1)
ADAPTATION MODEL (1)
AFFECTIVE SPACE MODEL (1)
ANN (1)
APPROXIMATION ALGORITHMS (1)
ARABIC LANGUAGE (1)
more

INFONA - science communication portal

Search results

Classification of heart sounds using linear prediction coefficients and mel-frequency cepstral coefficients as acoustic features

A speaker identification performance comparison based on the classifier, the computation time and the number of MFCC

A comparative study on feature dependency of the Manipuri language based phonetic engine

Speaker identification evaluation based on the speech biometric and i-vector model using the TIMIT and NTIMIT databases

Unsupervised birdcall activity detection using source and system features

Robust Automatic Speech Recognition system based on using adaptive time-frequency masking

Isolated word Automatic Speech Recognition (ASR) System using MFCC, DTW & KNN

Frequency Domain Linear Prediction-based robust text-dependent speaker identification

Evaluating the usage of short-time energy on voice biometrics system for cerebral palsy

Performance comparison of MFCC based bangla ASR system in presence and absence of third differential coefficients

A new speech corpus in Spanish for speaker verification

MFCC based noise reduction in ASR using Kalman filtering

A personalized music recommender service based on Fuzzy Inference System

Speaker identification and verification of noisy speech using multitaper MFCC and Gaussian Mixture models

Feature extraction with convolutional restricted boltzmann machine for audio classification

Popular song summarization using chorus section detection from audio signal

FPGA-based real-time MFCC extraction for automatic audio indexing on FM broadcast data

A novel approach in channel independent speaker verification system for Malayalam database using GMM-SVM frame work

A modified MFCC feature extraction technique For robust speaker recognition

The effect of DC coefficient on mMFCC and mIMFCC for robust speaker recognition

Filter options

Publication date

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options