Search results

Items from 121 to 140 out of 970 results

chapter

Dual-Domain Hierarchical Classification of Phonetic Time Series

Hossein Hamooni, Abdullah Mueen

2014 IEEE International Conference on Data Mining > 160 - 169

2014 IEEE International Conference on Data Mining (ICDM)

Phonemes are the smallest units of sound produced by a human being. Automatic classification of phonemes is a well-researched topic in linguistics due to its potential for robust speech recognition. With the recent advancement of phonetic segmentation algorithms, it is now possible to generate datasets of millions of phonemes automatically. Phoneme classification on such datasets is a challenging...

chapter

Analyzing the Impact of MFCC and LDA for the Development of Isolated Pashto Spoken Numbers ASR

Tanzeela, Arbab Waseem Abbas, Zakir Ali, Burhan Uddin

2014 12th International Conference on Frontiers of Information Technology > 350 - 354

2014 12th International Conference on Frontiers of Information Technology (FIT)

This paper analyzes speaker-independent isolated Pashto spoken numbers for automatic speech recognition. First, a database was developed that covers isolated Pashto numbers from sefer (0) to sul (100). Fifty speakers (25 male and 25 female, of different ages) who regularly speak the Yousafzai dialect were selected for recording. The recording has...

chapter

English sentence pronunciation evaluation using rhythm and intonation

Xinguang Li, Jiahua Chen, Minfeng Yao, Dongxiong Shen, et al.

The 2014 2nd International Conference on Systems and Informatics (ICSAI 2014) > 366 - 371

2014 2nd International Conference on Systems and Informatics (ICSAI)

Rhythm and intonation are important factors in English sentence pronunciation evaluation. In this paper, the Mel Frequency Cepstrum Coefficient (MFCC) feature and the Hidden Markov Model (HMM) algorithm are used to establish a model for speech recognition. The system then evaluates English sentence pronunciation with a focus on rhythm and intonation, and gives feedback and recommendations about...

chapter

Mapping gestures to speech using the kinect

Sanjivi Muttena, S. Sriram, R. Shiva

2014 International Conference on Science Engineering and Management Research (ICSEMR) > 1 - 5

2014 International Conference on Science Engineering and Management Research (ICSEMR)

Statistics state that approximately one in 1000 people is born mute. In a worldwide population of 7.046 billion, that is a staggering 7 million people. Of all forms of disability, the mute have it particularly tough. The inability of others to comprehend what they hope to express serves as a constant reminder of the misfortune that has befallen them. This catastrophe often bars them from...

chapter

A new direct access framework for speaker identification system

Hery Heryanto, Saiful Akbar, Benhard Sitohang

2014 International Conference on Data and Software Engineering (ICODSE) > 1 - 5

2014 International Conference on Data and Software Engineering (ICODSE)

In this paper we present a new Direct Access Framework (DAF) for a speaker identification system, which identifies a speaker based on original characteristics of the human voice. The direct access method identifies an object based on parts of the object itself; these parts are called original characteristics. The proposed framework consists of two parts, the enrolment process and the identification process...

chapter

Joined cepstral distance features two-stage multi-class classification for emotional speech

Changqin Quan, Bin Zhang, Fuji Ren

2014 IEEE 3rd International Conference on Cloud Computing and Intelligence Systems > 91 - 96

2014 IEEE 3rd International Conference on Cloud Computing and Intelligence Systems (CCIS)

This letter presents a joined cepstral distance and voice quality feature two-stage multi-class classification with DAG-SVM for emotional speech. The Harmonic to Noise Ratio (HNR) is applied to detect throat diseases because it can reflect characteristics of the throat. These characteristics are also a strong basis for distinguishing emotion in speech. The cepstrum and cepstral...

chapter

Automatic pronunciation error detection of nonnative Arabic Speech

Afnan Al Hindi, Mansour Alsulaiman, Ghulam Muhammad, Saad Al-Kahtani

2014 IEEE/ACS 11th International Conference on Computer Systems and Applications (AICCSA) > 190 - 197

2014 IEEE/ACS 11th International Conference on Computer Systems and Applications (AICCSA)

Computer assisted language learning (CALL) and, more specifically, computer assisted pronunciation training (CAPT) have received considerable attention in recent years. CAPT allows continuous feedback to the learner without requiring the sole attention of the teacher; it facilitates self study and encourages interactive use of the language in preference to rote learning. One of the important processes...

chapter

Voice pathology detection using auto-correlation of different filters bank

Ahmed Al-nasheri, Zulfiqar Ali, Ghulam Muhammad, Mansour Alsulaiman

2014 IEEE/ACS 11th International Conference on Computer Systems and Applications (AICCSA) > 50 - 55

2014 IEEE/ACS 11th International Conference on Computer Systems and Applications (AICCSA)

This paper investigates the contribution of frequency bands for automatic voice pathology detection. First, the input voice signal is passed through a number of time-domain band-pass filters. The center frequencies are spaced on an octave scale. Each filter output is then divided into overlapping frames. Auto-correlation function is applied to each block to find the first largest peak, in areas other...

chapter

Sentence extraction in recognition textual entailment task

Yudi Wibisono, Dwi H. Widyantoro, Nur Ulfa Maulidevi

2014 International Conference on Data and Software Engineering (ICODSE) > 1 - 4

2014 International Conference on Data and Software Engineering (ICODSE)

Recognizing textual entailment (RTE) is a task that predicts whether a text fragment can be inferred from another text fragment. In this paper, we tackle the RTE problem by using sentence extraction to cover semantic variation and then extracting the subject, predicate and object from each sentence without using external resources like WordNet. Finally, a similarity function is used to predict the entailment relation...

chapter

Sentence Similarity Based on Semantic Vector Model

Zhao Jingling, Zhang Huiyun, Cui Baojiang

2014 Ninth International Conference on P2P, Parallel, Grid, Cloud and Internet Computing > 499 - 503

2014 Ninth International Conference on P2P, Parallel, Grid, Cloud and Internet Computing (3PGCIC)

Sentence similarity measures play an increasingly important role in text-related research and applications in areas such as text mining, Web page retrieval, and dialogue systems. Existing methods for computing sentence similarity have been adopted from approaches used for long text documents. These methods process sentences in a very high-dimensional space and are consequently inefficient, require...

chapter

An approach to lexical stress detection from transcribed continuous speech using acoustic features

Jozsef Domokos, Adriana Stan, Mircea Giurgiu

2014 22nd Telecommunications Forum Telfor (TELFOR) > 525 - 528

2014 22nd Telecommunications Forum Telfor (TELFOR)

This paper presents a first approach to the unsupervised learning and prediction of primary lexical stress starting from continuous speech data and its orthographic transcript. The approach is intended to be used in the development of text-to-speech synthesis systems for under-resourced languages. Our method is based on syllable nuclei approximation and stress detection using simple acoustic features...

chapter

A small vocabulary automatic filipino speech profanity suppression system using hybrid Hidden Markov Model/Artificial Neural Network (HMM/ANN) keyword spotting framework

Fernando I. Ablaza, Timothy Oliver D. Danganan, Bryan Paul L. Javier, Kevin S. Manalang, et al.

2014 International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment and Management (HNICEM) > 1 - 5

2014 International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment and Management (HNICEM)

This paper describes an implementation of speech recognition that recognizes and suppresses ten (10) defined profane and vulgar Filipino words. The adapted speech recognition architecture is that of the Oregon Graduate Institute's (OGI) Center for Spoken Language Understanding (CSLU). It uses a hybrid Hidden Markov Model/Artificial Neural Network (HMM/ANN) keyword spotting framework. The feature...

chapter

Inter comparison of classification techniques for vowel speech imagery using EEG sensors

Anaum Riaz, Sana Akhtar, Shanza Iftikhar, Amir Ali Khan, et al.

The 2014 2nd International Conference on Systems and Informatics (ICSAI 2014) > 712 - 717

2014 2nd International Conference on Systems and Informatics (ICSAI)

The use of Electroencephalography (EEG) in the domain of Brain Computer Interfaces is now commonplace. EEG for imagined speech reproduction and the observation of brain response to audio stimuli are active areas of research. In this paper, we consider the case of imagined and mouthed non-audible speech recorded with EEG electrodes. We analyze different feature extraction techniques such as Mel Frequency...

chapter

Cipher-text only attack on hopping window time domain scramblers

Hamzeh Ghasemzadeh, Hamed Mehrara, Mehdi Tajik Khas

2014 4th International Conference on Computer and Knowledge Engineering (ICCKE) > 194 - 199

2014 4th International eConference on Computer and Knowledge Engineering (ICCKE)

Wireless channels are highly prone to eavesdropping. To mitigate this problem, encryption systems are used. Analog scrambling systems achieve confidentiality through modification of analog signals. Unfortunately, the current literature lacks a thorough security analysis of these systems. In this paper, the security of hopping window time domain scramblers is investigated. It is shown that cipher-text of these...

chapter

Gammatonegram based speaker identification

Aref Farhadi Pour, Mohammad Asgari, Mohammad Reza Hasanabadi

2014 4th International Conference on Computer and Knowledge Engineering (ICCKE) > 52 - 55

2014 4th International eConference on Computer and Knowledge Engineering (ICCKE)

Speech signals contain important information that can be used for different purposes such as surveillance, smart homes, medicine, etc. Classification of these signals is therefore important for further applications. This article presents a simple method that requires fewer calculations on the central processing unit while achieving a faster reaction time and high accuracy. Many methods exist in speech signal...

chapter

Classifying speech related vs. idle state towards onset detection in brain-computer interfaces overt, inhibited overt, and covert speech sound production vs. idle state

YoungJae Song, Francisco Sepulveda

2014 IEEE Biomedical Circuits and Systems Conference (BioCAS) Proceedings > 568 - 571

2014 IEEE Biomedical Circuits and Systems Conference (BioCAS)

Onset detection is one of the main issues on the way towards self-paced BCIs that can be used outside research settings. For this reason, this paper suggests a potential solution to the onset detection problem by discriminating between speech related events. In this study, overt, inhibited overt and covert states were each classified against the idle state in an off-line setting. Autoregressive model coefficients were...

chapter

Effectiveness in open-set speaker identification

Rawande Karadaghi, Heinz Hertlein, Aladdin Ariyaeeinia

2014 International Carnahan Conference on Security Technology (ICCST) > 1 - 6

2014 International Carnahan Conference on Security Technology (ICCST)

This paper presents investigations into the relative effectiveness of two alternative approaches to open-set text-independent speaker identification (OSTI-SI). The methods considered are the recently introduced i-vector and the more traditional GMM-UBM method supported by score normalisation. The study is motivated by the growing need for effective extraction of intelligence and evidence from audio...

chapter

Part of speech tagging with Naïve Bayes methods

R. Cretulescu, A. David, D. Morariu, L. Vintan

2014 18th International Conference on System Theory, Control and Computing (ICSTCC) > 446 - 451

2014 18th International Conference on System Theory, Control and Computing (ICSTCC)

In this paper we focus on the problem of automatic prediction of parts of speech in sentences. We present an experimental framework which includes the analysis and the implementation of methods for part of speech (POS) labeling (tagging). We have tested three methods that predict the POS without the current word's context and three context-aware statistical methods. The main goal of our...

chapter

An improved pitch detection of speech combined with speech enhancement

Xin Xu, Tian-qi Zhang, Sui Shi, Ya-juan Zhang

2014 7th International Congress on Image and Signal Processing > 778 - 782

2014 7th International Congress on Image and Signal Processing (CISP)

To address the poor robustness of pitch detection in noisy speech, an improved pitch detection method combined with speech enhancement is proposed in this paper. First, in order to reduce background noise and obtain relatively clean speech, we apply multi-band spectral subtraction and the masking properties of the human auditory system to the noisy speech, and next use the energy and zero-crossing...

chapter

Speech emotion recognition

S. Lalitha, Abhishek Madhavan, Bharath Bhushan, Srinivas Saketh

2014 International Conference on Advances in Electronics Computers and Communications > 1 - 4

2014 International Conference on Advances in Electronics, Computers and Communications (ICAECC)

In the past decade a lot of research has gone into Automatic Speech Emotion Recognition (SER). The primary objective of SER is to improve the man-machine interface. It can also be used to monitor the psychophysiological state of a person in lie detectors. Recently, speech emotion recognition has also found applications in medicine and forensics. In this paper 7 emotions are recognized using pitch...

Keywords:
ACCURACY
SPEECH

Content availability

Available (958)
None (12)

Keywords

SPEECH RECOGNITION (465)
FEATURE EXTRACTION (332)
HIDDEN MARKOV MODELS (261)
TRAINING (241)
SPEECH PROCESSING (186)
ACOUSTICS (159)
DATABASES (139)
MEL FREQUENCY CEPSTRAL COEFFICIENT (132)
SUPPORT VECTOR MACHINES (119)
NOISE (98)
SPEAKER RECOGNITION (89)
DATA MINING (85)
NATURAL LANGUAGE PROCESSING (83)
EMOTION RECOGNITION (76)
ARTIFICIAL NEURAL NETWORKS (62)
ESTIMATION (60)
CLASSIFICATION ALGORITHMS (57)
SIGNAL TO NOISE RATIO (53)
AUTOMATIC SPEECH RECOGNITION (51)
COMPUTATIONAL MODELING (48)
VECTORS (48)
CORRELATION (45)
NOISE MEASUREMENT (44)
HUMANS (40)
CEPSTRAL ANALYSIS (39)
EDUCATIONAL INSTITUTIONS (39)
ALGORITHM DESIGN AND ANALYSIS (38)
MATHEMATICAL MODEL (38)
PATTERN CLASSIFICATION (38)
SIGNAL PROCESSING (38)
SPEAKER IDENTIFICATION (37)
LEARNING (ARTIFICIAL INTELLIGENCE) (36)
ROBUSTNESS (36)
TAGGING (36)
TESTING (36)
DECODING (35)
SPEECH CODING (35)
TRAINING DATA (35)
COMPUTERS (34)
GAUSSIAN PROCESSES (34)
MFCC (34)
ADAPTATION MODEL (33)
DATA MODELS (33)
CONFERENCES (32)
CONTEXT (32)
SPEECH SYNTHESIS (32)
HIDDEN MARKOV MODEL (31)
SPEECH ENHANCEMENT (31)
KERNEL (29)
MICROPHONES (29)
VISUALIZATION (29)
DICTIONARIES (27)
INDEXES (27)
SUPPORT VECTOR MACHINE (27)
TRANSFORMS (27)
EQUATIONS (25)
SIGNAL PROCESSING ALGORITHMS (25)
TEXT ANALYSIS (25)
AUDIO SIGNAL PROCESSING (24)
GMM (24)
SVM (24)
VOCABULARY (24)
CLASSIFICATION (23)
ENTROPY (23)
GAUSSIAN MIXTURE MODEL (23)
MACHINE LEARNING (23)
PRINCIPAL COMPONENT ANALYSIS (23)
STATISTICAL ANALYSIS (23)
ACOUSTIC SIGNAL PROCESSING (22)
OPTIMIZATION (22)
PROBABILITY (22)
SPEECH ANALYSIS (21)
ANALYTICAL MODELS (20)
COMPLEXITY THEORY (20)
SIGNAL CLASSIFICATION (20)
TIME FREQUENCY ANALYSIS (20)
DECISION TREES (19)
DELAY (19)
HMM (19)
MAXIMUM LIKELIHOOD ESTIMATION (19)
PATTERN RECOGNITION (19)
SEMANTICS (19)
SUPPORT VECTOR MACHINE CLASSIFICATION (19)
ELECTRONIC MAIL (18)
ERROR ANALYSIS (18)
LABELING (18)
REAL TIME SYSTEMS (18)
ROBOTS (18)
ADAPTATION MODELS (17)
FACE (17)
FILTERING (17)
HARMONIC ANALYSIS (17)
MUSIC (17)
NEURAL NETWORKS (17)
STRESS (17)
DETECTORS (16)
INFORMATION RETRIEVAL (16)
NATURAL LANGUAGES (16)

INFONA - science communication portal

