Search results

Items from 41 to 60 out of 816 results

chapter

Estimation of noise suppression parameters for maximizing snoring activity detection performance

Keisuke Nishijima, Shingo Uenohara, Ken'ichi Furuya

2017 IEEE International Conference on Consumer Electronics - Taiwan (ICCE-TW) > 21 - 22

2017 IEEE International Conference on Consumer Electronics - Taiwan (ICCE-TW)

The optimal parameters of noise suppression for detection of snoring activity are analyzed and we improve performance of detection of snoring activity in this paper. For detection of snoring activity, we use a Support Vector Machine which is one of machine learning. By training of grand truth and features, the SVM model is obtained. By applying test date to the SVM model, it is classified into snoring...

chapter

A comparative study of voice conversion techniques: A review

Kadria Ezzine, Mondher Frikha

2017 International Conference on Advanced Technologies for Signal and Image Processing (ATSIP) > 1 - 6

2017 International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)

Speaker identity, the sound of a person's voice, is one of the most important characteristics in human communication. Voice conversion (VC) is an emergent problem in voice and speech processing that deals with the process of modifying a speaker's identity. More particularly, the speech signal spoken by the source speaker is modified to sound a sifit had been pronounced by another speaker, referred...

chapter

Ultrasonic target echo detection using neural network

Boyang Wang, Jafar Saniie

2017 IEEE International Conference on Electro Information Technology (EIT) > 286 - 290

2017 IEEE International Conference on Electro Information Technology (EIT)

Ultrasonic Non-Destructive Testing (NDT) and imaging systems has been widely used for industrial and medical applications. In NDT system, detection and characterization of target signal can be extremely challenging because of the complex echo scattering environment and the system noise. In this paper, an algorithm based on Neural Network (NN) is presented to explore the possible solutions for ultrasonic...

chapter

Vehicle detection using acoustic signatures

M Uttarakumari, Anirudh S Koushik, Anirudh S Raghavendra, Akshay R Adiga, more

2017 International Conference on Computing, Communication and Automation (ICCCA) > 1173 - 1177

2017 International Conference on Computing, Communication and Automation (ICCCA)

This paper deals with the problem of classification of vehicles based on their acoustic signatures. Each type of vehicle transmits a particular type of engine sound, which can be used as a basis of classification. The samples are first collected using a reliable recording device. The signals so obtained are de-noised using wavelet analysis. The frames to be analyzed are selected using a unique energy...

chapter

Asynchronous motor imagery detection based on a target guided sub-band filter using wavelet packets

Yujuan Sun, Zuren Feng, Jun Zhang, Qing Zhou, more

2017 29th Chinese Control And Decision Conference (CCDC) > 4850 - 4855

2017 29th Chinese Control And Decision Conference (CCDC)

For an asynchronous system based on brain-computer interface (BCI), detecting the occurrence of motor imagery by electroencephalogram (EEG) signals is the basis but also a challenge, due to the complex and non-stationary characteristics of EEG signals. This paper employs a filtering method which uses a the target guided sub-band filter combined with an energy detector for asynchronous motor imagery...

chapter

Acoustic novelty detection with adversarial autoencoders

Emanuele Principi, Fabio Vesperini, Stefano Squartini, Francesco Piazza

2017 International Joint Conference on Neural Networks (IJCNN) > 3324 - 3330

2017 International Joint Conference on Neural Networks (IJCNN)

Novelty detection is the task of recognising events the differ from a model of normality. This paper proposes an acoustic novelty detector based on neural networks trained with an adversarial training strategy. The proposed approach is composed of a feature extraction stage that calculates Log-Mel spectral features from the input signal. Then, an autoencoder network, trained on a corpus of “normal”...

chapter

Deep neural network bottleneck features for bird species verification

Jinming Zhao, Yanyan Xu, Dengfeng Ke, Kaile Su

2017 International Joint Conference on Neural Networks (IJCNN) > 927 - 933

2017 International Joint Conference on Neural Networks (IJCNN)

Recently, bottleneck features as effective representations have been successfully used in Speaker Recognition (SR) and Language Recognition (LR), but little work has focused on bottleneck features for Bird Species Verification (BSV). In SR, LR and BSR tasks, using short-time spectra features may be insufficient, so it need some more abstract and discriminative representations as complementation to...

chapter

Leveraging the urban soundscape: Auditory perception for smart vehicles

Letizia Marchegiani, Ingmar Posner

2017 IEEE International Conference on Robotics and Automation (ICRA) > 6547 - 6554

2017 IEEE International Conference on Robotics and Automation (ICRA)

Urban environments are characterised by the presence of distinctive audio signals which alert the drivers to events that require prompt action. The detection and interpretation of these signals would be highly beneficial for smart vehicle systems, as it would provide them with complementary information to navigate safely in the environment. In this paper, we present a framework that spots the presence...

chapter

A study of support vector machines for emotional speech recognition

Nattapong Kurpukdee, Sawit Kasuriya, Vataya Chunwijitra, Chai Wutiwiwatchai, more

2017 8th International Conference of Information and Communication Technology for Embedded Systems (IC-ICTES) > 1 - 6

2017 8th International Conference of Information and Communication Technology for Embedded Systems (IC-ICTES)

In this paper, efficiency comparison of Support Vector Machines (SVM) and Binary Support Vector Machines (BSVM) techniques in utterance-based emotion recognition is studied. Acoustic features including energy, Mel-frequency cepstral coefficients (MFCC), Perceptual linear predictive (PLP), Filter bank (FBANK), pitch, their first and second derivatives are used as frame-based features. Four basic emotions...

chapter

Towards intoxicated speech recognition

Zixing Zhang, Felix Weninger, Martin Wollmer, Jing Han, more

2017 International Joint Conference on Neural Networks (IJCNN) > 1555 - 1559

2017 International Joint Conference on Neural Networks (IJCNN)

In a real-life scenario, the acoustic characteristics of speech often suffer from the variations induced by diverse environmental noises and different speakers. To overcome the speaker-related speech variation problem for Automatic Speech Recognition (ASR), many speaker adaptation techniques have been proposed and studied. Almost all of these studies, however, only considered the speakers' long-term...

chapter

A convolutional neural network approach for acoustic scene classification

Michele Valenti, Stefano Squartini, Aleksandr Diment, Giambattista Parascandolo, more

2017 International Joint Conference on Neural Networks (IJCNN) > 1547 - 1554

2017 International Joint Conference on Neural Networks (IJCNN)

This paper presents a novel application of convolutional neural networks (CNNs) for the task of acoustic scene classification (ASC). We here propose the use of a CNN trained to classify short sequences of audio, represented by their log-mel spectrogram. We also introduce a training method that can be used under particular circumstances in order to make full use of small datasets. The proposed system...

chapter

The phoneme set influence for lithuanian speech commands recognition accuracy

Mindaugas Greibus, Zivile Ringeliene, Laimutis Telksnys

2017 Open Conference of Electrical, Electronic and Information Sciences (eStream) > 1 - 4

2017 Open Conference of Electrical, Electronic and Information Sciences (eStream)

The phoneme set influence for Lithuanian speech commands recognition accuracy is investigated. Four phoneme sets are discussed. LIEPA speech corpus for training of Acoustic Model is used. The phonetic representation of corpus transcriptions is generated by grapheme-to-phoneme transformation rules. Rule based transformations for Lithuanian language is proposed. Recognition engine with CMU Pocketsphinx...

chapter

HMM/MLP speech recognition system using a novel data clustering approach

Lilia Lazli, Mounir Boukadoum, Otmane Ait Mohamed

2017 IEEE 30th Canadian Conference on Electrical and Computer Engineering (CCECE) > 1 - 4

2017 IEEE 30th Canadian Conference on Electrical and Computer Engineering (CCECE)

We present a novel approach for large speech databases quantization. It uses an unsupervised iterative process to regulate a similarity measure to set the number of clusters and their boundaries, thus overcoming the shortcomings of conventional clustering algorithms such as k-Means and Fuzzy C-Means, which require a priori knowledge of the number of clusters and a similarity measure that follows the...

chapter

Unsupervised query-by-example spoken term detection based on DPHMM tokenizer

Cao Jiankai, Zhang Lianhai

2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC) > 1321 - 1325

2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC)

This paper investigates the use of Dirichlet process hidden Markov model (DPHMM) tokenizer for the template matching based query-by-example spoken term detection (QbE-STD) task. DPHMM can be obtained following an unsupervised iterative procedure without any training transcriptions. The STD performance of the DPHMM tokenizer is evaluated on TIMIT Corpus. We construct three kinds of DPHMM based QbE-STD...

chapter

Deep neural network approach to frog species recognition

Norsalina Hassan, Dzati Athiar Ramli, Haryati Jaafar

2017 IEEE 13th International Colloquium on Signal Processing & its Applications (CSPA) > 173 - 178

2017 IEEE 13th International Colloquium on Signal Processing & its Applications (CSPA)

Automatic frog species recognition based on acoustic signal has received attention among biologists for environmental studies as it can detect, localize and document the declining population of frog species efficiently compared to the manual survey. In this study, we investigate the possibility of the use of Deep Neural Network (DNN) as a classifier for a frog species recognition system. The Mel-Frequency...

chapter

Addressing data sparsity in DNN acoustic modeling

Seeram Tejaswi, S Umesh

2017 Twenty-third National Conference on Communications (NCC) > 1 - 5

2017 Twenty-third National Conference on Communications (NCC)

This paper presents our work on developing acoustic models using deep neural networks (DNN) for low resource languages. This is considered one of the challenging problems in automatic speech recognition (ASR) as DNNs need large amount of data for building efficient models. The techniques explored in this approach use a common idea of transferring knowledge from models of high resource language to...

chapter

Research on unified phone set for Mandarin-Tibetan Bilingual ASR

Guanyu Li, Hongzhi Yu, Shipeng Xu

2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC) > 478 - 482

2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC)

Mandarin and Tibetan Lhasa dialect are chosen to be the research objects. Phones sets and corresponding Latin Transformation scheme of Mandarin and Tibetan Lhasa dialect are established respectively. KL distance between two GMMs are studied. GMM-HMM models for phones of two languages are trained on the basis of corpus and pronunciation dictionaries. Phones of Mandarin and Tibetan Lhasa dialect are...

chapter

DNN acoustic models for dysarthric speech

Seeram Tejaswi, S Umesh

2017 Twenty-third National Conference on Communications (NCC) > 1 - 4

2017 Twenty-third National Conference on Communications (NCC)

In this paper, we investigate various training methods for building deep neural network (DNN) based acoustic models for dysarthric speech data. Methods like multitask learning, knowledge distillation and model adaptation, which overcome data sparsity and model over-fitting problems are employed to study the merits of each method. In Knowledge distillation framework, some privilege information in addition...

chapter

Acoustic and language modeling for children's read speech assessment

Hitesh Tulsiani, Prakhar Swarup, Preeti Rao

2017 Twenty-third National Conference on Communications (NCC) > 1 - 6

2017 Twenty-third National Conference on Communications (NCC)

Automatic speech recognition can be used to evaluate the accuracy of read speech and thus serve a valuable role in literacy development by providing the needed feedback on reading skills in the absence of qualified teachers. Given the known limitations of ASR in the face of insufficient task-specific training data, the selection of acoustic and language modeling strategies can play a crucial role...

chapter

Towards bootstrapping Acoustic Models for resource poor Indian languages

Prabhat Pandey, Praful Hebbar, Prashant Borole, Sandeep Satpal, more

2017 Twenty-third National Conference on Communications (NCC) > 1 - 4

2017 Twenty-third National Conference on Communications (NCC)

There are several challenges while building Automatic Speech Recognition (ASR) system for low resource languages such as Indic languages. One problem is the access to large amounts of training data required to build Acoustic Models (AM) from scratch. In the context of Indian English, another challenge encountered is code-mixing as many Indian speakers are multilingual and exhibit code-mixing in their...

Keywords:
TRAINING
ACOUSTICS

Publication date

Set your own date range

Content availability

Available (815)
None (1)

Keywords

SPEECH (482)
HIDDEN MARKOV MODELS (426)
SPEECH RECOGNITION (384)
FEATURE EXTRACTION (189)
DATA MODELS (128)
ACCURACY (88)
ADAPTATION MODELS (87)
TRAINING DATA (83)
NEURAL NETWORKS (82)
COMPUTATIONAL MODELING (76)
SPEECH PROCESSING (70)
ARTIFICIAL NEURAL NETWORKS (66)
SUPPORT VECTOR MACHINES (63)
AUTOMATIC SPEECH RECOGNITION (61)
DATABASES (58)
TESTING (54)
DECODING (49)
NATURAL LANGUAGE PROCESSING (46)
ADAPTATION MODEL (44)
ACOUSTIC SIGNAL PROCESSING (43)
VECTORS (43)
SPEAKER RECOGNITION (42)
CONTEXT (40)
DATA MINING (39)
MATHEMATICAL MODEL (38)
SIGNAL PROCESSING (38)
ACOUSTIC MODELING (37)
HIDDEN MARKOV MODEL (36)
NOISE (36)
DEEP NEURAL NETWORK (33)
SPEECH SYNTHESIS (33)
ERROR ANALYSIS (32)
ESTIMATION (32)
LATTICES (32)
DEEP NEURAL NETWORKS (31)
LEARNING (ARTIFICIAL INTELLIGENCE) (31)
ROBUSTNESS (30)
VOCABULARY (30)
DISCRIMINATIVE TRAINING (29)
MAXIMUM LIKELIHOOD ESTIMATION (29)
TRANSFORMS (28)
CLASSIFICATION ALGORITHMS (27)
VISUALIZATION (26)
ACOUSTIC MODEL (24)
DICTIONARIES (24)
KERNEL (23)
PATTERN RECOGNITION (23)
SIGNAL TO NOISE RATIO (22)
STANDARDS (22)
CONTEXT MODELING (21)
EMOTION RECOGNITION (21)
MACHINE LEARNING (21)
NOISE MEASUREMENT (21)
PROBABILITY (21)
SIGNAL PROCESSING ALGORITHMS (21)
CONFERENCES (20)
EQUATIONS (20)
ALGORITHM DESIGN AND ANALYSIS (19)
CLUSTERING ALGORITHMS (19)
EDUCATIONAL INSTITUTIONS (19)
HMM (19)
INDEXES (19)
MICROPHONES (19)
OPTIMIZATION (19)
COMPUTERS (18)
GAUSSIAN PROCESSES (18)
RECURRENT NEURAL NETWORKS (18)
CORRELATION (17)
COMPLEXITY THEORY (16)
COMPUTER ARCHITECTURE (16)
LANGUAGE MODEL (16)
NEURAL NETS (16)
DETECTORS (15)
GAUSSIAN MIXTURE MODEL (15)
SUPPORT VECTOR MACHINE CLASSIFICATION (15)
UNSUPERVISED LEARNING (15)
ACOUSTIC MEASUREMENTS (14)
EVENT DETECTION (14)
MEASUREMENT (14)
CONVOLUTION (13)
KEYWORD SEARCH (13)
MEL FREQUENCY CEPSTRAL COEFFICIENT (13)
PATTERN CLASSIFICATION (13)
PRAGMATICS (13)
PREDICTIVE MODELS (13)
SPEAKER ADAPTATION (13)
APPROXIMATION METHODS (12)
DNN (12)
LVCSR (12)
NIST (12)
PRINCIPAL COMPONENT ANALYSIS (12)
SILICON (12)
SUPPORT VECTOR MACHINE (12)
ENTROPY (11)
LABORATORIES (11)
SHAPE (11)
SPEECH CODING (11)
SPEECH ENHANCEMENT (11)
more

INFONA - science communication portal

Search results

Estimation of noise suppression parameters for maximizing snoring activity detection performance

A comparative study of voice conversion techniques: A review

Ultrasonic target echo detection using neural network

Vehicle detection using acoustic signatures

Asynchronous motor imagery detection based on a target guided sub-band filter using wavelet packets

Acoustic novelty detection with adversarial autoencoders

Deep neural network bottleneck features for bird species verification

Leveraging the urban soundscape: Auditory perception for smart vehicles

A study of support vector machines for emotional speech recognition

Towards intoxicated speech recognition

A convolutional neural network approach for acoustic scene classification

The phoneme set influence for lithuanian speech commands recognition accuracy

HMM/MLP speech recognition system using a novel data clustering approach

Unsupervised query-by-example spoken term detection based on DPHMM tokenizer

Deep neural network approach to frog species recognition

Addressing data sparsity in DNN acoustic modeling

Research on unified phone set for Mandarin-Tibetan Bilingual ASR

DNN acoustic models for dysarthric speech

Acoustic and language modeling for children's read speech assessment

Towards bootstrapping Acoustic Models for resource poor Indian languages

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options