Search results

Items from 1 to 20 out of 816 results

chapter

Predicting underwater acoustic network variability using machine learning techniques

Vignesh Kalaiarasu, Hari Vishnu, Ahmed Mahmood, Mandar Chitre

OCEANS 2017 – Anchorage > 1 - 7

OCEANS 2017 - Anchorage

Predicting the performance of an underwater acoustic network (UAN) is a challenging task due to the spatiotemporal variability of the links and its complicated dependence on multiple factors. We present a machine-learning model based on logistic regression (LogR) to capture the spatio-temporal variation in the performance of a UAN. The model captures the effect of environmental factors such as wind...

chapter

Exploring recurrent neural network based acoustic and linguistic modeling for children's speech recognition

Sreeram Ganji, Rohit Sinha

TENCON 2017 - 2017 IEEE Region 10 Conference > 2880 - 2884

TENCON 2017 - 2017 IEEE Region 10 Conference

The conventional automatic speech recognition (ASR) systems employ the GMM-HMM for acoustic modeling and the n-gram for language modeling. Over the last decade, the deep feed-forward neural network (DFNN) has almost replaced the GMM in acoustic modeling. The current ASR systems are predominantly based on the DFNN-HMM acoustic model and the n-gram language model (LM). Owing to better long-term context...

chapter

Development and evaluation of the program for auditory training in the correction of central auditory processing disorders

Dmitriy I. Kaplun, Denis V. Gnezdilov, George A. Efimenko, Alexey A. Pochechuev, more

2017 IEEE II International Conference on Control in Technical Systems (CTS) > 106 - 109

2017 IEEE II International Conference on Control in Technical Systems (CTS)

The main indication for auditory training is central auditory processing disorder (CAPD), which inevitably develops in patients with the chronic sensorineural hearing loss as a consequence of auditory deprivation. Patients with CAPD have difficulties with understanding complex signals, especially, speech in background noise. The aim of the study was to create the optimal algorithm of auditory training...

chapter

Improving Accuracy of Automatic Fracture Detection in Borehole Images with Deep Learning and GPUs

Rommel Anatoli Quintanilla Cruz, Diego Carrico Cacau, Renato Moraes dos Santos, Evandro Jose Ribeiro Pereira, more

2017 30th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI) > 345 - 350

2017 30th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI)

The logging and further analysis of borehole images is a major step in the interpretation of geological events. Natural fractures and beddings are features whose identification is commonly performed using acoustic and electrical borehole imaging tools. Such identification is a tedious task and is made visually by geologists, who must be experts on classification. The correct identification of planar...

chapter

Analysis of data fusion techniques for multi-microphone audio event detection in adverse environments

Irene Martin-Morato, Maximo Cobos, Francesc J. Ferri

2017 IEEE 19th International Workshop on Multimedia Signal Processing (MMSP) > 1 - 6

2017 IEEE 19th International Workshop on Multimedia Signal Processing (MMSP)

Acoustic event detection (AED) is currently a very active research area with multiple applications in the development of smart acoustic spaces. In this context, the advances brought by Internet of Things (IoT) platforms where multiple distributed microphones are available have also contributed to this interest. In such scenarios, the use of data fusion techniques merging information from several sensors...

chapter

Sound event detection in synthetic audio: Analysis of the dcase 2016 task results

Gregoire Lafay, Emmanouil Benetos, Mathieu Lagrange

2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) > 11 - 15

2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

As part of the 2016 public evaluation challenge on Detection and Classification of Acoustic Scenes and Events (DCASE 2016), the second task focused on evaluating sound event detection systems using synthetic mixtures of office sounds. This task, which follows the ‘Event Detection-Office Synthetic’ task of DCASE 2013, studies the behaviour of tested algorithms when facing controlled levels of audio...

chapter

Transfer learning of weakly labelled audio

Aleksandr Diment, Tuomas Virtanen

2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) > 6 - 10

2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

Many machine learning tasks have been shown solvable with impressive levels of success given large amounts of training data and computational power. For the problems which lack data sufficient to achieve high performance, methods for transfer learning can be applied. These refer to performing the new task while having prior knowledge of the nature of the data, gained by first performing a different...

chapter

Learning vocal mode classifiers from heterogeneous data sources

Zhao Shuyang, Toni Heittola, Tuomas Virtanen

2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) > 16 - 20

2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

This paper targets on a generalized vocal mode classifier (speech/singing) that works on audio data from an arbitrary data source. Previous studies on sound classification are commonly based on cross-validation using a single dataset, without considering training-recognition mismatch. In our study, two experimental setups are used: matched training-recognition condition and mismatched training-recognition...

chapter

Metric learning based data augmentation for environmental sound classification

Rui Lu, Zhiyao Duan, Changshui Zhang

2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) > 1 - 5

2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

Deep neural networks have been widely applied in the field of environmental sound classification. However, due to the scarcity of carefully labeled data, their training process suffers from over-fitting. Data augmentation is a technique that alleviates this issue. It augments the training set with synthetic data that are created by modifying some parameters of the real data. However, not all kinds...

chapter

A low complexity method based on reaction-diffusion transform for ultrasound echo-based shape object classification

Mihai Bucurica, Ioana Dogaru, Radu Dogaru

2017 5th International Symposium on Electrical and Electronics Engineering (ISEEE) > 1 - 5

2017 5th International Symposium on Electrical and Electronics Engineering (ISEEE)

This paper presents improvements in terms of accuracy for shape object classification using a new low complexity method compared to previous implementation [1]. The method is using echoes generated by a JAVA platform capable of emulate sound propagation in a controlled 2D virtual environment [2][3]. Echoes originate from the ultrasonic waves generated inside a virtual environment which contains geometrical...

chapter

Scaper: A library for soundscape synthesis and augmentation

Justin Salamon, Duncan MacConnell, Mark Cartwright, Peter Li, more

2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) > 344 - 348

2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

Sound event detection (SED) in environmental recordings is a key topic of research in machine listening, with applications in noise monitoring for smart cities, self-driving cars, surveillance, bioa-coustic monitoring, and indexing of large multimedia collections. Developing new solutions for SED often relies on the availability of strongly labeled audio recordings, where the annotation includes the...

chapter

Automatic species recognition using echolocation clicks from odontocetes

Wenyu Luo, Wuyi Yang, Zhongchang Song, Yu Zhang

2017 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC) > 1 - 5

2017 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC)

The classification of different odontocetes using écholocation clicks plays a significant role in tracking and detecting animals for research and protection purposes. Echolocation clicks were detected by an automatic method based on the Teager-Kaiser Energy Operator (TKEO). Then, these clicks were represented by their FFT magnitude spectrum. To reduce the influence of high similar clicks among species,...

chapter

Simulation study on identification of delamination size of composite based on the support vector machine

Su Chen-Hui, Shen Jing-Shi, Jiang Ming-Shun, Sui Qing-Mei, more

2017 Chinese Automation Congress (CAC) > 4253 - 4256

2017 Chinese Automation Congress (CAC)

Composites are widely used in aviation, aerospace and other fields because of their high specific strength, high specific stiffness and easy molding. However, in the process of using the concentrated stress, heavy shocks may form different degrees of damage. Especially, the internal delamination will reduce the stability and safety of the structure. Based on the analysis of damage location and damage...

chapter

Ultrasonic flaw detection based on temporal and subband signals applied to neural network

Boyang Wang, Jafar Saniie

2017 IEEE International Ultrasonics Symposium (IUS) > 1

2017 IEEE International Ultrasonics Symposium (IUS)

Ultrasonic NDE uses high frequency acoustic waves to evaluate materials, and often signal processing is required to detect echoes from defects in the presence of microstructure scattering noise. Scattering noise, also known as clutter, interferes with the flaw signal and cannot be completely eliminated by using classical signal processing methods such as band-pass filtering. In this paper, neural...

chapter

Ultrasonic flaw detection based on temporal and spectral signals applied to neural network

Boyang Wang, Jafar Saniie

2017 IEEE International Ultrasonics Symposium (IUS) > 1 - 4

2017 IEEE International Ultrasonics Symposium (IUS)

Ultrasonic Non-Destructive Evaluation (NDE) uses high frequency acoustic waves to evaluate materials, and often signal processing is required to detect echoes from defects in the presence of micro-structure scattering noise. Scattering noise is known as the clutter. The clutter interferes with the flaw signal and cannot be completely separated from it by using conventional signal processing methods...

chapter

Novel alignment method for DNN TTS training using HMM synthesis models

Sinisa Suzic, Tijana Delic, Darko Pekar, Vladimir Ostojic

2017 IEEE 15th International Symposium on Intelligent Systems and Informatics (SISY) > 271 - 276

2017 IEEE 15th International Symposium on Intelligent Systems and Informatics (SISY)

In order to train neural networks (NN) for text-to-speech synthesis (TTS), phonetic segmentation must be performed. The most accurate segmentation is performed manually, but the process of creating manual alignments is costly and time-consuming, so automatic procedures are preferable. In this paper, a simple alignment method based on models trained during hidden Markov Model (HMM) based TTS system...

chapter

A hardware/software co-design architecture for ultrasonic flaw detection with Hidden Markov Model and wavelet transform

Kushal Virupakshappa, Erdal Oruklu

2017 IEEE International Ultrasonics Symposium (IUS) > 1

2017 IEEE International Ultrasonics Symposium (IUS)

This work presents an embedded hardware architecture for real-time ultrasonic NDE applications that incorporate Hidden Markov Model (HMM) based statistical signal methods. HMM has been successfully used in applications like audio segment retrieval, speech/language recognition and image processing applications. Recently, we proposed a new Hidden Markov Model (HMM) based ultrasonic flaw detection algorithm...

chapter

A hardware/software co-design architecture for ultrasonic flaw detection with Hidden Markov Model and Wavelet Transform

Kushal Virupakshappa, Erdal Oruklu

2017 IEEE International Ultrasonics Symposium (IUS) > 1 - 4

2017 IEEE International Ultrasonics Symposium (IUS)

This work presents an embedded hardware architecture for real-time ultrasonic NDE applications that incorporate Hidden Markov Model (HMM) based statistical signal methods. Proposed algorithm is a combination of Discrete Wavelet Transform (DWT) for pre-processing A-scan signals and HMM for classification of the flaw presence. For this study, a MicroZed FPGA with Xilinx Zynq-7020 System-on-Chip (SoC)...

chapter

Leveraging deep neural networks with nonnegative representations for improved environmental sound classification

Victor Bisot, Romain Serizel, Slim Essid, Gael Richard

2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP) > 1 - 6

2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP)

This paper introduces the use of representations based on nonnegative matrix factorization (NMF) to train deep neural networks with applications to environmental sound classification. Deep learning systems for sound classification usually rely on the network to learn meaningful representations from spectrograms or hand-crafted features. Instead, we introduce a NMF-based feature learning stage before...

chapter

Speech recognition features based on deep latent Gaussian models

Andros Tjandra, Sakriani Sakti, Satoshi Nakamura

2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP) > 1 - 6

2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP)

This paper constructs speech features based on a generative model using a deep latent Gaussian model (DLGM), which is trained using stochastic gradient variational Bayes (SGVB) algorithm and performs efficient approximate inference and learning with a directed probabilistic graphical model. The trained DLGM then generate latent variables based on Gaussian distribution, which is used as new features...

Keywords:
TRAINING
ACOUSTICS

Publication date

Set your own date range

Content availability

Available (815)
None (1)

Keywords

SPEECH (482)
HIDDEN MARKOV MODELS (426)
SPEECH RECOGNITION (384)
FEATURE EXTRACTION (189)
DATA MODELS (128)
ACCURACY (88)
ADAPTATION MODELS (87)
TRAINING DATA (83)
NEURAL NETWORKS (82)
COMPUTATIONAL MODELING (76)
SPEECH PROCESSING (70)
ARTIFICIAL NEURAL NETWORKS (66)
SUPPORT VECTOR MACHINES (63)
AUTOMATIC SPEECH RECOGNITION (61)
DATABASES (58)
TESTING (54)
DECODING (49)
NATURAL LANGUAGE PROCESSING (46)
ADAPTATION MODEL (44)
ACOUSTIC SIGNAL PROCESSING (43)
VECTORS (43)
SPEAKER RECOGNITION (42)
CONTEXT (40)
DATA MINING (39)
MATHEMATICAL MODEL (38)
SIGNAL PROCESSING (38)
ACOUSTIC MODELING (37)
HIDDEN MARKOV MODEL (36)
NOISE (36)
DEEP NEURAL NETWORK (33)
SPEECH SYNTHESIS (33)
ERROR ANALYSIS (32)
ESTIMATION (32)
LATTICES (32)
DEEP NEURAL NETWORKS (31)
LEARNING (ARTIFICIAL INTELLIGENCE) (31)
ROBUSTNESS (30)
VOCABULARY (30)
DISCRIMINATIVE TRAINING (29)
MAXIMUM LIKELIHOOD ESTIMATION (29)
TRANSFORMS (28)
CLASSIFICATION ALGORITHMS (27)
VISUALIZATION (26)
ACOUSTIC MODEL (24)
DICTIONARIES (24)
KERNEL (23)
PATTERN RECOGNITION (23)
SIGNAL TO NOISE RATIO (22)
STANDARDS (22)
CONTEXT MODELING (21)
EMOTION RECOGNITION (21)
MACHINE LEARNING (21)
NOISE MEASUREMENT (21)
PROBABILITY (21)
SIGNAL PROCESSING ALGORITHMS (21)
CONFERENCES (20)
EQUATIONS (20)
ALGORITHM DESIGN AND ANALYSIS (19)
CLUSTERING ALGORITHMS (19)
EDUCATIONAL INSTITUTIONS (19)
HMM (19)
INDEXES (19)
MICROPHONES (19)
OPTIMIZATION (19)
COMPUTERS (18)
GAUSSIAN PROCESSES (18)
RECURRENT NEURAL NETWORKS (18)
CORRELATION (17)
COMPLEXITY THEORY (16)
COMPUTER ARCHITECTURE (16)
LANGUAGE MODEL (16)
NEURAL NETS (16)
DETECTORS (15)
GAUSSIAN MIXTURE MODEL (15)
SUPPORT VECTOR MACHINE CLASSIFICATION (15)
UNSUPERVISED LEARNING (15)
ACOUSTIC MEASUREMENTS (14)
EVENT DETECTION (14)
MEASUREMENT (14)
CONVOLUTION (13)
KEYWORD SEARCH (13)
MEL FREQUENCY CEPSTRAL COEFFICIENT (13)
PATTERN CLASSIFICATION (13)
PRAGMATICS (13)
PREDICTIVE MODELS (13)
SPEAKER ADAPTATION (13)
APPROXIMATION METHODS (12)
DNN (12)
LVCSR (12)
NIST (12)
PRINCIPAL COMPONENT ANALYSIS (12)
SILICON (12)
SUPPORT VECTOR MACHINE (12)
ENTROPY (11)
LABORATORIES (11)
SHAPE (11)
SPEECH CODING (11)
SPEECH ENHANCEMENT (11)
more

INFONA - science communication portal

Search results

Predicting underwater acoustic network variability using machine learning techniques

Exploring recurrent neural network based acoustic and linguistic modeling for children's speech recognition

Development and evaluation of the program for auditory training in the correction of central auditory processing disorders

Improving Accuracy of Automatic Fracture Detection in Borehole Images with Deep Learning and GPUs

Analysis of data fusion techniques for multi-microphone audio event detection in adverse environments

Sound event detection in synthetic audio: Analysis of the dcase 2016 task results

Transfer learning of weakly labelled audio

Learning vocal mode classifiers from heterogeneous data sources

Metric learning based data augmentation for environmental sound classification

A low complexity method based on reaction-diffusion transform for ultrasound echo-based shape object classification

Scaper: A library for soundscape synthesis and augmentation

Automatic species recognition using echolocation clicks from odontocetes

Simulation study on identification of delamination size of composite based on the support vector machine

Ultrasonic flaw detection based on temporal and subband signals applied to neural network

Ultrasonic flaw detection based on temporal and spectral signals applied to neural network

Novel alignment method for DNN TTS training using HMM synthesis models

A hardware/software co-design architecture for ultrasonic flaw detection with Hidden Markov Model and wavelet transform

A hardware/software co-design architecture for ultrasonic flaw detection with Hidden Markov Model and Wavelet Transform

Leveraging deep neural networks with nonnegative representations for improved environmental sound classification

Speech recognition features based on deep latent Gaussian models

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options