Search results

Items from 21 to 40 out of 816 results

chapter

Sensor characteristic invariant feature for acoustic stationary pattern classification

S. Thirachai, S. Khomsay, J. Suwatthikul

2017 56th Annual Conference of the Society of Instrument and Control Engineers of Japan (SICE) > 141 - 144

2017 56th Annual Conference of the Society of Instrument and Control Engineers of Japan (SICE)

A calibration of various microphones that have different characteristics is very difficult. This paper presents a feature extraction method as an alternative. The method provides acoustic features that are strongly robust against various characteristic transfer functions. The proposed method applies Local Binary Patterns (LBP) and Compressive Sensing (CS) which compare spectral details with spectral...

chapter

Audio/video supervised independent vector analysis through multimodal pilot dependent components

Francesco Nesta, Saeed Mosayyebpour, Zbynek Koldovsky, Karel Palecek

2017 25th European Signal Processing Conference (EUSIPCO) > 1150 - 1164

2017 25th European Signal Processing Conference (EUSIPCO)

Independent Vector Analysis is a powerful tool for estimating the broadband acoustic transfer function between multiple sources and the microphones in the frequency domain. In this work, we consider an extended IVA model which adopts the concept of pilot dependent signals. Without imposing any constraint on the de-mixing system, pilot signals depending on the target source are injected into the model...

chapter

Detection of alarm sounds in noisy environments

Dean Carmel, Ariel Yeshurun, Yair Moshe

2017 25th European Signal Processing Conference (EUSIPCO) > 1839 - 1843

2017 25th European Signal Processing Conference (EUSIPCO)

Sirens and alarms play an important role in everyday life since they warn people of hazardous situations, even when these are out of sight. Automatic detection of this class of sounds can help hearing impaired or distracted people, e.g., on the road, and contribute to their independence and safety. In this paper, we present a technique for the detection of alarm sounds in noisy environments. The technique...

chapter

Automatic detection of bird species from audio field recordings using HMM-based modelling of frequency tracks

Peter Jancovic, Munevver Kokuer

2017 25th European Signal Processing Conference (EUSIPCO) > 1779 - 1783

2017 25th European Signal Processing Conference (EUSIPCO)

This paper presents an automatic system for detection of bird species in field recordings. A sinusoidal detection algorithm is employed to segment the acoustic scene into isolated spectro-temporal segments. Each segment is represented as a temporal sequence of frequencies of the detected sinusoid, referred to as frequency track. Each bird species is represented by a set of hidden Markov models (HMMs),...

chapter

FPGA implementation of a support vector machine classifier for Ultrasonic flaw detection

Yiyue Jiang, Kushal Virupakshappa, Erdal Oruklu

2017 IEEE 60th International Midwest Symposium on Circuits and Systems (MWSCAS) > 180 - 183

2017 IEEE 60th International Midwest Symposium on Circuits and Systems (MWSCAS)

In this work, we investigate the hardware implementation of Support Vector Machine (SVM) prediction on an FPGA platform for industrial ultrasound applications. Specifically, SVM is used as classifier for identifying ultrasonic A-scan signals as signals with flaw or signals without flaw. Hardware acceleration using FPGA is the main theme of the presented work. The architecture used to implement the...

chapter

Optimizing acoustic feature extractor for anomalous sound detection based on Neyman-Pearson lemma

Yuma Koizumi, Shoichiro Saito, Hisashi Uematsu, Noboru Harada

2017 25th European Signal Processing Conference (EUSIPCO) > 698 - 702

2017 25th European Signal Processing Conference (EUSIPCO)

We propose a method for optimizing an acoustic feature extractor for anomalous sound detection (ASD). Most ASD systems adopt outlier-detection techniques because it is difficult to collect a massive amount of anomalous sound data. To improve the performance of such outlier-detection-based ASD, it is essential to extract a set of efficient acoustic features that is suitable for identifying anomalous...

chapter

A neural network approach for sound event detection in real life audio

Michele Valenti, Dario Tonelli, Fabio Vesperini, Emanuele Principi, more

2017 25th European Signal Processing Conference (EUSIPCO) > 2754 - 2758

2017 25th European Signal Processing Conference (EUSIPCO)

This paper presents and compares two algorithms based on artificial neural networks (ANNs) for sound event detection in real life audio. Both systems have been developed and evaluated with the material provided for the third task of the Detection and Classification of Acoustic Scenes and Events (DCASE) 2016 challenge. For the first algorithm, we make use of an ANN trained on different features extracted...

chapter

Development of multilingual phone recognition system for Indian languages

K E Manjunath, K. Sreenivasa Rao, Dinesh Babu Jayagopi

2017 IEEE International Conference on Signal Processing, Informatics, Communication and Energy Systems (SPICES) > 1 - 6

2017 IEEE International Conference on Signal Processing, Informatics, Communication and Energy Systems (SPICES)

In this paper, the development of Multilingual Phone Recognition System (MPRS) in the context of Indian languages is described. MPRS is a language independent Phone Recognition System (PRS) that could recognise the phonetic units present in a speech utterance of any language. We have developed two Bilingual and a quadrilingual PRS using four Indian languages — Kannada, Telugu, Bengali, and Odia. International...

chapter

Automated detection of geometric defects on connecting rod via acoustic resonance testing

Yun Zheng, Matthias Heinrich, Ahmad Osman, Bernd Valeske

2017 25th European Signal Processing Conference (EUSIPCO) > 1868 - 1872

2017 25th European Signal Processing Conference (EUSIPCO)

Fully automated defect detection and classification of automobile components are crucial for solving quality and efficiency problems for automotive manufacturers, due to the rising wage, production costs and warranty claims. However, metrological deviations in form still represent unsolved problems using state-of-the-art techniques, especially for forged or casted components with complex geometry...

chapter

Minimizing Distribution and Data Loading Overheads in Parallel Training of DNN Acoustic Models with Frequent Parameter Averaging

Pawel Rosciszewski, Jakub Kaliski

2017 International Conference on High Performance Computing & Simulation (HPCS) > 560 - 565

2017 International Conference on High Performance Computing & Simulation (HPCS)

In the paper we investigate the performance of parallel deep neural network training with parameter averaging for acoustic modeling in Kaldi, a popular automatic speech recognition toolkit. We describe experiments based on training a recurrent neural network with 4 layers of 800 LSTM hidden states on a 100-hour corpora of annotated Polish speech data. We propose a MPI-based modification of the training...

chapter

An improved residual LSTM architecture for acoustic modeling

Lu Huang, Ji Xu, Jiasong Sun, Yi Yang

2017 2nd International Conference on Computer and Communication Systems (ICCCS) > 101 - 105

2017 2nd International Conference on Computer and Communication Systems (ICCCS)

Long Short-Term Memory (LSTM) is the primary recurrent neural networks architecture for acoustic modeling in automatic speech recognition systems. Residual learning is an efficient method to help neural networks converge easier and faster. In this paper, we propose several types of residual LSTM methods for our acoustic modeling. Our experiments indicate that, compared with classic LSTM, our architecture...

chapter

Select-additive learning: Improving generalization in multimodal sentiment analysis

Haohan Wang, Aaksha Meghawat, Louis-Philippe Morency, Eric P. Xing

2017 IEEE International Conference on Multimedia and Expo (ICME) > 949 - 954

2017 IEEE International Conference on Multimedia and Expo (ICME)

Multimodal sentiment analysis is drawing an increasing amount of attention these days. It enables mining of opinions in video reviews which are now available aplenty on online platforms. However, multimodal sentiment analysis has only a few high-quality data sets annotated for training machine learning algorithms. These limited resources restrict the generalizability of models, where, for example,...

chapter

Random forest classification based acoustic event detection

Xianjun Xia, Roberto Togneri, Ferdous Sokel, David Huang

2017 IEEE International Conference on Multimedia and Expo (ICME) > 163 - 168

2017 IEEE International Conference on Multimedia and Expo (ICME)

This paper deals with the acoustic event detection (AED) to improve the detection accuracy of acoustic events. Acoustic event detection task is performed by a regression via classification (RvC) based approach along with the random forest technique. A discretization process is used to convert the continuous frame positions within acoustic events into event duration class labels. Outputs of the category-specific...

chapter

Random forest regression based acoustic event detection with bottleneck features

Xianjun Xia, Roberto Togneri, Ferdous Sohel, David Huang

2017 IEEE International Conference on Multimedia and Expo (ICME) > 157 - 162

2017 IEEE International Conference on Multimedia and Expo (ICME)

This paper deals with random forest regression based acoustic event detection (AED) by combining acoustic features with bottleneck features (BN). The bottleneck features have a good reputation of being inherently discriminative in acoustic signal processing. To deal with the unstructured and complex real-world acoustic events, an acoustic event detection system is constructed using bottleneck features...

chapter

Improving acoustic modeling using audio-visual speech

Ahmed Hussen Abdelaziz

2017 IEEE International Conference on Multimedia and Expo (ICME) > 1081 - 1086

2017 IEEE International Conference on Multimedia and Expo (ICME)

Reliable visual features that encode the articulator movements of speakers can dramatically improve the decoding accuracy of automatic speech recognition systems when combined with the corresponding acoustic signals. In this paper, a novel framework is proposed to utilize audio-visual speech not only during decoding but also for training better acoustic models. In this framework, a multi-stream hidden...

chapter

SpeeD's DNN approach to Romanian speech recognition

Alexandru-Lucian Georgescu, Horia Cucu, Corneliu Burileanu

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 8

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)

This paper presents the main improvements brought recently to the large-vocabulary, continuous speech recognition (LVCSR) system for Romanian language developed by the Speech and Dialogue (SpeeD) research laboratory. While the most important improvement consists in the use of DNN-based acoustic models, instead of the classic HMM-GMM approach, several other aspects are discussed in the paper: a significant...

chapter

Towards a continuous speech corpus for banking domain automatic speech recognition

George Suciu, Stefan-Adrian Toma, Romulus Cheveresan

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 6

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)

This paper presents the work done towards developing a speech corpus for Romanian, for automatic speech recognition for the banking domain. This work is done in the context of the Speech2Process project, which aims at creating a system which allows interaction between customers and agents in the contact center much easier. The application to use the banking corpus will provide automatic response to...

chapter

Vehicle Classification and Identification Using Multi-Modal Sensing and Signal Learning

Ryan A. Kerekes, Thomas P. Karnowski, Mike Kuhn, Michael R. Moore, more

2017 IEEE 85th Vehicular Technology Conference (VTC Spring) > 1 - 5

2017 IEEE 85th Vehicular Technology Conference: VTC2017-Spring

Vehicle counting, time-of-travel analysis, and other traffic studies frequently require the classification and identification of vehicles in a roadway. Unfortunately, many current technologies for identifying vehicles, such as image-based methods that use cameras and machine vision, are not appropriate for studies that require low-power consumption and low cost. Additionally, privacy issues are becoming...

chapter

Mixed reality voice training for lecturers

Laura Lenz, Daniela Janssen, Valerie Stehling

2017 4th Experiment@International Conference (exp.at'17) > 107 - 108

2017 4th Experiment@International Conference (exp.at'17)

An often underestimated challenge for lecturers is a considerate use of their voice in teaching auditoriums. Even experienced lecturers are challenged by speaking in front of large classes or in new surroundings for the first time. Universities therefore often offer special voice trainings in which lecturers can be trained to use their voice correctly by a professional voice coach. Those trainings,...

chapter

Automotive surface identification system

Aleksandr Bystrov, Edward Hoare, Thuy-Yung Tran, Nigel Clarke, more

2017 IEEE International Conference on Vehicular Electronics and Safety (ICVES) > 115 - 120

2017 IEEE International Conference on Vehicular Electronics and Safety (ICVES)

In this paper the practical issues of automotive surface identification system development are considering. The novelty of this work is the combining of different training algorithms, neural network structures and methods to increase the classification accuracy and avoid overfitting of real-world data. The obtained results thereby demonstrate that the use of proposed system architecture and statistical...

Keywords:
TRAINING
ACOUSTICS

Publication date

Set your own date range

Content availability

Available (815)
None (1)

Keywords

SPEECH (482)
HIDDEN MARKOV MODELS (426)
SPEECH RECOGNITION (384)
FEATURE EXTRACTION (189)
DATA MODELS (128)
ACCURACY (88)
ADAPTATION MODELS (87)
TRAINING DATA (83)
NEURAL NETWORKS (82)
COMPUTATIONAL MODELING (76)
SPEECH PROCESSING (70)
ARTIFICIAL NEURAL NETWORKS (66)
SUPPORT VECTOR MACHINES (63)
AUTOMATIC SPEECH RECOGNITION (61)
DATABASES (58)
TESTING (54)
DECODING (49)
NATURAL LANGUAGE PROCESSING (46)
ADAPTATION MODEL (44)
ACOUSTIC SIGNAL PROCESSING (43)
VECTORS (43)
SPEAKER RECOGNITION (42)
CONTEXT (40)
DATA MINING (39)
MATHEMATICAL MODEL (38)
SIGNAL PROCESSING (38)
ACOUSTIC MODELING (37)
HIDDEN MARKOV MODEL (36)
NOISE (36)
DEEP NEURAL NETWORK (33)
SPEECH SYNTHESIS (33)
ERROR ANALYSIS (32)
ESTIMATION (32)
LATTICES (32)
DEEP NEURAL NETWORKS (31)
LEARNING (ARTIFICIAL INTELLIGENCE) (31)
ROBUSTNESS (30)
VOCABULARY (30)
DISCRIMINATIVE TRAINING (29)
MAXIMUM LIKELIHOOD ESTIMATION (29)
TRANSFORMS (28)
CLASSIFICATION ALGORITHMS (27)
VISUALIZATION (26)
ACOUSTIC MODEL (24)
DICTIONARIES (24)
KERNEL (23)
PATTERN RECOGNITION (23)
SIGNAL TO NOISE RATIO (22)
STANDARDS (22)
CONTEXT MODELING (21)
EMOTION RECOGNITION (21)
MACHINE LEARNING (21)
NOISE MEASUREMENT (21)
PROBABILITY (21)
SIGNAL PROCESSING ALGORITHMS (21)
CONFERENCES (20)
EQUATIONS (20)
ALGORITHM DESIGN AND ANALYSIS (19)
CLUSTERING ALGORITHMS (19)
EDUCATIONAL INSTITUTIONS (19)
HMM (19)
INDEXES (19)
MICROPHONES (19)
OPTIMIZATION (19)
COMPUTERS (18)
GAUSSIAN PROCESSES (18)
RECURRENT NEURAL NETWORKS (18)
CORRELATION (17)
COMPLEXITY THEORY (16)
COMPUTER ARCHITECTURE (16)
LANGUAGE MODEL (16)
NEURAL NETS (16)
DETECTORS (15)
GAUSSIAN MIXTURE MODEL (15)
SUPPORT VECTOR MACHINE CLASSIFICATION (15)
UNSUPERVISED LEARNING (15)
ACOUSTIC MEASUREMENTS (14)
EVENT DETECTION (14)
MEASUREMENT (14)
CONVOLUTION (13)
KEYWORD SEARCH (13)
MEL FREQUENCY CEPSTRAL COEFFICIENT (13)
PATTERN CLASSIFICATION (13)
PRAGMATICS (13)
PREDICTIVE MODELS (13)
SPEAKER ADAPTATION (13)
APPROXIMATION METHODS (12)
DNN (12)
LVCSR (12)
NIST (12)
PRINCIPAL COMPONENT ANALYSIS (12)
SILICON (12)
SUPPORT VECTOR MACHINE (12)
ENTROPY (11)
LABORATORIES (11)
SHAPE (11)
SPEECH CODING (11)
SPEECH ENHANCEMENT (11)
more

INFONA - science communication portal

Search results

Sensor characteristic invariant feature for acoustic stationary pattern classification

Audio/video supervised independent vector analysis through multimodal pilot dependent components

Detection of alarm sounds in noisy environments

Automatic detection of bird species from audio field recordings using HMM-based modelling of frequency tracks

FPGA implementation of a support vector machine classifier for Ultrasonic flaw detection

Optimizing acoustic feature extractor for anomalous sound detection based on Neyman-Pearson lemma

A neural network approach for sound event detection in real life audio

Development of multilingual phone recognition system for Indian languages

Automated detection of geometric defects on connecting rod via acoustic resonance testing

Minimizing Distribution and Data Loading Overheads in Parallel Training of DNN Acoustic Models with Frequent Parameter Averaging

An improved residual LSTM architecture for acoustic modeling

Select-additive learning: Improving generalization in multimodal sentiment analysis

Random forest classification based acoustic event detection

Random forest regression based acoustic event detection with bottleneck features

Improving acoustic modeling using audio-visual speech

SpeeD's DNN approach to Romanian speech recognition

Towards a continuous speech corpus for banking domain automatic speech recognition

Vehicle Classification and Identification Using Multi-Modal Sensing and Signal Learning

Mixed reality voice training for lecturers

Automotive surface identification system

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options