Search results

Items from 81 to 100 out of 2,639 results

chapter

A learning based training and skill assessment platform with haptic guidance for endovascular catheterization

Wenqiang Chi, Hedyeh Rafii-Tari, Christopher J. Payne, Jindong Liu, et al.

2017 IEEE International Conference on Robotics and Automation (ICRA) > 2357 - 2363

Increasing demands in endovascular intervention have motivated technical skill training and competency-based measures of performance. However, there are no well-established online metrics for technical skill assessment; few studies have explored operator behavioral patterns from catheter motion and operator hand motions. This paper proposes a platform for active online training and objective assessment...

chapter

Recognizing social touch gestures using recurrent and convolutional neural networks

Dana Hughes, Alon Krauthammer, Nikolaus Correll

2017 IEEE International Conference on Robotics and Automation (ICRA) > 2315 - 2321

Deep learning approaches have been used to perform classification in several applications with high-dimensional input data. In this paper, we investigate the potential for deep learning for classifying affective touch on robotic skin in a social setting. Three models are considered, a convolutional neural network, a convolutional-recurrent neural network and an autoencoder-recurrent neural network...

chapter

Automatic creation of a word aligned Sinhala-Tamil parallel corpus

Maryam Ziyad Mohamed, Anusha Ihalapathirana, Riyafa Abdul Hameed, Nadeeshani Pathirennehelage, et al.

2017 Moratuwa Engineering Research Conference (MERCon) > 425 - 430

A parallel corpus aligned at both sentence and word level is an important prerequisite in statistical machine translation. However, manual creation of such a parallel corpus is time consuming, and requires experts fluent in both languages. This paper presents the first ever empirical evaluation carried out to identify the best unsupervised word alignment technique for Sinhala and Tamil. It also presents...

chapter

Acoustic novelty detection with adversarial autoencoders

Emanuele Principi, Fabio Vesperini, Stefano Squartini, Francesco Piazza

2017 International Joint Conference on Neural Networks (IJCNN) > 3324 - 3330

Novelty detection is the task of recognising events that differ from a model of normality. This paper proposes an acoustic novelty detector based on neural networks trained with an adversarial training strategy. The proposed approach is composed of a feature extraction stage that calculates Log-Mel spectral features from the input signal. Then, an autoencoder network, trained on a corpus of “normal”...

chapter

An investigation of high-resolution modeling units of deep neural networks for acoustic scene classification

Xiao Bao, Tian Gao, Jun Du, Li-Rong Dai

2017 International Joint Conference on Neural Networks (IJCNN) > 3028 - 3035

In this paper, we investigate high-resolution modeling units of deep neural networks (DNNs) from concrete to abstract for acoustic scene classification based on Gaussian mixture model (GMM) and ergodic hidden Markov model (HMM). A direct modeling strategy for DNN to classify acoustic scenes is to map each frame feature of an audio to one scene category. However, all frames tagged with the same label...

chapter

Interpretable models for fast activity recognition and anomaly explanation during collaborative robotics tasks

Bradley Hayes, Julie A. Shah

2017 IEEE International Conference on Robotics and Automation (ICRA) > 6586 - 6593

In this paper, we present Rapid Activity Prediction Through Object-oriented Regression (RAPTOR), a scalable method for performing rapid, real-time activity recognition and prediction that achieves state-of-the-art classification accuracy on both a generic human activity dataset and two domain-specific collaborative robotics manufacturing datasets. Our approach is designed to be human-interpretable:...

chapter

Segmentation the speech of hard of hearing children

Laszlo Czap, Judit Maria Pinter, Attila K. Varga

2017 18th International Carpathian Control Conference (ICCC) > 446 - 450

One service provided by our application ‘Speech Assistant System’ assisting the teaching of the hearing impaired to speak is the automatic assessment of words and sentences in the course of practice and feedback to the person. Individual speech sounds can only be correctly evaluated if they are compared with the appropriate reference speech sounds. This requires segmenting the speech to be examined...

chapter

Deep neural network bottleneck features for bird species verification

Jinming Zhao, Yanyan Xu, Dengfeng Ke, Kaile Su

2017 International Joint Conference on Neural Networks (IJCNN) > 927 - 933

Recently, bottleneck features as effective representations have been successfully used in Speaker Recognition (SR) and Language Recognition (LR), but little work has focused on bottleneck features for Bird Species Verification (BSV). In SR, LR and BSV tasks, short-time spectral features may be insufficient, so more abstract and discriminative representations are needed as a complement to...

chapter

Leveraging the urban soundscape: Auditory perception for smart vehicles

Letizia Marchegiani, Ingmar Posner

2017 IEEE International Conference on Robotics and Automation (ICRA) > 6547 - 6554

Urban environments are characterised by the presence of distinctive audio signals which alert the drivers to events that require prompt action. The detection and interpretation of these signals would be highly beneficial for smart vehicle systems, as it would provide them with complementary information to navigate safely in the environment. In this paper, we present a framework that spots the presence...

chapter

Transfer learning of shared latent spaces between robots with similar kinematic structure

Brian Delhaisse, Domingo Esteban, Leonel Rozo, Darwin Caldwell

2017 International Joint Conference on Neural Networks (IJCNN) > 4142 - 4149

Learning complex manipulation tasks often requires collecting a large training dataset to obtain a model of a specific skill. This process may become laborious when dealing with high-DoF robots, and even more tiresome if the skill needs to be learned by multiple robots. In this paper, we investigate how this learning process can be accelerated by using shared latent variable models for knowledge transfer...

chapter

Towards intoxicated speech recognition

Zixing Zhang, Felix Weninger, Martin Wollmer, Jing Han, et al.

2017 International Joint Conference on Neural Networks (IJCNN) > 1555 - 1559

In a real-life scenario, the acoustic characteristics of speech often suffer from the variations induced by diverse environmental noises and different speakers. To overcome the speaker-related speech variation problem for Automatic Speech Recognition (ASR), many speaker adaptation techniques have been proposed and studied. Almost all of these studies, however, only considered the speakers' long-term...

chapter

An improved QRS detection method using Hidden Markov Models

M.A. Belkadi, A. Daamouche

2017 6th International Conference on Systems and Control (ICSC) > 81 - 84

Hidden Markov Models are very efficient in speech recognition. Based on machine states, HMMs combine Bayesian probability and decision making to approximate each output to its appropriate class. In this paper, we propose to use HMMs for ECG QRS detection. We select a set of models to represent the QRS complex and noise, aiming at better discrimination between them. For a total of 44510 beats of the MIT/BIH...

chapter

Malware detection using GA optimized K-means and HMM

Anjly Chanana, Surjeet Singh, K.K. Paliwal

2017 International Conference on Computing, Communication and Automation (ICCCA) > 355 - 362

In this research, we consider the related problem of malware classification based on HMMs. We train HMMs for a variety of malware generators and a variety of compilers. The HMM results are further classified using the k-means algorithm; because k-means tends to get stuck in local minima, we optimize it with a genetic algorithm (GA). Genetic algorithm (GA) tuned k-means clustering...

chapter

Combining deep learning and language modeling for segmentation-free OCR from raw pixels

Stephen Rawls, Huaigu Cao, Ekraam Sabir, Prem Natarajan

2017 1st International Workshop on Arabic Script Analysis and Recognition (ASAR) > 119 - 123

We present a simple yet effective LSTM-based approach for recognizing machine-print text from raw pixels. We use a fully-connected feed-forward neural network for feature extraction over a sliding window, the output of which is directly fed into a stacked bi-directional LSTM. We train the network using the CTC objective function and use a WFST language model during recognition. Experimental results...

chapter

Arabic handwriting recognition using sequential minimal optimization

Hanadi Hassen, Somaya Al-Maadeed

2017 1st International Workshop on Arabic Script Analysis and Recognition (ASAR) > 79 - 84

Due to the variability of writing styles and to other problems related to the nature of Arabic scripts, the recognition of Arabic handwriting is still awaiting accurate results. Segmentation of Arabic handwritten words into graphemes poses a major challenge in Arabic handwriting recognition and is highly error prone. In this paper, we adopt the holistic approach which handles the whole word image...

chapter

Recurrent neural network based user classification for smart grids

Kalman Tornai, Andras Olah, Rajmund Drenyovszki, Lorant Kovacs, et al.

2017 IEEE Power & Energy Society Innovative Smart Grid Technologies Conference (ISGT) > 1 - 5

Power consuming users and buildings with different power consumption patterns may be treated with different conditions and can be taken into consideration with different parameters during capacity planning and distribution. Thus the automated, unsupervised categorization of power consumers is a very important task of smart power transmission systems. Knowing the behavioral categories of power consumers...

chapter

Lip-reading via a DNN-HMM hybrid system using combination of the image-based and model-based features

Mohammad Hasan Rahmani, Farshad Almasganj

2017 3rd International Conference on Pattern Recognition and Image Analysis (IPRIA) > 195 - 199

Introducing features that better represent the visual information of speakers during the speech production is still an open issue that highly affects the quality of the lip-reading and Audio Visual Speech Recognition (AVSR) tasks. In this paper, three different types of visual features from both the image-based and model-based ones are investigated inside a professional lip reading task. The simple...

chapter

The phoneme set influence for Lithuanian speech commands recognition accuracy

Mindaugas Greibus, Zivile Ringeliene, Laimutis Telksnys

2017 Open Conference of Electrical, Electronic and Information Sciences (eStream) > 1 - 4

The influence of the phoneme set on the recognition accuracy of Lithuanian speech commands is investigated. Four phoneme sets are discussed. The LIEPA speech corpus is used for training the acoustic model. The phonetic representation of the corpus transcriptions is generated by grapheme-to-phoneme transformation rules. Rule-based transformations for the Lithuanian language are proposed. Recognition engine with CMU Pocketsphinx...

chapter

HMM/MLP speech recognition system using a novel data clustering approach

Lilia Lazli, Mounir Boukadoum, Otmane Ait Mohamed

2017 IEEE 30th Canadian Conference on Electrical and Computer Engineering (CCECE) > 1 - 4

We present a novel approach for the quantization of large speech databases. It uses an unsupervised iterative process to regulate a similarity measure to set the number of clusters and their boundaries, thus overcoming the shortcomings of conventional clustering algorithms such as k-Means and Fuzzy C-Means, which require a priori knowledge of the number of clusters and a similarity measure that follows the...

chapter

Implementation of ANN based speech recognition system on an embedded board

Pranjali P. Patange, John Sahaya Rani Alex

2017 International Conference on Nextgen Electronic Technologies: Silicon to Software (ICNETS2) > 408 - 412

Speech recognition systems are ubiquitous and find application in automated voice control, voice dialling and automated directory assistance. This paper aims at implementing a neural network based isolated spoken word recognition system on an embedded board (Raspberry Pi) using the open-source software Octave. Mel-Frequency Cepstral Coefficient (MFCC) features are extracted from the speech signal...

Keywords:
TRAINING
HIDDEN MARKOV MODELS

Content availability

Available (2,627)
None (12)

Keywords

SPEECH (916)
SPEECH RECOGNITION (756)
FEATURE EXTRACTION (722)
ACOUSTICS (426)
HIDDEN MARKOV MODEL (334)
ACCURACY (312)
COMPUTATIONAL MODELING (307)
DATABASES (290)
DATA MODELS (286)
DATA MINING (233)
SUPPORT VECTOR MACHINES (228)
TRAINING DATA (212)
HMM (211)
HANDWRITING RECOGNITION (184)
TESTING (183)
NATURAL LANGUAGE PROCESSING (175)
MATHEMATICAL MODEL (161)
ARTIFICIAL NEURAL NETWORKS (151)
VECTORS (146)
NEURAL NETWORKS (137)
LEARNING (ARTIFICIAL INTELLIGENCE) (136)
ADAPTATION MODELS (135)
SPEECH PROCESSING (132)
SPEECH SYNTHESIS (129)
CONTEXT (116)
MEL FREQUENCY CEPSTRAL COEFFICIENT (116)
DECODING (111)
IMAGE SEGMENTATION (111)
SPEAKER RECOGNITION (109)
PROBABILITY (105)
AUTOMATIC SPEECH RECOGNITION (104)
HUMANS (99)
ADAPTATION MODEL (97)
TRAJECTORY (94)
CLASSIFICATION ALGORITHMS (93)
VOCABULARY (93)
GESTURE RECOGNITION (89)
GAUSSIAN PROCESSES (85)
MAXIMUM LIKELIHOOD ESTIMATION (85)
MARKOV PROCESSES (83)
CHARACTER RECOGNITION (82)
ERROR ANALYSIS (82)
PATTERN RECOGNITION (81)
TEXT ANALYSIS (80)
ESTIMATION (79)
DICTIONARIES (77)
VITERBI ALGORITHM (77)
NOISE (76)
PREDICTIVE MODELS (76)
IMAGE RECOGNITION (75)
MACHINE LEARNING (74)
PATTERN CLASSIFICATION (73)
OPTIMIZATION (72)
VISUALIZATION (71)
KERNEL (67)
ROBUSTNESS (66)
TAGGING (66)
CLUSTERING ALGORITHMS (63)
STATISTICAL ANALYSIS (63)
CONTEXT MODELING (62)
SHAPE (62)
FACE RECOGNITION (61)
JOINTS (57)
NOISE MEASUREMENT (57)
RECURRENT NEURAL NETWORKS (57)
IMAGE CLASSIFICATION (56)
NEURONS (56)
SUPPORT VECTOR MACHINE (55)
LABELING (53)
TRANSFORMS (53)
CONDITIONAL RANDOM FIELDS (52)
SENSORS (52)
STANDARDS (52)
ALGORITHM DESIGN AND ANALYSIS (51)
HANDWRITTEN CHARACTER RECOGNITION (51)
NEURAL NETS (50)
SEMANTICS (49)
BAYES METHODS (47)
DETECTORS (47)
FACE (47)
IMAGE SEQUENCES (47)
PRINCIPAL COMPONENT ANALYSIS (47)
CONFERENCES (46)
CAMERAS (45)
PROBABILISTIC LOGIC (43)
TEXT RECOGNITION (43)
DISCRIMINATIVE TRAINING (42)
NATURAL LANGUAGES (42)
COMPUTER VISION (41)
ENTROPY (41)
INFORMATION RETRIEVAL (41)
SIGNAL TO NOISE RATIO (41)
HEURISTIC ALGORITHMS (40)
IMAGE MOTION ANALYSIS (40)
LATTICES (40)
PATTERN CLUSTERING (40)
PIXEL (40)
BIOLOGICAL SYSTEM MODELING (39)

INFONA - science communication portal
