Search results

chapter

Punctuated transcription of multi-genre broadcasts using acoustic and lexical approaches

Ondrej Klejch, Peter Bell, Steve Renals

2016 IEEE Spoken Language Technology Workshop (SLT) > 433 - 440

2016 IEEE Spoken Language Technology Workshop (SLT)

In this paper we investigate the punctuated transcription of multi-genre broadcast media. We examine four systems, three of which are based on lexical features, the fourth of which uses acoustic features by integrating punctuation into the speech recognition acoustic models. We also explore the combination of these component systems using voting and log-linear interpolation. We performed experiments...

chapter

Max-pooling loss training of long short-term memory networks for small-footprint keyword spotting

Ming Sun, Anirudh Raju, George Tucker, Sankaran Panchapagesan, more

2016 IEEE Spoken Language Technology Workshop (SLT) > 474 - 480

2016 IEEE Spoken Language Technology Workshop (SLT)

We propose a max-pooling based loss function for training Long Short-Term Memory (LSTM) networks for small-footprint keyword spotting (KWS), with low CPU, memory, and latency requirements. The max-pooling loss training can be further guided by initializing with a cross-entropy loss trained network. A posterior smoothing based evaluation approach is employed to measure keyword spotting performance...

chapter

Parallel Long Short-Term Memory for multi-stream classification

Mohamed Bouaziz, Mohamed Morchid, Richard Dufour, Georges Linares, more

2016 IEEE Spoken Language Technology Workshop (SLT) > 218 - 223

2016 IEEE Spoken Language Technology Workshop (SLT)

Recently, machine learning methods have provided a broad spectrum of original and efficient algorithms based on Deep Neural Networks (DNN) to automatically predict an outcome with respect to a sequence of inputs. Recurrent hidden cells allow these DNN-based models to manage long-term dependencies such as Recurrent Neural Networks (RNN) and Long Short-Term Memory (LSTM). Nevertheless, these RNNs process...

chapter

LIUM ASR systems for the 2016 Multi-Genre Broadcast Arabic challenge

Natalia Tomashenko, Kevin Vythelingum, Anthony Rousseau, Yannick Esteve

2016 IEEE Spoken Language Technology Workshop (SLT) > 285 - 291

2016 IEEE Spoken Language Technology Workshop (SLT)

This paper describes the automatic speech recognition (ASR) systems developed by LIUM in the framework of the 2016 Multi-Genre Broadcast (MGB-2) Challenge in the Arabic language. LIUM participated in the first of the two proposed tasks, namely the speech-to-text transcription of Aljazeera recordings. We present the approaches and details found in our systems, as well as our results in the evaluation...

chapter

A grapheme-level approach for constructing a Korean morphological analyzer without linguistic knowledge

Jihun Choi, Jonghem Youn, Sang-goo Lee

2016 IEEE International Conference on Big Data (Big Data) > 3872 - 3879

2016 IEEE International Conference on Big Data (Big Data)

Morphological analysis is an essential step for processing the Korean language, due to highly agglutinative properties of the language. In this paper, we propose a novel approach for constructing a Korean morphological analyzer that can capture linguistic properties using graphemes as basic processing units. Since our model does not utilize prior linguistic knowledge, the model can be applied to other...

chapter

Using tone-based extended recognition network to detect non-native Mandarin tone mispronunciations

Wei Li, Sabato Marco Siniscalchi, Nancy F. Chen, Chin-Hui Lee

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 4

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

In this paper, we investigate a DNN tone-based extended recognition network (ERN) approach to Mandarin tone recognition and tone mispronunciation detection. Given a toneless syllable sequence, a tone-based ERN is constructed by assigning five different tones to each toneless syllable, obtaining a fully expanded tonal syllable network. Next, Viterbi decoding is carried out on the tone-based ERN to...

chapter

Behaviour recognition of traffic participants by using manoeuvre primitives for automated vehicles in urban traffic

Susanne Ernst, Jens Rieken, Markus Maurer

2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC) > 976 - 983

2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC)

To work safely, efficiently and robustly, Advanced Driver Systems (ADAS) need a substantial understanding of the environment. Just as a human driver, the system needs to interpret the current situation and its possible developments, especially when it comes to longer prediction horizons or complex urban scenarios. The prerequisite of prediction is the recognition of traffic participants' behaviour...

chapter

A user-personalized model for real time destination and route prediction

Francisco Dantas Nobre Neto, Claudio de Souza Baptista, Claudio E. C. Campelo

2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC) > 401 - 407

2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC)

The act of predicting a destination and route a user will take, as soon as he/she begins to move, has several benefits. A system with this kind of information is able to help the user to avoid a congested route or to suggest a Place of Interest (POI). Nowadays, the task of tracking a user movement is more feasible thanks to current smartphones, with embedded GPS devices. Many related work addresses...

chapter

Context-Based Service Recommendation System Using Probability Model in Mobile Devices

Weng Wen, Huaikou Miao

2016 4th International Conference on Enterprise Systems (ES) > 178 - 182

2016 4th International Conference on Enterprise Systems (ES)

As wireless communication and mobile devicesadvances, recommendation system is one of the keytechnologies to realize personalized service. This paperproposes a service recommendation mechanism using aprobabilistic model in mobile devices. With the contextualinformation and the use's demand state inferred by the model, we can recommend a service to meet the user's preferencesand needs at real time...

chapter

Generating Manipuri English pronunciation dictionary using sequence labelling problem

Rajlakshmi Saikia, Sanasam Ranbir Singh

2016 International Conference on Asian Language Processing (IALP) > 67 - 70

2016 International Conference on Asian Language Processing (IALP)

Creating a highly accurate pronunciation dictionary plays an important role in building English TTS system to produce high quality synthesised speech. Majority of the existing studies related to building Indian English TTS systems adapt CMU pronunciation dictionary to corresponding target Indian accent. Majority of these studies use hand-crafted rule-based approaches to adapt to the target language...

chapter

Survey of the word sense disambiguation and challenges for the Slovak language

Daniel Hladek, Jan Stas, Matus Pleva, Stanislav Ondas, more

2016 IEEE 17th International Symposium on Computational Intelligence and Informatics (CINTI) > 225 - 230

2016 IEEE 17th International Symposium on Computational Intelligence and Informatics (CINTI)

The main goal of this paper is to explain important terms of the word sense disambiguation (WSD) in the Slovak language. A comprehensive survey of current approaches and evaluation methodologies is provided. Special attention is given to necessary language resources and tools. The paper deals with problems specific to Slovak language: missing language resources, rich morphology, free word order and...

chapter

Characterization of mango tree patchiness using a tree-segmentation/clustering approach

Pierre Fernique, Anaeile Dambreville, Jean-Baptiste Durand, Christophe Pradal, more

2016 IEEE International Conference on Functional-Structural Plant Growth Modeling, Simulation, Visualization and Applications (FSPMA) > 68 - 74

2016 IEEE International Conference on Functional-Structural Plant Growth Modeling, Simulation, Visualization and Applications (FSPMA)

In functional-structural plant models, inferring latent levels of organization from data while accounting for both connections between levels and within-individual heterogeneity is a challenging task. Here, we develop an approach based on multiple change-point models. It aims at partitioning a heterogeneous tree into homogeneous subtrees of consequent sizes. While multiple change-point models for...

chapter

A study on compression-based sequential prediction methods for occupancy prediction in smart homes

Sreerama Krishna Sama, Mahshid Rahnamay-Naeini

2016 IEEE 7th Annual Ubiquitous Computing, Electronics & Mobile Communication Conference (UEMCON) > 1 - 8

2016 IEEE 7th Annual Ubiquitous Computing, Electronics & Mobile Communication Conference (UEMCON)

This paper studies the occupancy and movement prediction of residents in the smart home based on a compression-based sequential prediction approach and discusses home automation applications that can benefit from such predictions. The prediction approach studied here is based on the Active LeZi algorithm, which is a compression-based approach that uses an order-k Markov model. The effects of the order...

chapter

Measurement and analysis of physical parameters of the handshake between two persons according to simple social contexts

Gilles Tagne, Patrick Henaff, Nicolas Gregori

2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) > 674 - 679

2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

In order to facilitate and improve robots social acceptance, they must be equipped with behaviors similar to those of humans. It is therefore necessary to study and model the phenomenon to be reproduce. This paper studies and analyzes the physical parameters of the handshake in order to have its characteristic features (frequency, duration, strength, synchronization, etc.) used to model this interaction...

chapter

Recognition of different daily living activities using hidden Markov model regression

Khaled Safi, Samer Mohammed, Ferhat Attal, Mohamad Khalil, more

2016 3rd Middle East Conference on Biomedical Engineering (MECBME) > 16 - 19

2016 3rd Middle East Conference on Biomedical Engineering (MECBME)

The human activity recognition is widely used for human behavior prediction especially for dependent people. This is achieved to provide safety, health monitoring, and well being of this population at home. In this paper, the problem of human activity recognition is reformulated as joint segmentation of multidimensional time series. The hidden Markov model regression (HMMR) is used to perform unsupervised...

chapter

Alignment classification for professional writing assistance

Mai Duong, Minh-Quoc Nghiem, Ngan Luu-Thuy Nguyen

2016 Eighth International Conference on Knowledge and Systems Engineering (KSE) > 181 - 186

2016 Eighth International Conference on Knowledge and Systems Engineering (KSE)

Proofreading, the act of checking first-draft writings performed by native experts, is essential for professional writing by non-native speakers. How to automatically proofread could be an interesting topic of NLP, but have not yet been well-explored. Our research carried out the first step toward automatic proof-reading by automatically analyzing the correspondences between original and proofreading...

chapter

Evaluating the use of word embeddings for part-of-speech tagging in Bahasa Indonesia

Achmad F. Abka

2016 International Conference on Computer, Control, Informatics and its Applications (IC3INA) > 209 - 214

2016 International Conference on Computer, Control, Informatics and its Applications (IC3INA)

This paper studies the use of word embeddings for POS tagging in Bahasa Indonesia. The experiments are conducted with an architecture based on neural network model, that is a simple feed forward neural network with one hidden layer. The word embeddings (i.e., CBOW, skip-gram, and GloVe) are trained on unlabelled text corpus created from Wikipedia Bahasa Indonesia. The results show that word embeddings...

chapter

The design and implementation of HMM-based Dai speech synthesis

Zhan Wang, Jian Yang, Xin Yang

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) > 1 - 5

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP)

By far there are more than 1.2 million Dai compatriots using Dai language in Yunnan province, researching Dai speech synthesis has great significance in advancing the informationization of Dai. This paper focuses on the study of the implementation of Dai speech synthesis by taking the HMM speech synthesis framework and STRAIGHT synthesizer into account. The methods of collection and selection of Dai...

chapter

DNN-based unit selection using frame-sized speech segments

Zhi-Ping Zhou, Zhen-Hua Ling

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) > 1 - 5

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP)

This paper presents a deep neural network (DNN)-based unit selection method for waveform concatenation speech synthesis using frame-sized speech segments. In this method, three DNNs are adopted to calculate target costs and concatenation costs respectively for selecting frame-sized candidate units. The first DNN is built in the same way as the DNN-based statistical parametric speech synthesis, which...

chapter

Rich punctuations prediction using large-scale deep learning

Xueyang Wu, Su Zhu, Yue Wu, Kai Yu

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) > 1 - 5

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP)

Punctuation plays an important role in language processing. However, automatic speech recognition systems only output plain word sequences. It is then of interest to predict punctuations on plain word sequences. Previous works have focused on using lexical features or prosodic cues captured from small corpus to predict simple punctuations. Compared with simple punctuations, rich punctuations provide...

INFONA - science communication portal

Search results

Punctuated transcription of multi-genre broadcasts using acoustic and lexical approaches

Max-pooling loss training of long short-term memory networks for small-footprint keyword spotting

Parallel Long Short-Term Memory for multi-stream classification

LIUM ASR systems for the 2016 Multi-Genre Broadcast Arabic challenge

A grapheme-level approach for constructing a Korean morphological analyzer without linguistic knowledge

Using tone-based extended recognition network to detect non-native Mandarin tone mispronunciations

Behaviour recognition of traffic participants by using manoeuvre primitives for automated vehicles in urban traffic

A user-personalized model for real time destination and route prediction

Context-Based Service Recommendation System Using Probability Model in Mobile Devices

Generating Manipuri English pronunciation dictionary using sequence labelling problem

Survey of the word sense disambiguation and challenges for the Slovak language

Characterization of mango tree patchiness using a tree-segmentation/clustering approach

A study on compression-based sequential prediction methods for occupancy prediction in smart homes

Measurement and analysis of physical parameters of the handshake between two persons according to simple social contexts

Recognition of different daily living activities using hidden Markov model regression

Alignment classification for professional writing assistance

Evaluating the use of word embeddings for part-of-speech tagging in Bahasa Indonesia

The design and implementation of HMM-based Dai speech synthesis

DNN-based unit selection using frame-sized speech segments

Rich punctuations prediction using large-scale deep learning

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options