Wyniki wyszukiwania dla: Bo Xu

Pozycje od 1 do 20 spośród 29 wyników

Poprzednia

Następna

rozdział

A class-specific copy network for handling the rare word problem in neural machine translation

Feng Wang, Wei Chen, Zhen Yang, Xiaowei Zhang, więcej

2017 International Joint Conference on Neural Networks (IJCNN) > 2658 - 2664

2017 International Joint Conference on Neural Networks (IJCNN)

Neural machine translation (NMT) has shown promising results and rapidly gained adoption in many large-scale settings. With the NMT model being widely used in empirical productions, its long-standing weakness in handling the rare and out of vocabulary words has been amplified a lot. In order to release the model from the stress of “understanding” the rare words, copy mechanism has been proposed to...

rozdział

Combining unidirectional long short-term memory with convolutional output layer for high-performance speech synthesis

Wenfu Wang, Bo Xu

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5500 - 5504

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, we target improving the accuracy of acoustic modelling for statistical parametric speech synthesis (SPSS) and introduce the convolutional neural network (CNN) due to its powerful capacity in locality modelling. A novel model architecture combining unidirectional long short-term memory (LSTM) and a time-domain convolutional output layer (COL) is proposed and employed to acoustic modelling...

rozdział

Applying connectionist temporal classification objective function to Chinese Mandarin speech recognition

Pengrui Wang, Jie Li, Bo Xu

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) > 1 - 5

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP)

In automatic speech recognition (ASR), connectionist temporal classification (CTC) is regarded as a method to achieve end-to-end system. Actually, not only characters (Chars) but also context independent phonemes (CI-Phns) or context dependent phoneme (CD-Phns) can be used as output units of CTC-trained neural network. The contribution of this paper mainly lies in three aspects: First, we trained...

rozdział

Compositional Recurrent Neural Networks for Chinese Short Text Classification

Yujun Zhou, Bo Xu, Jiaming Xu, Lei Yang, więcej

2016 IEEE/WIC/ACM International Conference on Web Intelligence (WI) > 137 - 144

2016 IEEE/WIC/ACM International Conference on Web Intelligence (WI)

Word segmentation is the first step in Chinese natural language processing, and the error caused by word segmentation can be transmitted to the whole system. In order to reduce the impact of word segmentation and improve the overall performance of Chinese short text classification system, we propose a hybrid model of character-level and word-level features based on recurrent neural network (RNN) with...

rozdział

A Convolutional Architecture for Short Text Expansion and Classification

Peng Wang, Jiaming Xu, Bo Xu, Chenglin Liu, więcej

2015 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT) > 1 > 75 - 78

2015 IEEE / WIC / ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)

In this paper, we propose a convolutional framework for short texts expansion and classification. Particularly, by using additive composition over word embeddings from context with variable window width, the representations of multi-scale semantic units are computed first. Empirically, the semantically related words are usually close to each other in embedding spaces. Thus, the restricted nearest...

rozdział

Performance comparison of local directional pattern to local binary pattern in off-line signature verification system

Bo Xu, Daozhi Lin, Longbiao Wang, Hongyang Chao, więcej

2014 7th International Congress on Image and Signal Processing > 308 - 312

2014 7th International Congress on Image and Signal Processing (CISP)

There are several papers about pseudo dynamic methods used in signature authentication. Recently, the gray scale features local binary pattern(LBP) originate from texture analysis has been widely used in signature verification system with advantage of robustness to illumination change. The major problem of LBP is its sensitivity to noise, hence many solutions has been applied to solve this problem...

rozdział

Phrase-based data selection for language model adaptation in spoken language translation

Shixiang Lu, Wei Wei, Xiaoyin Fu, Lichun Fan, więcej

2012 8th International Symposium on Chinese Spoken Language Processing > 193 - 196

2012 8th International Symposium on Chinese Spoken Language Processing (ISCSLP 2012)

In this paper, we propose an unsupervised phrase-based data selection model, address the problem of selecting no-domain-specific language model (LM) training data to build adapted LM for use. In spoken language translation (SLT) system, we aim at finding the LM training sentences which are similar to the translation task. Compared with the traditional bag-of-words models, the phrase-based data selection...

rozdział

Nesting hierarchical phrase-based model for speech-to-speech translation

Xiaoyin Fu, Wei Wei, Lichun Fan, Shixiang Lu, więcej

2012 8th International Symposium on Chinese Spoken Language Processing > 368 - 372

2012 8th International Symposium on Chinese Spoken Language Processing (ISCSLP 2012)

Hierarchical phrase-based (HPB) translation has been introduced to speech-to-speech (S2S) translation system on mobile terminals, such as smartphones. However, it suffers from the explosive growth in the number of rules along with the increment in decoding time for S2S translation system when the memory and decoding speed is restricted. In this paper, we propose a nesting HPB model to capture the...

rozdział

A platform for the development and evaluation of passive safety applications

Piotr Szczurek, Bo Xu, Ouri Wolfson, Jie Lin

2012 IEEE Intelligent Vehicles Symposium > 808 - 813

2012 IEEE Intelligent Vehicles Symposium (IV)

In this paper, we present a platform for aiding in the development and evaluation of novel ITS passive safety applications. Such applications work by having vehicles detect certain events that may be dangerous to other vehicles and disseminating reports about these events using wireless communication. A vehicle receiving the report about the event can then be warned. However, a large number of false...

rozdział

Discriminative training of weighted polynomial vector for acoustic language recognition

Ce Zhang, Rong Zheng, Bo Xu

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4849 - 4852

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

In this paper, we propose a discriminative method for the acoustic feature based language recognizer, which is a modification of the polynomial expansion in generalized linear discriminant sequence (GLDS) kernel. It is inspired by the Gaussian mixture model-support vector machine (GMM-SVM) system which has been successfully used in both speaker and language recognition. Because of the restriction...

rozdział

Unsupervised training of subspace gaussian mixture models for conversational telephone speech recognition

Zejun Ma, Xiaorui Wang, Bo Xu

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4829 - 4832

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

This paper presents our preliminary works on exploring unsupervised training of subspace gaussian mixture models for under-resourced CTS recognition task. The subspace model yields better performance than conventional GMM model, particularly in small or middle-sized training set. As an effective way to save human efforts, unsupervised learning is often applied to automatically transcribe a large amount...

rozdział

Exploring nuisance attribute projection and score normalization for GLDS-SVM based automatic mispronunciation detection method

HongYan Li, Shen Huang, ShiJin Wang, JiaEn Liang, więcej

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5668 - 5671

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In the task of mispronunciation detection, the cross-speaker degradation and some other confusing nuisances are the challenging problems demanding prompt solution. In this paper, we will attempt to remove the non-pronunciation variations in the GLDS-SVM expansion space by using nuisance attribute projection strategy, in order to increase the separating capacity between different phoneme instances...

rozdział

Confidence estimation for spoken language translation based on Round Trip Translation

Dong Yu, Wei Wei, Lei Jia, Bo Xu

2010 7th International Symposium on Chinese Spoken Language Processing > 426 - 429

7th International Symposium on Chinese Spoken Language Processing (ISCSLP 2010)

In this paper we propose a Round Trip Translation (RTT) based approach to sentence-level confidence estimation (CE) for spoken language translation without the assistant of reference translations generated by human. A number of novel RTT based features are introduced to reflect the quality of spoken language translation in more detail. After combing various kinds of features together, support vector...

rozdział

Similar Handwritten Chinese Characters Recognition by Critical Region Selection Based on Average Symmetric Uncertainty

Bo Xu, Kaizhu Huang, Cheng-Lin Liu

2010 12th International Conference on Frontiers in Handwriting Recognition > 527 - 532

2010 12th International Conference on Frontiers in Handwriting Recognition (ICFHR 2010)

We consider the problem of similar Chinese character recognition in this paper. Engaging the Average Symmetric Uncertainty (ASU) criterion to measure the correlation between different image regions and the class label, we manage to detect the most critical regions for each pair of similar characters. These critical regions are proved to contain more discriminative information and hence can largely...

rozdział

The Asian network-based speech-to-speech translation system

S. Sakti, N. Kimura, M. Paul, C. Hori, więcej

2009 IEEE Workshop on Automatic Speech Recognition&Understanding > 507 - 512

2009 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU 2009)

This paper outlines the first Asian network-based speech-to-speech translation system developed by the Asian Speech Translation Advanced Research (A-STAR) consortium. The system was designed to translate common spoken utterances of travel conversations from a certain source language into multiple target languages in order to facilitate multiparty travel conversations between people speaking different...

rozdział

Experimental Teaching Design and Exploration of System Dynamics Simulation

Haiyan Yan, Bo Xu

2009 Second International Conference on Education Technology and Training > 249 - 252

2009 International Conference on Education Technology and Training (ETT 2009)

Experimental teaching is widely applied in the fields of natural science, engineering, ecology and social science etc. It is also witnessed the increasingly penetrating into business related teaching and training. In particular, simulation based experiment has got immense attention in many management and economics schools of the universities. Simulation based experimental teaching helps the students...

rozdział

Residual Factor Analysis for Text-Independent Speaker Verification

Lei Zhu, Rong Zheng, Bo Xu

2009 Chinese Conference on Pattern Recognition > 1 - 5

2009 Chinese Conference on Pattern Recognition. (CCPR 2009) and the First CJK Joint Workshop on Pattern Recognition (CJKPR)

Joint factor analysis (JFA) has become the state-of-the-art technique in the problem of speaker verification. At the same time, the training of eigenvoice matrix seems to be a heavy burden to us, because it requires lots of multi-channel data, which largely determines the performance of the system. In this paper, we first try to exploit an upper bound performance of the JFA system in a non-normal...

rozdział

Mandarin pitch accent prediction using hierarchical model based ensemble machine learning

Chongjia Ni, Wenju Liu, Bo Xu

2009 IEEE Youth Conference on Information, Computing and Telecommunication > 327 - 330

2009 IEEE Youth Conference on Information, Computing and Telecommunication (YC-ICT 2009)

In this study, we combine the Mandarin characteristics with Mandarin acoustic attribute and text information and use hierarchical model based ensemble machine learning to predict Mandarin pitch accent. Our model could make the best of advantages of prosody hierarchical structure and ensemble machine learning. When comparing our model with classification and regression tree (CART), support vector machine...

rozdział

The application of decision-feedback equalizer in optical burst-mode receiver

Shan Shan Wang, Bo Xu, Kun Qiu

2009 Conference on Lasers&Electro Optics&The Pacific Rim Conference on Lasers and Electro-Optics > 1 - 2

2009 Conference on Lasers & Electro Optics & The Pacific Rim Conference on Lasers and Electro-Optics (CLEO/PACIFIC RIM)

This paper investigates the combined feedforward and decision-feedback equalizer (DFE) in optical burst-mode receivers. The main focus is on the challenge of using DFE to improve bite-error rate (BER) performance for burst-mode receiver.

rozdział

Context Dependent Feature Based Bottom-up Rescoring SVM Classifier in Children's English Stress Mis-pronunciation Detection

Shen Huang, Hongyan Li, Shijin Wang, Jiaen Iiang, więcej

2009 Ninth IEEE International Conference on Advanced Learning Technologies > 236 - 238

2009 Ninth IEEE International Conference on Advanced Learning Technologies (ICALT)

Automatic assessment of word stress error is an integral part for oral language grading system. However, problems that the property of vowels depends on its context information and the data sparseness of different vowel class are yet to be solved. This paper shall briefly introduce a hybrid method consisting of both traditional prosodic features and proposed context dependent strategies. In classification...

Poprzednia

Następna

Opcje filtrowania

Słowa kluczowe:
TRAINING

Data publikacji

Ustaw własny zakres dat

Słowa kluczowe

SPEECH (13)
FEATURE EXTRACTION (11)
SPEECH RECOGNITION (9)
SUPPORT VECTOR MACHINES (9)
ACOUSTICS (5)
COMPUTATIONAL MODELING (5)
HIDDEN MARKOV MODELS (5)
KERNEL (4)
NATURAL LANGUAGE PROCESSING (4)
SPEECH PROCESSING (4)
CLASSIFICATION ALGORITHMS (3)
CONTEXT (3)
DECODING (3)
MEL FREQUENCY CEPSTRAL COEFFICIENT (3)
NEURAL NETWORKS (3)
SEMANTICS (3)
SUPPORT VECTOR MACHINE (3)
TESTING (3)
ADAPTATION MODELS (2)
AUTOMATIC MISPRONUNCIATION DETECTION (2)
AUTOMATIC SPEECH RECOGNITION (2)
BIOLOGICAL SYSTEM MODELING (2)
COMPUTER AIDED INSTRUCTION (2)
CONTEXT MODELING (2)
DATA MINING (2)
DATA MODELS (2)
DECISION FEEDBACK EQUALISERS (2)
DECISION FEEDBACK EQUALIZERS (2)
GAUSSIAN PROCESSES (2)
GENERALIZED LINEAR DISCRIMINANT SEQUENCE (2)
IMAGE CLASSIFICATION (2)
IMAGE FUSION (2)
LANGUAGE TRANSLATION (2)
LEARNING (ARTIFICIAL INTELLIGENCE) (2)
LINEAR PROGRAMMING (2)
LOGIC GATES (2)
MACHINE LEARNING (2)
MATRIX ALGEBRA (2)
MEL-FREQUENCY CEPSTRAL COEFFICIENT (2)
ORGANIZATIONS (2)
PROBABILITY (2)
REGRESSION ANALYSIS (2)
SPEAKER RECOGNITION (2)
SPEECH-TO-SPEECH TRANSLATION (2)
SPOKEN LANGUAGE TRANSLATION (2)
TRAINING DATA (2)
ABSTRACTS (1)
ACCURACY (1)
ADABOOST (1)
ADAPTATION MODEL (1)
ADAPTIVE DECISION FEEDBACK EQUALIZER (1)
ADAPTIVE DECISION FEEDBACK EQUALIZER (ADFE) (1)
ADAPTIVE EQUALISERS (1)
ADDITIVES (1)
ADFE (1)
AMPLITUDE SHIFT KEYING (1)
ARTIFICIAL NEURAL NETWORKS (1)
ASIAN LANGUAGES (1)
ASIAN NETWORK (1)
ASIAN SPEECH TRANSLATION ADVANCED RESEARCH (1)
AUDIO CODING (1)
AUDIO DATABASES (1)
AUDIO FUSION (1)
AUTHENTICATION (1)
AUTOMATIC ASSESSMENT (1)
AUTOMATIC PRONUNCIATION EVALUATION (1)
AVERAGE SYMMETRIC UNCERTAINTY (1)
BER (1)
BIT ERROR RATE (1)
BIT RATE 10 GBIT/S (1)
BITE-ERROR RATE (1)
BLEU (1)
BLUE MOVIES (1)
BLUE MOVIES RECOGNITION (1)
BUSINESS (1)
BUSINESS RELATED TEACHING (1)
CALL (1)
CART ALGORITHM (1)
CASIA CHINESE CHARACTER DATA SET (1)
CHARACTER RECOGNITION (1)
CHINESE (1)
CHROMATIC DISPERSION (1)
CHROMATIC DISPERSION (CD) (1)
CLASS LABEL (1)
CLASSIFICATION ACCURACY (1)
CLASSIFICATION AND REGRESSION TREE (1)
CLASSIFIER FUSION (1)
CNN (1)
COMPLEX SYSTEM (1)
COMPOUNDS (1)
COMPUTER AIDED LANGUAGE LEARNING (1)
COMPUTER ARCHITECTURE (1)
COMPUTER ASSISTED LANGUAGE LEARNING (1)
COMPUTER ASSISTED LANGUAGE LEARNING (CALL) (1)
COMPUTER ASSISTED LANGUAGE LEARNING SYSTEM (1)
CONFIDENCE ESTIMATION (1)
CONFUSABLE MANDARIN PHONE (1)
CONFUSION MATRIX (1)
CONNECTIONIST TEMPORAL CLASSIFICATION (1)
więcej

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania dla: Bo Xu

Dodaj adresata

Anulowanie wysłania wiadomości

Czy na pewno chcesz anulować wysłanie wiadomości?

Wyślij wiadomość

Opcje filtrowania

Data publikacji

Ustawianie zakresu dat

Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.

Słowa kluczowe

Zgłaszanie błędu / nadużycia

Nieudane wysłanie zgłoszenia

Ułatwienia dostępu