Search results for: Lei Xie

Items from 1 to 13 out of 13 results

chapter

Monitoring of dynamic process using hierarchical probability density decomposition

Anni Ying, Shihua Luo, Jiusun Zeng, Lei Xie, et al.

2017 Chinese Automation Congress (CAC) > 4793 - 4797

Monitoring of dynamic industrial processes has become increasingly important due to ever stricter safety and reliability requirements. Popular methods such as time-lagged arrangement-based and subspace-based approaches perform well in fault detection; however, they have difficulty accurately isolating faulty variables and diagnosing fault types. To alleviate this difficulty,...

chapter

Meta-activity recognition: A wearable approach for logic cognition-based activity sensing

Lei Xie, Xu Dong, Wei Wang, Dawei Huang

IEEE INFOCOM 2017 - IEEE Conference on Computer Communications > 1 - 9

Activity sensing has become a key technology for many ubiquitous applications, such as exercise monitoring and elder care. Most traditional approaches track human motion and perform activity recognition using waveform-matching schemes at the raw data representation level. For complex activities with a relatively large range of movement, they usually fail to accurately recognize these...

chapter

Pairwise learning using multi-lingual bottleneck features for low-resource query-by-example spoken term detection

Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, et al.

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5645 - 5649

We propose to use a feature representation obtained by pairwise learning in a low-resource language for query-by-example spoken term detection (QbE-STD). We assume that word pairs identified by humans are available in the low-resource target language. The word pairs are parameterized by a multi-lingual bottleneck feature (BNF) extractor that is trained using transcribed data in high-resource languages...

chapter

On the use of I-vectors and average voice model for voice conversion without parallel data

Jie Wu, Zhizheng Wu, Lei Xie

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 6

Recently, deep and/or recurrent neural networks (DNNs/RNNs) have been employed for voice conversion, and have significantly improved the performance of converted speech. However, DNNs/RNNs generally require a large amount of parallel training data (e.g., hundreds of utterances) from source and target speakers. It is expensive to collect such a large amount of data, and impossible in some applications,...

chapter

On the training of DNN-based average voice model for speech synthesis

Shan Yang, Zhizheng Wu, Lei Xie

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 6

Adaptability and controllability are the major advantages of statistical parametric speech synthesis (SPSS) over unit-selection synthesis. Recently, deep neural networks (DNNs) have significantly improved the performance of SPSS. However, current studies mainly focus on the training of speaker-dependent DNNs, which generally requires a significant amount of data from a single speaker. In this...

chapter

A bi-directional LSTM approach for polyphone disambiguation in Mandarin Chinese

Changhao Shan, Lei Xie, Kaisheng Yao

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) > 1 - 5

Polyphone disambiguation in Mandarin Chinese aims to select the correct pronunciation of a polyphonic character from several candidates. It serves as an essential component in human language technologies such as text-to-speech synthesis. Since the pronunciation of most polyphonic characters can be easily decided from their context in the text, in this paper, we address the polyphone disambiguation...

chapter

Category driven deep recurrent neural network for video summarization

Xinhui Song, Ke Chen, Jie Lei, Li Sun, et al.

2016 IEEE International Conference on Multimedia & Expo Workshops (ICMEW) > 1 - 6

A large number of videos are generated and uploaded to video websites (such as Youku and YouTube) every day, and video websites play an increasingly important role in daily life. While this brings convenience, the sheer volume of video data makes it harder to summarize videos so that users can browse them easily. However, although there are many existing video summarization approaches, the key frames selected...

chapter

Urban land cover change types identification using fully polarimetric SAR descriptors

Lei Xie, Hong Zhang, Chao Wang

2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS) > 4718 - 4721

IGARSS 2016 - 2016 IEEE International Geoscience and Remote Sensing Symposium

Land cover change detection has long been an active field in polarimetric synthetic aperture radar (SAR) applications. In certain cases, we care not only about which areas changed but also about which land cover type changed into which. This paper presents a supervised method for identifying urban land cover change types using a series of polarimetric descriptors from SAR observables and polarimetric decomposition. The normalized...

chapter

A density peak clustering approach to unsupervised acoustic subword units discovery

Jia Yu, Lei Xie, Xiong Xiao, Eng Siong Chng, et al.

2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 178 - 183

This paper studies unsupervised acoustic unit discovery from unlabelled speech data. This task is usually approached in two steps, i.e., partitioning speech utterances into segments and clustering these segments into subword categories. In previous approaches, the clustering step usually assumes that the number of subword units is known beforehand, which is unreasonable for zero-resource languages. Moreover,...

chapter

Multi-view features in a DNN-CRF model for improved sentence unit detection on English broadcast news

Guangpu Huang, Chenglin Xu, Xiong Xiao, Lei Xie, et al.

2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 9

This paper presents a deep neural network-conditional random field (DNN-CRF) system with multi-view features for sentence unit detection on English broadcast news. We proposed a set of multi-view features extracted from the acoustic, articulatory, and linguistic domains, and used them together in the DNN-CRF model to predict the sentence boundaries. We tested the accuracy of the multi-view features...

chapter

Active data based Gaussian process models for nonlinear spatiotemporal systems

Pei Sun, Lei Xie, Junghui Chen

Proceeding of the 11th World Congress on Intelligent Control and Automation > 6061 - 6066

2014 11th World Congress on Intelligent Control and Automation (WCICA)

A new data-driven system identification method, called KL-GP, is proposed for spatiotemporal systems. It combines Karhunen-Loève (KL) decomposition and Gaussian process (GP) models. As the nonlinear spatiotemporal system has strong spatiotemporal characteristics, KL decomposition, with its good characteristics, is employed for time/space separation and dimension reduction. Then the spatiotemporal...

article

Novel Just-In-Time Learning-Based Soft Sensor Utilizing Non-Gaussian Information

Lei Xie, Jiusun Zeng, Chuanhou Gao

IEEE Transactions on Control Systems Technology > 2014 > 22 > 1 > 360 - 368

This brief develops a novel just-in-time (JIT) learning-based soft sensor for modeling of industrial processes. The recorded data is assumed to exhibit non-Gaussian signal components, which are extracted by a non-Gaussian regression (NGR) technique. Unlike previous work on JIT modeling, which uses a distance-based similarity measure for local modeling, this brief introduces a new similarity measure for...

chapter

Face sketch-to-photo synthesis from simple line drawing

Yang Liang, Mingli Song, Lei Xie, Jiajun Bu, et al.

Proceedings of The 2012 Asia Pacific Signal and Information Processing Association Annual Summit and Conference > 1 - 5

2012 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)

Face sketch-to-photo synthesis has attracted increasing attention in recent years for its useful applications in both digital entertainment and law enforcement. Although great progress has been made, previous methods only work on face sketches with rich textures, which are not easy to obtain. In this paper, we propose a robust algorithm for synthesizing a face photo from a simple line drawing that...
