Search results for: Kai Yu

Items from 1 to 13 out of 13 results

chapter

Traffic Signs Detection Based on Faster R-CNN

Zhongrong Zuo, Kai Yu, Qiao Zhou, Xu Wang, et al.

2017 IEEE 37th International Conference on Distributed Computing Systems Workshops (ICDCSW) > 286 - 288

In this paper, we use an advanced method called Faster R-CNN to detect traffic signs. This new method represents the highest level in object recognition: it no longer needs image features to be extracted manually and can segment the image to get candidate region proposals automatically. Our experiment is based on a traffic sign detection competition held in 2016 by CCF and the UISEE company. The mAP (mean average...

chapter

Small-footprint convolutional neural network for spoofing detection

Heinrich Dinkel, Yanmin Qian, Kai Yu

2017 International Joint Conference on Neural Networks (IJCNN) > 3086 - 3091

Although recent progress in speaker verification has engendered powerful models, malicious attacks in the form of spoofed speech are generally not coped with. In previous attempts, deep neural networks were used to extract high-dimensional features, which were later classified using an independent classifier. Even though the results of this approach are promising, this architecture's disadvantage is its...

chapter

End-to-end spoofing detection with raw waveform CLDNNS

Heinrich Dinkel, Nanxin Chen, Yanmin Qian, Kai Yu

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4860 - 4864

Although recent progress in speaker verification has generated powerful models, malicious attacks in the form of spoofed speech are generally not coped with. Recent results in the ASVSpoof2015 and BTAS2016 challenges indicate that spoof-aware features are a possible solution to this problem. Most successful methods in both challenges focus on spoof-aware features, rather than focusing on a powerful classifier...

chapter

Phone-aware LSTM-RNN for voice conversion

Jiahao Lai, Bo Chen, Tian Tan, Sibo Tong, et al.

2016 IEEE 13th International Conference on Signal Processing (ICSP) > 177 - 182

This paper investigates a new voice conversion technique using phone-aware Long Short-Term Memory Recurrent Neural Networks (LSTM-RNNs). Most existing voice conversion methods, including Joint Density Gaussian Mixture Models (JDGMMs), Deep Neural Networks (DNNs) and Bidirectional Long Short-Term Memory Recurrent Neural Networks (BLSTM-RNNs), only take acoustic information of speech as features to...

chapter

Multi-task joint-learning for robust voice activity detection

Yimeng Zhuang, Sibo Tong, Maofan Yin, Yanmin Qian, et al.

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) > 1 - 5

Model-based VAD approaches have been widely used and have achieved success in practice. These approaches usually cast VAD as a frame-level classification problem and employ statistical classifiers, such as a Gaussian Mixture Model (GMM) or a Deep Neural Network (DNN), to assign a speech/silence label to each frame. Because classification assumes that frames are independent, the VAD results tend to be fragile....

chapter

An efficient band selection method for hyperspectral imageries based on covariance matrix

Kang Sun, Tong Shuai, Jinyong Chen, Xiurui Geng, et al.

2016 8th Workshop on Hyperspectral Image and Signal Processing: Evolution in Remote Sensing (WHISPERS) > 1 - 4

Band selection plays an important role in reducing the dimensionality of hyperspectral data sets. It is a combinatorial optimization problem of selecting the optimal band (feature) subset, which generally involves high computational complexity. In this paper, we present an efficient band selection method based on the covariance matrix. The method tries to compute the subset of bands with the largest...

chapter

An investigation on DNN-derived bottleneck features for GMM-HMM based robust speech recognition

Yongbin You, Yanmin Qian, Tianxing He, Kai Yu

2015 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP) > 30 - 34

In recent years, deep neural networks (DNNs) have achieved great success when used as acoustic models in speech recognition. An important application of DNNs is to derive bottleneck features. In this paper, we first investigate the robustness of bottleneck features generated by three types of DNN structures on the Aurora 4 task without any explicit noise compensation. Second, we propose the node-pruning...

chapter

Night Video Surveillance Based on the Second-Order Statistics Features

Cheng Chang Lien, Wen Kai Yu, Chang Hsing Lee, Chin Chuan Han

2014 Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing > 353 - 356

2014 Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP)

Night video surveillance is crucial for constructing an all-weather video surveillance system. However, night video surveillance faces several problems: no color information, low brightness, low contrast, and a low signal-to-noise ratio (SNR). These problems can cause serious false and missed object detections. In this paper, we propose a novel night video surveillance method based on the image second-order...

article

3D Convolutional Neural Networks for Human Action Recognition

Shuiwang Ji, Wei Xu, Ming Yang, Kai Yu

IEEE Transactions on Pattern Analysis and Machine Intelligence > 2013 > 35 > 1 > 221 - 231

We consider the automated recognition of human actions in surveillance videos. Most current methods build classifiers based on complex handcrafted features computed from the raw inputs. Convolutional neural networks (CNNs) are a type of deep model that can act directly on the raw inputs. However, such models are currently limited to handling 2D inputs. In this paper, we develop a novel 3D CNN model...

chapter

Training Conditional Random Fields Using Transfer Learning for Gesture Recognition

Jie Liu, Kai Yu, Yi Zhang, Yalou Huang

2010 IEEE International Conference on Data Mining > 314 - 323

2010 10th IEEE International Conference on Data Mining (ICDM 2010)

Recently, combining Conditional Random Fields (CRFs) with neural networks has been shown to succeed at learning high-level features in sequence labeling tasks. However, such models are difficult to train because the larger number of parameters to tune requires an enormous amount of labeled data to avoid overfitting. In this paper, we propose a transfer learning framework for the sequence labeling task of gesture...

chapter

Locality-constrained Linear Coding for image classification

Jinjun Wang, Jianchao Yang, Kai Yu, Fengjun Lv, et al.

2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition > 3360 - 3367

2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The traditional SPM approach based on bag-of-features (BoF) requires nonlinear classifiers to achieve good image classification performance. This paper presents a simple but effective coding scheme called Locality-constrained Linear Coding (LLC) in place of the VQ coding in traditional SPM. LLC utilizes the locality constraints to project each descriptor into its local-coordinate system, and the projected...

chapter

Human action detection by boosting efficient motion features

Ming Yang, Fengjun Lv, Wei Xu, Kai Yu, et al.

2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops > 522 - 529

Recent years have witnessed significant progress in detection of basic human actions. However, most existing methods rely on assumptions such as known spatial locations and temporal segmentations or employ very computationally expensive approaches such as sliding window search through a spatio-temporal volume. It is difficult for such methods to scale up to handle the challenges in real applications...

chapter

The identification of electromagnetic interferences source by feature extraction using the wavelet packet decomposition

Xi-Lai Ma, Yin-Han Gao, Kai-Yu Yang, Chang-Ying Liu, et al.

2007 International Conference on Wavelet Analysis and Pattern Recognition > 4 > 1817 - 1821

International Conference on Wavelet Analysis and Pattern Recognition, ICWAPR '07

Many techniques for recognizing and identifying disturbed signal waveforms are primarily based on visual inspection. This paper proposes a technique based on wavelet packet decomposition to extract features from the disturbed signals in order to identify the possible causes of the disturbance. On the basis of definition groups of Electromagnetic Interference, the interested information about...
