Search results for: Yuexian Zou

Items from 1 to 20 out of 52 results

chapter

Enhancing speaker verification with short voice commands via autoencoder and phonetic bottleneck learning

Yichi Huang, Yuexian Zou

2017 22nd International Conference on Digital Signal Processing (DSP) > 1 - 5

Deep-learning-based speaker verification (SV) methods have achieved state-of-the-art performance. However, SV with short voice commands (SV-SVC) is still challenging, and its performance degrades significantly when noise is present. Careful examination of the SV-SVC task in real applications reveals two unavoidable limitations. One is the very short utterances used (less than 1 second)...

chapter

Dilated convolution neural network with LeakyReLU for environmental sound classification

Xiaohu Zhang, Yuexian Zou, Wei Shi

2017 22nd International Conference on Digital Signal Processing (DSP) > 1 - 5

Environmental sound classification (ESC) remains an open and challenging task. In contrast to speech, sounds of a specific acoustic event may be produced by a wide variety of sources. Thus, for one class, the feature spectra of acoustic events are much more variable than those of human speech. In order to learn better high-level feature representations from these highly variable feature spectra, convolution...

chapter

Sequence-guided siamese neural network for video summarization of unmanned aerial vehicles

Jin Chen, Yi Wang, Zehan Chen, Yuexian Zou

2017 22nd International Conference on Digital Signal Processing (DSP) > 1 - 5

Video summarization (VS) is one of the key video signal processing techniques for unmanned aerial vehicles (UAVs). Essentially, VS aims at eliminating redundant, highly similar frames in aerial videos (AVs), which helps with quick browsing, retrieval, and efficient storage without losing important information. For the VS technique, how to measure the similarity between video frames is not a trivial...

chapter

Robust speaker DOA estimation based on the inter-sensor data ratio model and binary mask estimation in the bispectrum domain

Yanhan Jin, Yuexian Zou, C. H. Ritz

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 3266 - 3270

When noise is directional instead of diffuse, the majority of conventional direction of arrival (DOA) estimation techniques suffer from performance degradation because of mismatched noise models. In this paper, a novel robust DOA estimation algorithm is developed as an initial investigation into DOA estimation of speech under directional non-speech interference (DNSI) and non-directional background...

chapter

Wireless capsule endoscopy video summarization: A learning approach based on Siamese neural network and support vector machine

Jin Chen, Yuexian Zou, Yi Wang

2016 23rd International Conference on Pattern Recognition (ICPR) > 1303 - 1308

Wireless capsule endoscopy video summarization (WCE-VS) is in high demand for eliminating redundant, highly similar frames. Conventional WCE-VS methods extract various hand-crafted features as image representations. Research shows that such features reflect only the low-level characteristics of a single frame and are essentially ineffective at capturing the semantic similarity between WCE frames...

chapter

Fast visual object counting via example-based density estimation

Yi Wang, Yuexian Zou

2016 IEEE International Conference on Image Processing (ICIP) > 3653 - 3657

Density-estimation-based visual object counting (DE-VOC) methods estimate the object count of an image by integrating over its predicted density map. They are effective but inefficient. This paper proposes a fast DE-VOC method that maintains this effectiveness. Essentially, the feature space of image patches from VOC can be clustered into subspaces, and the examples of each subspace can be collected...

chapter

Cost-sensitive sparse linear regression for crowd counting with imbalanced training data

Xiaolin Huang, Yuexian Zou, Yi Wang

2016 IEEE International Conference on Multimedia and Expo (ICME) > 1 - 6

Video-based crowd counting (VCC) is a highly demanded technique in many video applications. Existing supervised VCC methods essentially learn an intrinsic mapping function between image features and corresponding crowd counts. However, an imbalanced training dataset degrades the performance of VCC significantly. Encouraged by recent success in cost-sensitive learning for image classification with imbalance...

chapter

Robust vehicle logo recognition based on locally collaborative representation with principal components

Zhiqiang Xiang, Yuexian Zou, Xiaoqun Zhou, Xiaolin Huang

2016 Sixth International Conference on Information Science and Technology (ICIST) > 487 - 491

Vehicle logo recognition (VLR) is a key issue in vehicle identification systems. Logo recognition is still a challenging technique, since VLR methods suffer from large within-class variations due to different illumination conditions, different viewpoints, etc. In this paper, motivated by the excellent performance of the collaborative representation based classification (CRC), we formulate VLR...

chapter

An Efficient Learning Based Smartphone Playback Attack Detection Using GMM Supervector

Chun Wang, Yuexian Zou, Shihan Liu, Wei Shi, more

2016 IEEE Second International Conference on Multimedia Big Data (BigMM) > 385 - 389

Playback attack detection (PAD) is essentially a binary classification task used to distinguish authentic recordings from playback recordings. For the PAD problem, the difference in acoustic features between the authentic and playback recordings mainly comes from the recording channel and the ambient noise. Motivated by the excellent performance of the Gaussian Mixture Model-Universal...

article

KCRC-LCD: Discriminative kernel collaborative representation with locality constrained dictionary for visual categorization

Weiyang Liu, Zhiding Yu, Lijia Lu, Yandong Wen, more

Pattern Recognition > 2015 > 48 > 10 > 3076-3092

We consider the image classification problem via kernel collaborative representation classification with locality constrained dictionary (KCRC-LCD). Specifically, we propose a kernel collaborative representation classification (KCRC) approach in which the kernel method is used to improve the discrimination ability of collaborative representation classification (CRC). We then measure the similarities between...

chapter

Multi-kernel collaborative representation for image classification

Weiyang Liu, Zhiding Yu, Yandong Wen, Meng Yang, more

2015 IEEE International Conference on Image Processing (ICIP) > 21 - 25

We consider the image classification problem via multiple kernel collaborative representation (MKCR). We generalize the kernel collaborative representation based classification to a multi-kernel framework where multiple kernels are jointly learned with the representation coefficients. The intrinsic idea of multiple kernel learning is adopted in our MKCR model. Experimental results show MKCR converges...

chapter

Classifying digestive organs in wireless capsule endoscopy images based on deep convolutional neural network

Yuexian Zou, Lei Li, Yi Wang, Jiasheng Yu, more

2015 IEEE International Conference on Digital Signal Processing (DSP) > 1274 - 1278

This paper studies the classification of the digestive organs in wireless capsule endoscopy (WCE) images based on a deep convolutional neural network (DCNN) framework. Essentially, the DCNN has proved to have a powerful ability to learn layer-wise hierarchical models from huge training data, working similarly to the human biological visual system. Classifying digestive organs in WCE images intuitively means...

chapter

Two stages signal strength difference localization algorithm using SDP relaxation

Xiansheng Guo, Lei Chu, Yiming Pi, Yuexian Zou

2015 IEEE International Conference on Digital Signal Processing (DSP) > 957 - 961

We present a novel two-stage signal strength difference (TS-SSD) localization algorithm in this letter. A new model using the TS-SSD technique is derived to eliminate the effects of the path loss exponent and unknown transmit power, and a total least squares (TLS) solution is given to estimate the distances between anchor and target nodes. Then a low-rank matrix completion framework is established to estimate...

chapter

Joint kernel dictionary and classifier learning for sparse coding via locality preserving K-SVD

Weiyang Liu, Zhiding Yu, Meng Yang, Lijia Lu, more

2015 IEEE International Conference on Multimedia and Expo (ICME) > 1 - 6

We present a locality preserving K-SVD (LP-KSVD) algorithm for joint dictionary and classifier learning, and further incorporate kernel into our framework. In LP-KSVD, we construct a locality preserving term based on the relations between input samples and dictionary atoms, and introduce the locality via nearest neighborhood to enforce the locality of representation. Motivated by the fact that locality-related...

chapter

Fast least mean M-estimate algorithms for robust adaptive filtering in impulse noise

Yuexian Zou, Shing-Chow Chan, Tung-Sang Ng

2000 10th European Signal Processing Conference > 1 - 4

Adaptive filters with suitable nonlinear devices are very effective in suppressing the adverse effect due to impulse noise. In a previous work, the authors have proposed a new class of nonlinear adaptive filters using the concept of robust statistics [1,2]. The robust M-estimator is used as the objective function, instead of the mean square errors, to suppress the impulse noise. The optimal coefficient...

chapter

Automatical gender detection for unconstrained video sequences based on collaborative representation

Lijia Lu, Weiyang Liu, Yandong Wen, Yuexian Zou

2014 12th International Conference on Signal Processing (ICSP) > 1263 - 1267

Many intelligent systems are required to deal with the situation of human-computer interaction. As one of the most important front ends, gender classification plays an irreplaceable role. For practical use, a real-time robust gender classification system is presented in this paper. The system consists of three principal modules: image preprocessing, face detector and gender classifier. To enhance...

chapter

Blind timing error estimation based on the phasic relationship between nonoverlapping frequency points in time-interleaved ADCs

Sujuan Liu, Jiashuai Cui, Haixiao Ma, Yuexian Zou

2014 12th IEEE International Conference on Solid-State and Integrated Circuit Technology (ICSICT) > 1 - 3

A time-interleaved analog-to-digital converter (TIADC) system is a good option to significantly increase the sampling rate of an ADC. However, the performance of a TIADC suffers from mismatch errors among the sub-channels, especially the timing error. This paper presents a method to estimate the channel timing error by using the output data from TIADC and its corresponding reference channel. The proposed...

chapter

A novel kernel collaborative representation approach for image classification

Weiyang Liu, Lijia Lu, Hui Li, Wei Wang, more

2014 IEEE International Conference on Image Processing (ICIP) > 4241 - 4245

Sparse representation classification (SRC) plays an important role in pattern recognition. Recently, a more generic method named collaborative representation classification (CRC) has greatly improved the efficiency of SRC. Taking advantage of recent developments in CRC, this paper explores applying the kernel technique to further improve its performance and proposes the kernel CRC (KCRC)...

chapter

Wireless capsule endoscopy image classification based on vector sparse coding

Tao Ma, Yuexian Zou, Zhiqiang Xiang, Lei Li, more

2014 IEEE China Summit & International Conference on Signal and Information Processing (ChinaSIP) > 582 - 586

Wireless capsule endoscopy (WCE) is a promising technology for gastrointestinal disease detection. Since there are more than 50,000 frames in one WCE video of a patient, classifying the whole frame set of the digestive tract into subsets corresponding to esophagus, stomach, small intestine, and colon is necessary, which can help physicians review and diagnose rapidly and accurately. The digestive...

chapter

Long-term auto-correlation statistics based voice activity detection for strong noisy speech

Wei Shi, Yuexian Zou, Yi Liu

2014 IEEE China Summit & International Conference on Signal and Information Processing (ChinaSIP) > 100 - 104

This paper proposes a voice activity detection (VAD) algorithm based on a novel long-term metric. Assuming that the most significant difference between noisy speech and non-speech is the harmonicity of the noisy speech spectrum, which arises from the nature of human speech, the long-term auto-correlation statistics (LTACS) measure is designed and shown to be a powerful metric for VAD. The LTACS measure is calculated...
