Search results for: Zhen Yang

Items from 1 to 16 out of 16 results

chapter

Multidimensional speaker information recognition based on proposed baseline system

Shan Li, Longting Xu, Zhen Yang

2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC) > 1776 - 1780

2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC)

Traditional speech-related identity recognition commonly focuses on a single aspect of speech signals, but in reality speech signals are made up of semantics, speaker-dependent features, etc. This paper therefore presents a new study that simultaneously recognizes multidimensional speaker information. In order to extract sufficient relational features, both high-level and low-level features...

chapter

An Adaptive Multiscale Framework for Compressed Sensing of Speech Signal

Linhui Sun, Xi Shao, Zhen Yang

2010 6th International Conference on Wireless Communications, Networking and Mobile Computing (WiCOM) > 1 - 4

2010 6th International Conference on Wireless Communications, Networking and Mobile Computing (WiCOM)

In this paper, a matrix form of Sym wavelet synthesis is deduced, keeping the length of the coefficients no greater than the length of the original speech signals, and then we propose an Adaptive Multiscale Compressed Sensing (AMCS) method, which designs the sensing matrix and the number of wavelet decomposition levels adaptively, according to the sparsity of the wavelet coefficients at each level of the speech signals...

chapter

Simple and effective speech steganography in G.723.1 low-rate codes

Tingting Xu, Zhen Yang

2009 International Conference on Wireless Communications & Signal Processing > 1 - 4

2009 International Conference on Wireless Communications & Signal Processing

This paper presents a simple and effective steganography approach applied to 5.3 Kbps G.723.1 low-rate coded speech, based on analyzing the redundancy of coded parameters. An augmented identity matrix is used to reduce the modification to the cover speech and correspondingly enhance the imperceptibility of the mixed speech. The scheme has good transparency and low computational complexity, which is easy...

chapter

Transcoding Scheme between AMR-WB and VMR-WB

Shi-kui Wang, Zhi-hong Yang, Zhen-yang Wu

2009 2nd International Congress on Image and Signal Processing > 1 - 5

2009 2nd International Congress on Image and Signal Processing (CISP)

The adaptive multirate wideband (AMR-WB) speech codec and the variable-rate multimode wideband (VMR-WB) speech codec are two coding standards based on the CELP model for processing wideband input speech. When communication occurs between them, transcoding must be performed to translate the encoding format from one standard to the other. In this paper, an effective transcoding scheme is presented which makes...

chapter

Adaptive compressed sensing of speech signal based on data-driven dictionary

Tingting Xu, Zhen Yang, Xi Shao

2009 15th Asia-Pacific Conference on Communications > 257 - 260

2009 15th Asia-Pacific Conference on Communications (APCC 2009)

Compressed sensing (CS) is an emerging signal acquisition theory that provides a universal approach for characterizing signals which are sparse or compressible on some basis at sub-Nyquist sampling rate. This paper focuses on the realization of CS on natural speech signals. We construct an over-complete data-driven dictionary as the sparse basis specialized for speech signals. Based on this, CS sampling...

chapter

Novel Speech Secure Communication System Based on Information Hiding and Compressed Sensing

Tingting Xu, Zhen Yang, Xi Shao

2009 Fourth International Conference on Systems and Networks Communications > 201 - 206

2009 Fourth International Conference on Systems and Networks Communications (ICSNC)

This paper proposes a novel scheme for secure speech communication based on information hiding and Compressed Sensing (CS). The scheme first uses CS technology to compress the secret speech and reduce the information bit rate to be embedded, which is significantly different from state-of-the-art secret speech processing methods. The secret bit stream is then embedded into the cover speech based on SCS (Scalar...

chapter

Modeling Articulatory Movements for Voice Conversion Using State-Space Model

Ning Xu, Zhen Yang, Wei-Ping Zhu

2009 Fifth International Conference on Natural Computation > 5 > 236 - 240

2009 Fifth International Conference on Natural Computation (ICNC 2009)

In this paper, we present a new voice conversion method based on the state-space model (SSM). A modified version of the conventional SSM is first proposed to describe the relationship between the source speech and the target speech in the spectral domain. Then the expectation-maximization (EM) and variational Bayesian (VB) algorithms are individually employed to estimate the SSM parameters, resulting...

chapter

A Robust Voice Activity Detection Algorithm in Nonstationary Noise

Jianjun Lei, Jiachen Yang, Jian Wang, Zhen Yang

2009 International Conference on Industrial and Information Systems > 195 - 198

2009 International Conference on Industrial and Information Systems (IIS 2009)

In this paper, we propose a new voice activity detection (VAD) algorithm to improve the speech detection robustness in nonstationary noisy environments. At the front end, Wiener filtering speech enhancement is adopted to suppress noise from noisy speech. Then, at the back end, a voice activity detector based on mel filter-bank spectral entropy is presented to distinguish speech from noise. We have evaluated...

chapter

A Robust Feature Normalization Algorithm for Automatic Speech Recognition

Jianjun Lei, Zhen Yang, Jian Wang

2009 International Joint Conference on Artificial Intelligence > 473 - 475

2009 International Joint Conference on Artificial Intelligence (JCAI)

In this paper, we present an effective feature normalization algorithm to improve the robustness of automatic speech recognition systems. At the front end, minimum mean square error log-spectral amplitude estimation speech enhancement is adopted to suppress noise from noisy speech. Then, at the back end, histogram equalization feature normalization is used to deal with the residual mismatch between enhanced...

chapter

Robust Voice Activity Detection Based on Spectral Entropy and Two-Stage Mel-Warped Wiener Filtering

Jianjun Lei, Jian Wang, Zhen Yang

2008 Second International Symposium on Intelligent Information Technology Application > 2 > 306 - 309

2008 Second International Symposium on Intelligent Information Technology Application

This paper proposes a novel voice activity detection (VAD) algorithm to improve the speech detection robustness in noisy environments. In the proposed algorithm, a two-stage mel-warped Wiener filter is introduced to improve the performance of the voice activity detector based on spectral entropy. Then an improved decision rule based on spectral entropy is derived. We have evaluated system performance under...

chapter

A precise estimation of vocal tract parameters for high quality voice morphing

Ning Xu, Zhen Yang

2008 9th International Conference on Signal Processing > 684 - 687

2008 9th International Conference on Signal Processing (ICSP 2008)

One of the most recent models for voice conversion is the classical LPC analysis-synthesis model combined with GMM, which aims to separate information from excitation and vocal tract and to learn the transformation rules with statistical methods. However, it does not work as well as it is supposed to, owing to the inaccuracy of the extracted feature information as well as the overly-smoothed spectral...

chapter

An improved phase-space voicing-state classification for co-channel speech based on pitch detection

Haiyan Guo, Xi Shao, Zhen Yang

2008 9th International Conference on Signal Processing > 680 - 683

2008 9th International Conference on Signal Processing (ICSP 2008)

This paper presents an improved phase-space voicing-state classification method based on pitch detection to simultaneously determine the voicing state of two speakers present in a segment of co-channel speech. Three possible voicing states are considered: Unvoiced/Unvoiced (U/U), Voiced/Unvoiced (V/U), Voiced/Voiced (V/V). Firstly, the method employs a phase-space voicing-state classification algorithm...

chapter

Single-channel speaker separation based on sub-spectrum GMM and Bayesian theory

Haiyan Guo, Xi Shao, Zhen Yang

2008 9th International Conference on Signal Processing > 701 - 704

2008 9th International Conference on Signal Processing (ICSP 2008)

The problem of single-channel speaker separation attempts to extract the speech signal uttered by the speaker of interest from a single-channel signal containing a mixture of acoustic signals. Most current techniques fail to eliminate the interfering signal completely. In this paper, we present a new approach to solve this problem. It's an iterative separation approach based on sub-spectrum GMM...

chapter

A Novel Voice Morphing System Using Bi-GMM for High Quality Transformation

Ning Xu, Xi Shao, Zhen Yang

2008 Ninth ACIS International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing > 485 - 489

2008 Ninth ACIS International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing (SNPD)

This paper presents a novel voice morphing system which reproduces high quality speech while maintaining the majority of the target characteristics. Bi-GMM is so named because it uses the GMM technique both to estimate the mapping functions and to generate a codebook. Compared with the traditional GMM technique, a maximum likelihood estimation framework combined with codebook compensation technique is...

chapter

Voice Conversion Using Canonical Correlation Analysis Based on Gaussian Mixture Model

ZhiHua Jian, Zhen Yang

Eighth ACIS International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing (SNPD 2007) > 1 > 210 - 215

2007 8th ACIS International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing

A novel algorithm for voice conversion is proposed in this paper. The mapping function of spectral vectors of the source and target speakers is calculated by the canonical correlation analysis (CCA) estimation based on Gaussian mixture models. Since the spectral envelope feature retains most of the second-order statistical information contained in speech after linear prediction (LPC) analysis, the...

chapter

A Real-time Secure Voice Communication System Based on Speech Recognition

Zongyuan Deng, Zhen Yang, Lixin Deng

2006 International Conference on Systems and Networks Communication (ICSNC'06) > 22

2006 International Conference on Systems and Networks Communication (ICSNC'06)

This paper proposes a scheme of real-time secure communication system based on information hiding and speech recognition. The algorithm uses speech recognition to reduce the bit-rate of secret speech greatly. Then we design an information hiding algorithm by adaptively choosing embedding locations and adopting the multi-nary modulation technique. Experimental results show that this algorithm has good...
