Search results

Items from 41 to 60 out of 704 results

chapter

Time-multiplexed / superimposed pilot selection for massive MIMO pilot decontamination

Karthik Upadhya, Sergiy A. Vorobyov, Mikko Vehkapera

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 3459 - 3463

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In massive multiple-input multiple-output (MIMO) systems, superimposed (SP) and time-multiplexed (TM) pilots exhibit a complementary behavior, with the former and latter schemes offering a higher throughput in high and low inter-cell interference scenarios, respectively. Based on this observation, in this paper, we propose an algorithm for partitioning users into two disjoint sets comprising users...

chapter

Learning discriminative features from electroencephalography recordings by encoding similarity constraints

Sebastian Stober

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6175 - 6179

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper introduces a pre-training technique for learning discriminative features from electroencephalography (EEG) recordings using deep neural networks. EEG data are generally only available in small quantities, they are high-dimensional with a poor signal-to-noise ratio, and there is considerable variability between individual subjects and recording sessions. Similarity-constraint encoders as...

chapter

Improving the perceptual quality of ideal binary masked speech

Leo Lightburn, Enzo De Sena, Alastair Moore, Patrick A. Naylor, more

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 661 - 665

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

It is known that applying a time-frequency binary mask to very noisy speech can improve its intelligibility but results in poor perceptual quality. In this paper we propose a new approach to applying a binary mask that combines the intelligibility gains of conventional binary masking with the perceptual quality gains of a classical speech enhancer. The binary mask is not applied directly as a time-frequency...

chapter

Robust Automatic Recognition of Speech with background music

Jiri Malek, Jindrich Zdansky, Petr Cerva

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5210 - 5214

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper addresses the task of Automatic Speech Recognition (ASR) with music in the background, where the accuracy of recognition may deteriorate significantly. To improve the robustness of ASR in this task, e.g. for broadcast news transcription or subtitles creation, we adopt two approaches: 1) multi-condition training of the acoustic models and 2) denoising autoencoders followed by acoustic model...

chapter

Digital predistortion for hybrid precoding architecture in millimeter-wave massive mimo systems

Han Yan, Danijela Cabric

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 3479 - 3483

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Millimeter-wave (mmWave) systems require massive antennas at both transmitter and receiver to reach desirable link budget. Analog-digital hybrid beamforming is a promising architecture since it significantly reduces the hardware cost while approaches the performance of digital beamforming. However, the nonlinear power amplifier (PA) introduces intermodulation interference and degrades the spectral...

chapter

Data interface buffer compensation scheme for fast calibration

Sameer Shekhar, Amit K. Jain, Pooja Nukala

2017 18th International Symposium on Quality Electronic Design (ISQED) > 296 - 300

2017 18th International Symposium on Quality Electronic Design (ISQED)

Microprocessors and FPGAs need to enable simpler and compact platforms via integration of self-contained test and training circuits, training of data interface buffers for process and temperature variation being a prime example. This paper studies package embedding of resistors for buffer tuning and presents a scheme to utilize a single resistor to train a large number of buffers without increase...

chapter

Multiple-target deep learning for LSTM-RNN based speech enhancement

Lei Sun, Jun Du, Li-Rong Dai, Chin-Hui Lee

2017 Hands-free Speech Communications and Microphone Arrays (HSCMA) > 136 - 140

2017 Hands-free Speech Communications and Microphone Arrays (HSCMA)

In this study, we explore long short-term memory recurrent neural networks (LSTM-RNNs) for speech enhancement. First, a regression LSTM-RNN approach for a direct mapping from the noisy to clean speech features is presented and verified to be more effective than deep neural network (DNN) based regression techniques in modeling long-term acoustic context. Then, a comprehensive comparison between the...

chapter

Efficient target activity detection based on recurrent neural networks

Daniel Gerber, Stefan Meier, Walter Kellermann

2017 Hands-free Speech Communications and Microphone Arrays (HSCMA) > 46 - 50

2017 Hands-free Speech Communications and Microphone Arrays (HSCMA)

This paper addresses the problem of Target Activity Detection (TAD) for binaural listening devices. TAD denotes the problem of robustly detecting the activity of a target speaker in a harsh acoustic environment, which comprises interfering speakers and noise ('cocktail party scenario'). In previous work, it has been shown that employing a Feed-forward Neural Network (FNN) for detecting the target...

chapter

Learning to communicate: Channel auto-encoders, domain specific regularizers, and attention

Timothy J. O'Shea, Kiran Karra, T. Charles Clancy

2016 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT) > 223 - 228

2016 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT)

We address the problem of learning an efficient and adaptive physical layer encoding to communicate binary information over an impaired channel. In contrast to traditional work, we treat the problem an unsupervised machine learning problem focusing on optimizing reconstruction loss through artificial impairment layers in an autoencoder (we term this a channel autoencoder) and introduce several new...

chapter

A speech enhancement algorithm using computational auditory scene analysis with spectral subtraction

Cong Guo, Like Hui, Wei-Qiang Zhang, Jia Liu

2016 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT) > 6 - 10

2016 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT)

Computational auditory scene analysis (CASA) system is well used in speech enhancement area in recent years. We propose a new system that combines CASA and spectral subtraction to get better enhanced speech. The CASA part consists of the latest method deep neural networks (DNNs). The original way to reconstruct the denoise signal is to use the estimated masks with direct overlap-add method ignoring...

chapter

Discrimination of RF Harmonics Using Classification Restricted Boltzmann Machine

Hao Li, Kwangyul Kim, Yoan Shin

2016 IEEE International Conference on Internet of Things (iThings) and IEEE Green Computing and Communications (GreenCom) and IEEE Cyber, Physical and Social Computing (CPSCom) and IEEE Smart Data (SmartData) > 590 - 593

This paper proposes a new detection scheme for concealed micro-electronic devices by analyzing harmonic waves which are reflected from targets with classification restricted Boltzmann machine algorithm (Class/RBM). This new method exploits the characteristics of the second and the third harmonics waves to classify metal and electronic devices, as is done in all other Pdetection schemes. Moreover the...

chapter

Beam tracking for mobile millimeter wave communication systems

Vutha Va, Haris Vikalo, Robert W. Heath

2016 IEEE Global Conference on Signal and Information Processing (GlobalSIP) > 743 - 747

2016 IEEE Global Conference on Signal and Information Processing (GlobalSIP)

Millimeter wave (mmWave) is an attractive option for high data rate applications. Enabling mmWave communications requires appropriate beamforming, which is conventionally realized by a lengthy beam training process. Such beam training will be a challenge for applying mmWave to mobile environments. As a solution, a beam tracking method requiring to train only one beam pair to track a path in the analog...

chapter

Compressive sensing based initial beamforming training for massive MIMO millimeter-wave systems

Han Yan, Danijela Cabria

2016 IEEE Global Conference on Signal and Information Processing (GlobalSIP) > 620 - 624

2016 IEEE Global Conference on Signal and Information Processing (GlobalSIP)

The abundant spectrum at millimeter-wave (mmWave) has the potential to greatly increase the capacity of 5G cellular systems. However, to overcome the high pathloss in the mmWave frequencies, beamforming with large antenna arrays is required at both the base station and user equipments for sufficient link budget. This feature is a challenge for beamforming training during initial access due to low...

chapter

Speech enhancement using Long Short-Term Memory based recurrent Neural Networks for noise robust Speaker Verification

Morten Kolboek, Zheng-Hua Tan, Jesper Jensen

2016 IEEE Spoken Language Technology Workshop (SLT) > 305 - 311

2016 IEEE Spoken Language Technology Workshop (SLT)

In this paper we propose to use a state-of-the-art Deep Recurrent Neural Network (DRNN) based Speech Enhancement (SE) algorithm for noise robust Speaker Verification (SV). Specifically, we study the performance of an i-vector based SV system, when tested in noisy conditions using a DRNN based SE front-end utilizing a Long Short-Term Memory (LSTM) architecture. We make comparisons to systems using...

chapter

Deep neural network driven mixture of PLDA for robust i-vector speaker verification

Na Li, Man-Wai Mak, Jen-Tzung Chien

2016 IEEE Spoken Language Technology Workshop (SLT) > 186 - 191

2016 IEEE Spoken Language Technology Workshop (SLT)

In speaker recognition, the mismatch between the enrollment and test utterances due to noise with different signal-to-noise ratios (SNRs) is a great challenge. Based on the observation that noise-level variability causes the i-vectors to form heterogeneous clusters, this paper proposes using an SNR-aware deep neural network (DNN) to guide the training of PLDA mixture models. Specifically, given an...

chapter

How Much Training Is Needed in One-Bit Massive MIMO Systems at Low SNR?

Yongzhi Li, Cheng Tao, Liu Liu, Amine Mezghani, more

2016 IEEE Global Communications Conference (GLOBECOM) > 1 - 6

GLOBECOM 2016 - 2016 IEEE Global Communications Conference

This paper considers training-based transmissions in massive multi-input multi-output (MIMO) systems with one-bit analog-to-digital converters (ADCs). We assume that each coherent transmission block consists of a pilot training stage and a data transmission stage. The base station (BS) first employs the linear minimum mean-square-error (LMMSE) method to estimate the channel and then uses the maximum-ratio...

chapter

Packet Structure and Receiver Design for Low-Latency Communications with Ultra-Small Packets

Byungju Lee, Sunho Park, David J. Love, Hyoungju Ji, more

2016 IEEE Global Communications Conference (GLOBECOM) > 1 - 6

GLOBECOM 2016 - 2016 IEEE Global Communications Conference

5G wireless standards require a much lower latency than what current wireless systems can guarantee. The main challenge to fulfill this requirement is the capability to support short packet transmission, in contrast to most of the current standards which use a long data packet structure. In this paper, we propose an efficient receiver technique that exploits information obtained during the data transmission...

chapter

Boosting DNN-based speech enhancement via explicit transformations

Qing Wang, Jun Du, Li-Rong Dai

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 4

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

In this study, we investigate on the learning behaviors of DNN by explicit feature transformations. As a demonstration, linear and logarithm transformations, corresponding to the amplitude spectra and log-power spectra, are compared with the same minimum mean squared error (MMSE) objective function for optimizing DNN parameters. Based on the experimental analysis of the DNN learning behaviors, we...

chapter

Audio-visual speech enhancement using deep neural networks

Jen-Cheng Hou, Syu-Siang Wang, Ying-Hui Lai, Jen-Chun Lin, more

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 6

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

This paper proposes a novel framework that integrates audio and visual information for speech enhancement. Most speech enhancement approaches consider audio features only to design filters or transfer functions to convert noisy speech signals to clean ones. Visual data, which provide useful complementary information to audio data, have been integrated with audio data in many speech-related approaches...

chapter

Unsupervised single-channel speech separation via deep neural network for different gender mixtures

Yannan Wang, Jun Du, Li-Rong Dai, Chin-Hui Lee

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 4

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

In this study, we propose a regression approach via deep neural network (DNN) for unsupervised speech separation in a single-channel setting. We rely on a key assumption that two speakers could be well segregated if they are not too similar to each other. A dissimilarity measure between two speakers is then proposed to characterize the separation ability between competing speakers. We demonstrate...

Keywords:
TRAINING
SIGNAL TO NOISE RATIO

Publication date

Set your own date range

Content availability

Available (697)
None (7)

Keywords

CHANNEL ESTIMATION (209)
SPEECH (105)
ESTIMATION (103)
MIMO (98)
OFDM (89)
FEATURE EXTRACTION (88)
INTERFERENCE (84)
MIMO COMMUNICATION (73)
RECEIVERS (73)
ARTIFICIAL NEURAL NETWORKS (67)
NOISE MEASUREMENT (65)
OFDM MODULATION (53)
CORRELATION (51)
ARRAY SIGNAL PROCESSING (48)
RELAYS (43)
FADING (42)
MAXIMUM LIKELIHOOD ESTIMATION (42)
HIDDEN MARKOV MODELS (41)
NOISE (40)
BIT ERROR RATE (39)
ROBUSTNESS (39)
NEURAL NETWORKS (38)
SPEECH RECOGNITION (38)
WIRELESS COMMUNICATION (38)
SYNCHRONIZATION (35)
SUPPORT VECTOR MACHINES (34)
ACCURACY (33)
COVARIANCE MATRIX (33)
FADING CHANNELS (33)
MODULATION (33)
VECTORS (32)
DETECTORS (31)
NEURAL NETS (31)
ANTENNAS (30)
CHANNEL STATE INFORMATION (30)
RECEIVING ANTENNAS (30)
SIGNAL PROCESSING (29)
SIGNAL PROCESSING ALGORITHMS (29)
LEAST MEAN SQUARES METHODS (28)
TESTING (28)
ALGORITHM DESIGN AND ANALYSIS (27)
TRANSMITTERS (27)
DATA MINING (26)
DATA MODELS (26)
COVARIANCE MATRICES (25)
DOWNLINK (25)
OPTIMIZATION (25)
NEURONS (24)
SENSORS (24)
SPEECH ENHANCEMENT (24)
FREQUENCY ESTIMATION (23)
ACOUSTICS (22)
COGNITIVE RADIO (22)
MATHEMATICAL MODEL (22)
DECODING (21)
ENCODING (21)
TRAINING DATA (21)
TRANSMITTING ANTENNAS (21)
COMPLEXITY THEORY (20)
COMPUTATIONAL MODELING (20)
MEL FREQUENCY CEPSTRAL COEFFICIENT (20)
WIRELESS CHANNELS (20)
LEAST SQUARES APPROXIMATIONS (19)
RADAR (19)
ANTENNA ARRAYS (18)
ORTHOGONAL FREQUENCY DIVISION MULTIPLEXING (18)
SPEECH PROCESSING (18)
DATABASES (17)
ERROR STATISTICS (17)
MEAN SQUARE ERROR METHODS (17)
SIMULATION (17)
APPROXIMATION METHODS (16)
GAIN (16)
PHASE SHIFT KEYING (16)
SIGNAL-TO-NOISE RATIO (16)
SNR (16)
THROUGHPUT (16)
WIRELESS LAN (16)
COMPUTATIONAL COMPLEXITY (15)
DATA COMMUNICATION (15)
DIVERSITY RECEPTION (15)
EQUALIZERS (15)
ITERATIVE METHODS (15)
OPTICAL NOISE (15)
ADAPTATION MODEL (14)
CONVERGENCE (14)
DELAY (14)
DOPPLER EFFECT (14)
EIGENVALUES AND EIGENFUNCTIONS (14)
MULTIPATH CHANNELS (14)
RADIO NETWORKS (14)
RESOURCE MANAGEMENT (14)
SYNCHRONISATION (14)
WHITE NOISE (14)
CLASSIFICATION ALGORITHMS (13)
DICTIONARIES (13)
ELECTROENCEPHALOGRAPHY (13)
ELECTRONIC MAIL (13)
more

INFONA - science communication portal

Search results

Time-multiplexed / superimposed pilot selection for massive MIMO pilot decontamination

Learning discriminative features from electroencephalography recordings by encoding similarity constraints

Improving the perceptual quality of ideal binary masked speech

Robust Automatic Recognition of Speech with background music

Digital predistortion for hybrid precoding architecture in millimeter-wave massive mimo systems

Data interface buffer compensation scheme for fast calibration

Multiple-target deep learning for LSTM-RNN based speech enhancement

Efficient target activity detection based on recurrent neural networks

Learning to communicate: Channel auto-encoders, domain specific regularizers, and attention

A speech enhancement algorithm using computational auditory scene analysis with spectral subtraction

Discrimination of RF Harmonics Using Classification Restricted Boltzmann Machine

Beam tracking for mobile millimeter wave communication systems

Compressive sensing based initial beamforming training for massive MIMO millimeter-wave systems

Speech enhancement using Long Short-Term Memory based recurrent Neural Networks for noise robust Speaker Verification

Deep neural network driven mixture of PLDA for robust i-vector speaker verification

How Much Training Is Needed in One-Bit Massive MIMO Systems at Low SNR?

Packet Structure and Receiver Design for Low-Latency Communications with Ultra-Small Packets

Boosting DNN-based speech enhancement via explicit transformations

Audio-visual speech enhancement using deep neural networks

Unsupervised single-channel speech separation via deep neural network for different gender mixtures

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options