Search results

Items from 1 to 20 out of 34 results

chapter

Robust speaker recognition based on multi-stream features

Ning Wang, Lei Wang

2016 IEEE International Conference on Consumer Electronics-China (ICCE-China) > 1 - 4

2016 IEEE International Conference on Consumer Electronics-China (ICCE-China)

In this paper, we investigate the effect of the G.723.1 codec (6.3 kbps) on a speaker recognition system. To improve robustness to codec mismatch, we use Power Normalized Cepstral Coefficients (PNCC), a new robust acoustic feature, to improve the performance of the speaker verification system. A modified SCF speech feature is also proposed to improve the robustness under codec mismatch...

chapter

On the robustness of action recognition methods in compressed and pixel domain

Vignesh Srinivasan, Serhan Gul, Sebastian Bosse, Jan Timo Meyer, et al.

2016 6th European Workshop on Visual Information Processing (EUVIP) > 1 - 6

2016 6th European Workshop on Visual Information Processing (EUVIP)

This paper investigates the robustness of two state-of-the-art action recognition algorithms: a pixel domain approach based on 3D convolutional neural networks (C3D) and a compressed domain approach requiring only partial decoding of the video, based on feature description using motion vectors and Fisher vector encoding (MV-FV). We study the robustness of the two algorithms against: (i) quality variations,...

chapter

Error robust low delay audio coding using spherical logarithmic quantization

Stephan Preihs, Timm Lamprecht, Jorn Ostermann

2016 24th European Signal Processing Conference (EUSIPCO) > 1970 - 1974

2016 24th European Signal Processing Conference (EUSIPCO)

This paper reveals the potential gain in audio quality that can be achieved by combining Spherical Logarithmic Quantization (SLQ) with advanced broadband error robust low delay audio coding based on ADPCM. We briefly summarize the basic properties and mechanisms of SLQ and the employed ADPCM scheme and show how they can be combined in a freely parameterizable coding algorithm. The resulting codec...

chapter

Quality and Error Robustness Assessment of Low-Latency Lightweight Intra-Frame Codecs

Alexandre Willeme, Benoit Macq

2016 Data Compression Conference (DCC) > 637

2016 Data Compression Conference (DCC)

Up to now, many existing video transmission and storage infrastructures are not able to handle uncompressed UHD video in real time. For instance, the transmission of 4K UHD 4:2:2 10-bit 60p requires approximately 4 times the bandwidth available on a 3G-SDI cable. To reduce the required bitrates, a low-latency lightweight compression scheme is needed. To this end, several standardization efforts...

chapter

Robust speech coding with EVS

Anssi Ramo, Adriana Vasilache, Henri Toukomaa

2015 IEEE Global Conference on Signal and Information Processing (GlobalSIP) > 775 - 779

2015 IEEE Global Conference on Signal and Information Processing (GlobalSIP)

This paper discusses the voice and audio quality characteristics of EVS, the recently standardized 3GPP codec. Frame erasure conditions in particular were evaluated. EVS was compared with the industry-standard voice codecs 3GPP AMR and AMR-WB, as well as with direct signals at varying bandwidths. Speech quality was evaluated with two subjective listening tests containing clean and noisy speech in the Finnish language...

chapter

Adaptive pre- and post-filtering for a subband ADPCM-based low delay audio codec

Stephan Preihs, Christoph Wacker, Jorn Ostermann

2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) > 1 - 5

2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

This paper addresses the combination of a low delay subband ADPCM-based audio codec with adaptive pre- and post-filtering for psychoacoustic noise shaping. We present how our basic scheme for error robust subband coding can be combined with two cascades of band shelving filters. The gain parameters of these filters are adapted by an algorithm that is based on power estimates which are obtained from...

chapter

An Automatic Watermarking in CELP Speech Codec Based on Formant Tuning

Erick Christian Garcia Alvarez, Shengbei Wang, Masashi Unoki

2015 International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP) > 160 - 163

2015 International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP)

This paper proposes the unification of the code-excited linear prediction (CELP) codec process with watermarking based on formant tuning. The serial problem in watermarking and then encoding with the CELP codec was thereby reduced by using the proposed method, which also increased the bit detection rate. We took advantage of two key properties: I) humans do not perceive alterations applied to formants...

chapter

Privacy-enhanced perceptual hashing of audio data

Heiko Knospe

2013 International Conference on Security and Cryptography (SECRYPT) > 1 - 6

2013 International Conference on Security and Cryptography (SECRYPT)

Audio hashes are compact and robust representations of audio data and allow the efficient identification of specific recordings and their transformations. Audio hashing for music identification is well established and similar algorithms can also be used for speech data. A possible application is the identification of replayed telephone spam. This contribution investigates the security and privacy...

chapter

Rapid and generalized identification of packetized voice traffic flows

Philip Branch, Jason But

37th Annual IEEE Conference on Local Computer Networks > 85 - 92

2012 IEEE 37th Conference on Local Computer Networks (LCN 2012)

In this paper we describe the construction and performance of classifiers able to identify Variable Rate VoIP traffic flows rapidly, reliably and independently of the application version that generated it. We show that features calculated on short sequences of packets extracted from the flow (sub-flows) are sufficient to identify VoIP flows with Recall of 99% and Precision of 90%. The features we...

chapter

Detection of Tampering in Speech Signals with Inaudible Watermarking Technique

Masashi Unoki, Ryota Miyauchi

2012 Eighth International Conference on Intelligent Information Hiding and Multimedia Signal Processing > 118 - 121

2012 Eighth International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP)

There have recently been serious social issues involved in multimedia signal processing such as malicious attacks and tampering with digital audio/speech signals. Fragile speech watermarking is a technique that enables the detection of tampering with the original signals. We previously proposed an inaudible digital-audio watermarking approach based on cochlear delay. We investigated how the proposed...

chapter

Time-domain audio watermarking using multiple marking spaces

Md. Rifat Shahriar, Sangjin Cho, Ui-pil Chong

2012 International Conference on Informatics, Electronics & Vision (ICIEV) > 974 - 979

2012 International Conference on Informatics, Electronics & Vision (ICIEV)

In this paper, a time-domain audio watermarking scheme is proposed where embedding is done in two different marking spaces which are obtained from the host audio by exploiting the properties of the polar coordinate system. This technique has the advantage of higher embedding capacity due to its double utilization of the same set of audio samples during insertion of the watermark message. Simulation results...

chapter

FEC-based packet loss recovery for AVS-M audio codec

Jianli Liu, Shenghui Zhao, Jing Wang, Jingming Kuang

2011 International Conference on Multimedia Technology > 3069 - 3072

2011 International Conference on Multimedia Technology (ICMT)

In this paper, we utilize sender-based Forward Error Correction (FEC) techniques to enhance the robustness of packet loss recovery for the AVS Mobile speech and audio (AVS-M) codec. Two FEC schemes are proposed which take advantage of the codec's structural characteristics and do not introduce extra delay. The objective and subjective listening test results show that the two methods achieve higher...

chapter

Design of error-resilient M-description codec over wireless broadcasting networks

Meng Yang, Xuguang Lan, Nanning Zheng

2011 IEEE Consumer Communications and Networking Conference (CCNC) > 531 - 532

2011 IEEE Consumer Communications and Networking Conference (CCNC 2011)

A practical error-resilient M-description codec scheme is designed to combat the bit errors of wireless broadcasting networks and raise the quality of the reconstructed signal. The signal is coded into a large number of mutually refinable descriptions by a robust staggered M-description scalar quantizer (RSMDSQ). Then an index assignment method is used to enhance the error-resilient capacity of any...

chapter

Robust Multiuser Precoder for Base Station Cooperative Transmission with Non-Ideal Channel Reciprocity

Shengqian Han, Liyan Su, Chenyang Yang, Gang Wang, et al.

2010 IEEE Global Telecommunications Conference GLOBECOM 2010 > 1 - 5

2010 IEEE Global Communications Conference (GLOBECOM 2010)

In this paper we present a method to alleviate the performance degradation caused by non-ideal channel reciprocity in TDD downlink base station (BS) cooperative transmission systems, which comes from imperfect antenna calibration among BSs. By exploiting the statistics of the ambiguity factors between uplink and downlink channels, a robust multiuser precoder is proposed aimed at maximizing the lower...

chapter

Aerial Acoustic Modem with Decoding Capabilities Using a CELP-Based Speech Encoder

A Nishimura

2010 Sixth International Conference on Intelligent Information Hiding and Multimedia Signal Processing > 514 - 517

2010 Sixth International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIHMSP 2010)

A technology for aerial transmission of acoustic data which is robust against background noise in reverberant spaces is proposed. Hidden data are encoded as complex tones whose fundamental frequencies correspond to the chromatic scale. The decoding of hidden data is based on a pitch extraction algorithm that is employed in a CELP-based speech codec. Computer simulations revealed that the average bit...

chapter

Robust and energy-efficient DSP systems via output probability processing

Rami A Abdallah, Naresh R Shanbhag

2010 IEEE International Conference on Computer Design > 38 - 44

2010 IEEE International Conference on Computer Design (ICCD 2010)

This paper proposes to employ error statistics of nanoscale circuit fabrics to design robust energy-efficient digital signal processing (DSP) systems. Architectural level error statistics are exploited to generate probability or the reliability of each output bit of a DSP kernel. The proposed technique is referred to here as bit-level a posteriori probability processing (BLAPP). Energy efficiency...

chapter

Improving the Speech Quality with OSC: Double Full-Rate Performance Assessment

R C D Paiva, R D Vieira, R Järvelä, R F Iida, et al.

2010 IEEE 72nd Vehicular Technology Conference - Fall > 1 - 5

2010 IEEE 72nd Vehicular Technology Conference Fall

Speech quality is an important measure for performance evaluation in a wireless mobile communication system, since voice is still its most used service. This paper addresses speech quality evaluation in a GSM system employing narrowband and wideband AMR codecs with the Orthogonal Sub Channel (OSC) technique. OSC is a feature proposed in 3GPP GERAN to double circuit switched capacity in...

chapter

Robust image compression based on compressive sensing

Chenwei Deng, Weisi Lin, Bu-sung Lee, Chiew Tong Lau

2010 IEEE International Conference on Multimedia and Expo > 462 - 467

2010 IEEE International Conference on Multimedia and Expo (ICME)

Existing image compression methods (e.g., JPEG2000) are vulnerable to bit loss, and this is usually tackled by the channel coding that follows. However, source coding and channel coding have conflicting requirements. In this paper, we address the problem with an alternative paradigm, and a novel compressive sensing (CS) based compression scheme is therefore proposed. Discrete wavelet transform...

chapter

VoIP network performance evaluation of operating systems with IPv4 and IPv6 network implementations

Shaneel Narayan, Matthew Gordon, Chad Branks, Li Fan

2010 3rd International Conference on Computer Science and Information Technology > 5 > 669 - 673

2010 3rd IEEE International Conference on Computer Science and Information Technology (ICCSIT 2010)

VoIP implementations are nowadays the preferred information technology alternative to public switched telephone networks. With dependence on this technology, VoIP quality and performance are critical. In this paper, we implement some commonly used VoIP codecs on Windows desktop operating systems to evaluate their performance on two versions of IP, namely IPv4 and IPv6. Performance-related metrics...

chapter

Sub-Sampling Framework of Distributed Video Coding

Wenbo Xu, Zhiqiang He, Kai Niu, Jiaru Lin

Proceedings of 2010 IEEE International Symposium on Circuits and Systems > 1145 - 1148

2010 IEEE International Symposium on Circuits and Systems. ISCAS 2010

Distributed video coding (DVC) has recently been proposed to reduce the complexity of the encoder, but it suffers from the cost of sampling a huge amount of image data. To relax this sampling burden, this paper develops a novel sub-sampling distributed video coding (SuDVC) scheme by utilizing the compressive sensing (CS) technique. Due to the inherent sparsity in video sources, the video frames are compressively...

Data set: ieee
Keywords: CODECS, ROBUSTNESS
Publication type: book

Content availability

Available (33)
None (1)

Keywords

DECODING (11)
ENCODING (10)
SPEECH (9)
CHANNEL CODING (6)
COMPLEXITY THEORY (6)
IMAGE CODING (6)
QUANTIZATION (6)
SPEECH CODING (6)
TRANSFORM CODING (6)
EDUCATIONAL INSTITUTIONS (5)
PROBABILITY (5)
RECEIVERS (5)
SOURCE CODING (5)
STREAMING MEDIA (5)
VIDEO CODING (5)
WATERMARKING (5)
BIT ERROR RATE (4)
BIT RATE (4)
DATA COMPRESSION (4)
DISCRETE COSINE TRANSFORMS (4)
IMAGE PROCESSING (4)
IMAGE RECONSTRUCTION (4)
MULTIMEDIA COMMUNICATION (4)
NOISE (4)
NOISE MEASUREMENT (4)
SIGNAL PROCESSING (4)
WIRELESS COMMUNICATION (4)
COMPUTERS (3)
CORRELATION (3)
DIGITAL VIDEO BROADCASTING (3)
DISTRIBUTED VIDEO CODING (3)
ERROR CORRECTION CODES (3)
ERROR STATISTICS (3)
FORWARD ERROR CORRECTION (3)
MATHEMATICAL MODEL (3)
OPTIMIZATION (3)
PROPAGATION LOSSES (3)
PROPOSALS (3)
REAL TIME SYSTEMS (3)
REDUNDANCY (3)
RELIABILITY (3)
SIGNAL TO NOISE RATIO (3)
SIMULATION (3)
STANDARDS (3)
VIDEO SEQUENCES (3)
VISUAL COMMUNICATION (3)
WIRELESS SENSOR NETWORKS (3)
ADAPTIVE SYSTEMS (2)
ALGORITHM DESIGN AND ANALYSIS (2)
AUDIO CODING (2)
BANDWIDTH (2)
CAMERAS (2)
CHAOTIC COMMUNICATION (2)
CIRCUITS AND SYSTEMS (2)
COMBINED SOURCE-CHANNEL CODING (2)
COMPUTATIONAL MODELING (2)
COMPUTER NETWORK RELIABILITY (2)
CONVOLUTION (2)
CRYPTOGRAPHY (2)
DATA COMMUNICATION (2)
DEGRADATION (2)
DELAY (2)
DELAYS (2)
DISCRETE WAVELET TRANSFORMS (2)
DISTORTION MEASUREMENT (2)
DOWNLINK (2)
EDUCATION (2)
ENTROPY (2)
EQUATIONS (2)
ERROR PROTECTION (2)
HARDWARE (2)
IEEE TRANSACTIONS ON INFORMATION THEORY (2)
IMAGE ENHANCEMENT (2)
IMAGE QUALITY (2)
INDEXES (2)
INFORMATION THEORY (2)
INTERFERENCE (2)
INTERNET (2)
INTERNET TELEPHONY (2)
ITERATIVE DECODING (2)
JITTER (2)
LOW DELAY AUDIO CODING (2)
MODULATION (2)
MOTION COMPENSATION (2)
NUCLEAR MAGNETIC RESONANCE (2)
PERFORMANCE EVALUATION (2)
PROTOCOLS (2)
PSNR (2)
RESOURCE MANAGEMENT (2)
ROBUST CONTROL (2)
SECURITY (2)
SHAPE (2)
SIGNAL PROCESSING ALGORITHMS (2)
SPEECH CODECS (2)
SPEECH PROCESSING (2)
TRANSMITTERS (2)
TURBO CODES (2)
USA COUNCILS (2)

INFONA - science communication portal
