Search results

chapter

Multi-view Subspace Learning with Diversity Enforced Skeleton Embedding

Shijie Yang, Liang Li, Shuhui Wang, Weigang Zhang, more

2017 IEEE Third International Conference on Multimedia Big Data (BigMM) > 121 - 128

2017 IEEE Third International Conference on Multimedia Big Data (BigMM)

We consider the task of multi-view subspace learning which integrates multi-view information to learn a unified representation for multimedia data. In real-world scenarios, we encounter views with high diversities of semantic levels. Neglecting the problem of semantic inconsistency, existing graph-based methods directly convert heterogeneous information into local affinity matrices to conduct a fusion...

chapter

Performance evaluation of mixtures of PLDA and conventional PLDA for a small-set speaker verification system

Qianhui Wan, Martin Bouchard

2017 IEEE 30th Canadian Conference on Electrical and Computer Engineering (CCECE) > 1 - 4

2017 IEEE 30th Canadian Conference on Electrical and Computer Engineering (CCECE)

This paper compares the use of signal to noise ratio (SNR)-dependent and SNR-independent mixtures of probabilistic linear discriminant analysis (PLDA) versus conventional PLDA, under multi-noise and multi-SNR conditions for a small-set speaker verification system. Results indicate that conventional PLDA is more robust under multi-SNR conditions. The effect of the testing speech length is also examined...

chapter

PLineD: Vision-based power lines detection for Unmanned Aerial Vehicles

T. Santos, M. Moreira, J. Almeida, A. Dias, more

2017 IEEE International Conference on Autonomous Robot Systems and Competitions (ICARSC) > 253 - 259

2017 IEEE International Conference on Autonomous Robot Systems and Competitions (ICARSC)

It is commonly accepted that one of the most important factors for assuring the high performance of an electrical network is the surveillance and the respective preventive maintenance. From a long time ago that TSOs and DSOs incorporate in their maintenance plans the surveillance of the grid, where is included the aerial power lines inspection. Those inspections started by human patrol, including...

chapter

Voice activity detection for children's read speech recognition in noisy conditions

Ankita Pasad, Kamini Sabu, Preeti Rao

2017 Twenty-third National Conference on Communications (NCC) > 1 - 6

2017 Twenty-third National Conference on Communications (NCC)

Recordings of read-aloud stories by children in a school setting can be used to provide an assessment of reading skills via automatic speech recognition (ASR). ASR, however, is known to be highly susceptible to background noise. The unusual variety of foreground (breath release, mic pops, etc.) and background (children playing, distinct background talker, wind, etc.) non-speech sounds makes this application...

chapter

Role of voice activity detection methods for the speakers in the wild challenge

Sarfaraz Jelil, Rohan Kumar Das, S. R. Mahadeva Prasanna, Rohit Sinha

2017 Twenty-third National Conference on Communications (NCC) > 1 - 6

2017 Twenty-third National Conference on Communications (NCC)

One of the major reasons for the performance degradation of a speaker verification (SV) system in real-world conditions is its inability to spot speech regions due to the presence of noise. This work focuses on the role of voice activity detection (VAD) methods in alleviating such shortcomings. The experiments are conducted on the core-core task of the speakers in the wild (SITW) challenge. Two VAD...

chapter

Performance analysis of localization methods for wireless sensor networks

Jyoti Kashniyal, Shekhar Verma, Krishna Pratap Singh

2017 4th International Conference on Power, Control & Embedded Systems (ICPCES) > 1 - 6

2017 4th International Conference on Power, Control & Embedded Systems (ICPCES)

Accurate and cost-effective localization is an important requirement for several sensor network applications. In this paper, we compare two localization methods-multilateration and Isomap and study some of the key issues that affect their performance. We also analyze the flip ambiguity problem and the effect of applying a robustness criterion in both the methods. Our simulation results show that the...

chapter

Assessment of observer based fault estimators for TS fuzzy models

Zahra Shams, S. Seyedtabaii

2017 5th Iranian Joint Congress on Fuzzy and Intelligent Systems (CFIS) > 196 - 201

2017 5th Iranian Joint Congress on Fuzzy and Intelligent Systems (CFIS)

Fault detection of nonlinear systems become more feasible when it is conducted over Takagi-Sugeno (TS) approximated fuzzy models. Proportional plus integral observer (PIO) and robust observer (RO) have already been developed for the estimation of the system states and actuator/sensor faults. In this paper, the algorithms are implemented for the detection of valve and level sensor faults of a two-tank...

chapter

A robust content based image retrieval using local full-directional pattern (LFDP)

Behzad Merhrbakhsh Choobari, Saeed Mozaffari

2017 5th Iranian Joint Congress on Fuzzy and Intelligent Systems (CFIS) > 178 - 183

2017 5th Iranian Joint Congress on Fuzzy and Intelligent Systems (CFIS)

In this paper; we propose new method named local full-directional pattern (LFDP) for content-based image retrieval (CBIR). In addition, instead of applying the algorithm to the image itself, we apply it to a new image constructed by getting mean of 3×3 sub-regions gray value as each pixel's value. In local binary patter (LBP) the gray value difference of the central pixel and its neighboring pixels...

chapter

Sparsity regularized Principal Component Pursuit

Jing Liu, Pamela C. Cosman, Bhaskar D. Rao

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4431 - 4435

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We study the problem of low-rank and sparse decomposition from possibly noisy observations. We propose a novel objective function with nuclear norm on the low-rank term and ℓ₀-‘norm’ on the sparse term, as well as ℓ₁-norm on the additive noise term. When there is no dense inlier noise, the proposed method shares the same theoretical guarantee as the Principal Component Pursuit (PCP), i.e., it can...

chapter

Robust particle filter by dynamic averaging of multiple noise models

Bin Liu

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4034 - 4038

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

State filtering is a key problem in many signal processing applications. From a series of noisy measurement, one would like to estimate the state of some dynamic system. Existing techniques usually adopt a Gaussian noise assumption which may result in a major degradation in performance when the measurements are with the presence of outliers. A robust algorithm immune to the presence of outliers is...

chapter

Minimum entropy pursuit: Noise analysis

Shirin Jalali, H. Vincent Poor

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6100 - 6104

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Universal compressed sensing algorithms recover a “structured” signal from its under-sampled linear measurements, without knowing its distribution. The recently developed minimum entropy pursuit (MEP) optimization suggests a framework for developing universal compressed sensing algorithms. In the noiseless setting, among all signals that satisfy the measurement constraints, MEP seeks the “simplest”...

chapter

A noise suppression method for body-conducted soft speech based on non-negative tensor factorization of air- and body-conducted signals

Yusuke Tajiri, Hirokazu Kameoka, Tomoki Toda

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4960 - 4964

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper presents a novel noise suppression method to enhance soft speech recorded with a special body-conductive microphone called nonaudible murmur (NAM) microphone. NAM microphone is capable of detecting extremely soft speech, but the recorded soft speech easily suffers from external noise due to its faint volume. To effectively suppress noise on the body-conducted signals, an external noise...

chapter

Unlabeled sensing: Reconstruction algorithm and theoretical guarantees

Golnoosh Elhami, Adam Scholefield, Benjamin Bejar Haro, Martin Vetterli

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4566 - 4570

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

It often happens that we are interested in reconstructing an unknown signal from partial measurements. Also, it is typically assumed that the location (temporal or spatial) of each sample is known and that the only distortion present in the observations is due to additive measurement noise. However, there are some applications where such location information is lost. In this paper, we consider the...

chapter

Temporal localization of audio events for conflict monitoring in social media

Junwei Liang, Lu Jiang, Alexander Hauptmann

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1597 - 1601

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

With the explosion in the availability of user-generated videos documenting any conflicts and human rights abuses around the world, analysts and researchers increasingly find themselves overwhelmed with massive amounts of video data to acquire and analyze useful information. In this paper, we develop a temporal localization framework for intense audio events in videos which addresses the problem....

chapter

Synchronization for multi-perspective videos in the wild

Junwei Liang, Poyao Huang, Jia Chen, Alexander Hauptmann

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1592 - 1596

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In the era of social media, a large number of user-generated videos are uploaded to the Internet every day, capturing events all over the world. Reconstructing the event truth based on information mined from these videos has been an emerging challenging task. Temporal alignment of videos “in the wild” which capture different moments at different positions with different perspectives is the critical...

chapter

Comparison of two binaural beamforming approaches for hearing aids

Elior Hadad, Daniel Marquardt, Wenqiang Pu, Sharon Gannot, more

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 236 - 240

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Beamforming algorithms in binaural hearing aids are crucial to improve speech understanding in background noise for hearing impaired persons. In this study, we compare and evaluate the performance of two recently proposed minimum variance (MV) beamforming approaches for binaural hearing aids. The binaural linearly constrained MV (BLCMV) beamformer applies linear constraints to maintain the target...

chapter

SPARTA: Sparse phase retrieval via Truncated Amplitude flow

Gang Wang, Georgios B. Giannakis, Jie Chen, Mehmet Akcakaya

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 3974 - 3978

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

A linear-time algorithm termed SPARse Truncated Amplitude flow (SPARTA) is developed for the phase retrieval (PR) of sparse signals. Upon formulating the sparse PR as a non-convex empirical loss minimization task, SPARTA emerges as an iterative solver consisting of two components: s1) a sparse orthogonality-promoting initialization leveraging support recovery and principal component analysis; and,...

chapter

A provable nonconvex model for factoring nonnegative matrices

Dung N. Tran, Sang P. Chin, Trac D. Tran

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 2262 - 2266

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We study the Nonnegative Matrix Factorization problem which approximates a nonnegative matrix by a low-rank factorization. This problem is particularly important in Machine Learning, and finds itself in a large number of applications. Unfortunately, the original formulation is ill-posed and NP-hard. In this paper, we propose a row sparse model based on Row Entropy Minimization to solve the NMF problem...

chapter

Integrated DNN-based model adaptation technique for noise-robust speech recognition

Kang Hyun Lee, Woo Hyun Kang, Tae Gyoon Kang, Nam Soo Kim

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5245 - 5249

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Since the introduction of deep neural network (DNN)-based acoustic model, robust automatic speech recognition using DNN are being in research. Especially in model adaptation, the techniques utilizing auxiliary context features is known to be a promising technique. Recently, we proposed a technique which is called two-stage noise-aware training (TSNAT). The key idea of TS-NAT is to let the DNN clarify...

chapter

Analysis of various image feature extraction methods against noisy image: SIFT, SURF and HOG

Sidheswar Routray, Arun Kumar Ray, Chandrabhanu Mishra

2017 Second International Conference on Electrical, Computer and Communication Technologies (ICECCT) > 1 - 5

2017 Second International Conference on Electrical, Computer and Communication Technologies (ICECCT)

We present the performance of three popular image feature extraction methods such as Scale Invariant Feature Transformation (SIFT), Speeded-Up Robust Features (SURF) and Histogram of Oriented Gradient (HOG). Specifically, we compare the performance of feature detection methods for images corrupted with different types of noise. The efficiency of three methods are measured by considering number of...

INFONA - science communication portal

Search results

Multi-view Subspace Learning with Diversity Enforced Skeleton Embedding

Performance evaluation of mixtures of PLDA and conventional PLDA for a small-set speaker verification system

PLineD: Vision-based power lines detection for Unmanned Aerial Vehicles

Voice activity detection for children's read speech recognition in noisy conditions

Role of voice activity detection methods for the speakers in the wild challenge

Performance analysis of localization methods for wireless sensor networks

Assessment of observer based fault estimators for TS fuzzy models

A robust content based image retrieval using local full-directional pattern (LFDP)

Sparsity regularized Principal Component Pursuit

Robust particle filter by dynamic averaging of multiple noise models

Minimum entropy pursuit: Noise analysis

A noise suppression method for body-conducted soft speech based on non-negative tensor factorization of air- and body-conducted signals

Unlabeled sensing: Reconstruction algorithm and theoretical guarantees

Temporal localization of audio events for conflict monitoring in social media

Synchronization for multi-perspective videos in the wild

Comparison of two binaural beamforming approaches for hearing aids

SPARTA: Sparse phase retrieval via Truncated Amplitude flow

A provable nonconvex model for factoring nonnegative matrices

Integrated DNN-based model adaptation technique for noise-robust speech recognition

Analysis of various image feature extraction methods against noisy image: SIFT, SURF and HOG

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options