The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
This paper evaluates the application of three methods for Sound Source Separation (SSS) in industrial acoustic condition monitoring scenarios. To evaluate the impact of SSS, we use a machine learning approach where a classifier is trained to detect a specific operating machine. The evaluation procedure is based on simulated and measured data, comprising three different machine sounds as targets and...
While most dereverberation methods focus on how to estimate the amplitude of an anechoic signal, we propose a method which also takes the phase into account. By applying a sinusoidal model to the anechoic signal, we derive a formulation to compute the amplitude and phase of each sinusoid. These parameters are then estimated by our method in the reverberant case. As we jointly estimate the amplitude...
Motivated by the fact that modeling and representation of multi-class signal patterns plays a critical role in Electroencephalogram (EEG)-based brain computer interface (BCI) systems, the paper proposes the coupling of error correction output coding (ECOC) with the common spatial pattern (CSP) analysis. Referred to as the ECO-CSP framework, the ECOC approach is applied to EEG motor imagery classification...
Pulse photopletysmographic signal (PPG) is modulated by the respiratory rate, so there are some algorithms capable to extract respiratory information from the derived PPG signals, as the Pulse Amplitude Variability (PAV). Previous works have shown that the use of the PPG leads to different results depending on the PPG sensor location (finger and forehead). Therefore, a database recording finger and...
Tensor-based analysis of brain imaging data, in particular functional Magnetic Resonance Imaging (fMRI), has proved to be quite effective in exploiting their inherently multidimensional nature. It commonly relies on a trilinear model generating the analyzed data. This assumption, however, may prove to be quite strict in practice; for example, due to the natural intra-subject and inter-subject variability...
In this paper, we present a time-recursive implementation of a recent hyperparameter-free group-sparse estimation technique. This is achieved by reformulating the original method, termed group-SPICE, as a square-root group-LASSO with a suitable regularization level, for which a time-recursive implementation is derived. Using a proximal gradient step for lowering the computational cost, the proposed...
Audio Event Detection (AED) aims to recognize sounds within audio and video recordings. AED employs machine learning algorithms commonly trained and tested on annotated datasets. However, available datasets are limited in number of samples and hence it is difficult to model acoustic diversity. Therefore, we propose combining labeled audio from a dataset and unlabeled audio from the web to improve...
Many optimization problems in communications and signal processing can be formulated as rank-one constrained optimization problems. This has motivated the development of methods to solve such problem in specific scenarios. However, due to the non-convex nature of the rank-one constraint, limited progress has been made in solving generic rank-one constrained optimization problems. In particular, the...
Gesture recognition has multiple applications in medical and engineering fields. The problem of hand gesture recognition consists of identifying, at any moment, a given gesture performed by the hand. In this work, we propose a new model for hand gesture recognition in real time. The input of this model is the surface electromyography measured by the commercial sensor the Myo armband placed on the...
This paper deals with the problem of airfare prices prediction. For this purpose a set of features characterizing a typical flight is decided, supposing that these features affect the price of an air ticket. The features are applied to eight state of the art machine learning (ML) models, used to predict the air tickets prices, and the performance of the models is compared to each other. Along with...
We present an extension for HEVC intra-frame coding with trapezoidal splits and orthogonal transforms. A block can be split into two 180-degrees rotationally-symmetric (C2) trapezoidal parts, each coded separately using standard DCT implementation. We also introduce part-to-part prediction from a diagonal edge. The optimal trapezoidal split of a quad tree block is selected in a rate-distortion sense...
The problem of detecting misinformation and fake content on social media is gaining importance with the increase in popularity of these social media platforms. Researchers have addressed this content analysis problem using machine learning tools with innovations in feature engineering as well as algorithm design. However, most of the machine learning approaches use a conventional classification setting,...
With the growing development of video applications and services for mobile devices, saving energy consumption when managing video is becoming a more and more important issue. The challenge is then to deliver video with high quality while reducing the energy consumption. In this paper, we investigate the relationship between subjective video quality and energy consumption in an HEVC decoder. By reducing...
In this paper, a Bayesian method with a hierarchical sparsity enforcing prior model for Dual-Tree Complex Wavelet Transform (DT-CWT) coefficients is proposed. This model is used for X-ray Computed Tomography (CT) image reconstruction. A generalized Student-t distributed prior model is used to enforce the sparse structure of the DT-CWT coefficient of the image. The joint Maximum A Posterior algorithm...
In this paper, we address the estimation of power spectral density (PSD) matrix. The accurate estimation of PSD matrix plays an important role in many speech enhancement methods. In traditional PSD estimation methods, only the information of previous frames is employed through a forgetting factor. In the current research, we consider the correlation of inter-band components and incorporate their information...
Text-to-speech (TTS) systems are often used as part of the user interface in wearable devices. Due to limited memory and computational/battery power in wearable devices, it could be useful to have a TTS system which requires less memory and is less computationally intensive. Conventional speech synthesis systems has separate modeling for pitch (FO-model) and spectral representation, namely Mel generalized...
Like other divergences, Jeffrey's divergence (JD) is used for change detection, for model comparison, etc. Recently, a great deal of interest has been paid to this symmetric version of the Kullback-Leibler (KL) divergence. This led to analytical expressions of the JD between autoregressive (AR) processes, moving-average (MA) processes, either noise-free or disturbed by additive white noises, as well...
This paper proposes a new algorithm for image inpainting algorithm based on the matrix rank minimization with nonlinear mapping function. Assuming that each intensity value of a nonlinear mapped image can be modeled by the autoregressive (AR) model, the image inpainting problem is formulated as a kind of the matrix rank minimization problem, and this paper modifies the iterative partial matrix shrinkage...
In this paper we argue that the Wigner-Ville distribution (WVD), instead of the spectrogram, should be used as basic input into convolutional neural network (CNN) based classification schemes. The WVD has superior resolution and localization as compared to other time-frequency representations. We present a method where a large-size kernel may be learned from the data, to enhance features important...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.