The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Due to the nonlinear interaction between the ultrasonic waves emitted by the parametric loudspeaker, a directional sound beam is generated along with harmonic distortion. It is known that the single sideband amplitude modulation (SSB-AM) technique is one of the most effective ways of reducing the harmonics. A 3rd-order inverse Volterra filter (VF) is designed on the basis of the SSB-AM in this paper...
In this paper, we propose a novel feature compensation approach based on the interacting multiple model (IMM) algorithm specially designed for joint processing of background noise and acoustic reverberation. Our approach to cope with the time-varying environmental parameters is to establish a switching linear dynamic model for the additive and convo-lutive distortions in the log-spectral domain. The...
In view of structure similarity between depth and texture in multiview video plus depth, efficient depth intra-coding with the aid of texture information has received a lot of attention. In this paper, a new depth-texture cooperative clustering method is first proposed for cluster-based depth prediction (CBDP) by exploiting the similarity. Due to inaccuracy of depth maps and the resulted texture-depth...
The emergence of large multi-platform and multi-scale data repositories in biomedicine has enabled the exploration of data integration for holistic decision making. In this research, we investigate multi-modal genomic, proteomic, and histopathological image data integration for prediction of ovarian cancer clinical endpoints in The Cancer Genome Atlas (TCGA). Specifically, we study two data integration...
Compressed sensing has shown great potential to speed up magnetic resonance imaging (MRI) assuming the image is sparse and compressible in a transform domain. Conventional methods typically use a pre-defined sparsifying transform such as wavelets or finite difference, which sometimes does not lead to a sufficient sparse representation. In this paper, we design a patch-based nonlocal operator (PANO)...
Wireless capsule endoscopy (WCE) is a new innovative solution for gastrointestinal disease detection. The image quality of WCE is not satisfactory for medical applications since some of them are dark and low-contrast. The WCE image enhancement is a challenge task, mainly because the diversity of the WCE images of different people and the need to preserve the local fine details of WCE images. Hence,...
In this paper, we propose a novel image quality assessment (IQA) metric based on nonnegative matrix factorization (NM-F). With nonnegativity and parts-based properties, NMF well demonstrates how human brain learns the parts of objects. This makes NMF distinguished from other feature extraction methods like singular value decomposition (SVD), principal components analysis (PCA), etc. Inspired by this,...
This article presents a novel scheme for video denoising based on improved matrix recovery strategy. The proposed scheme attempts to go beyond the conventional approaches that focus on the rank properties of the matrix by making use of a priori knowledge derived from the characteristics of video and noise. In this paper, we will first demonstrate that the conventional approach such as robust PCA (principal...
Biometrics and information hiding, as two different yet promising techniques for individual identification and digital media protection, have been extensively studied in the latest decade. Recently, hybrid approaches that combine these two techniques (i.e., biometric information hiding) for advanced information security have obtained increasing research interest. The principle idea is applying information...
Trajectory-based human activity recognition aims at understanding human behaviors in video sequences. Some existing approaches to this problem, e.g., hidden Markov models (HMM), have a severe limitation, namely the number of motions has to be preset. In fact, this number is difficult to define in advance in real practice. To overcome this shortcoming, we propose a new method for modeling human trajectories...
The Cave Automatic Virtual Environment (CAVE) system is a fully immersive virtual reality system, which can provide users with a realistic experience and a large freedom of interactions. In this paper, we propose vDesign, a CAVE-based virtual design environment using finger interactions. Specifically, we focus on the function of image segmentation and composition in the vDesign system. In vDesign,...
In this paper, steganography methods on streaming cover data were modeled uniformly, and then a subliminal channel consisting of multiple methods was constructed. A channel coding with feedback was proposed for the reliability of transmission. This coding was not affected by whether the capacity of used methods was fixed or variant. Experiments showed that, with the same capacity utilization as former...
Albayzin 2012 language recognition evaluation (LRE) is one of the most challenging language recognition evaluation, which is mainly reflected in: (1) the target languages are more confusable with other languages, which might push down the system performance; (2) developing and test data is heterogeneous regarding duration, number of speakers, ambient noise/music, channel conditions, etc. (3) signals...
In this paper, we propose a novel method for improving performance of the acoustic echo canceller (AEC) employed in the hands-free communication. The main objective is to realize an improved performance without requiring a double talk detector (DTD). The basic idea is to employ a gradient-based independent component analysis (ICA) method with a generalized Cauchy distribution-based flexible score...
In this paper, a kind of clipping detection method for audio signal is proposed based on kernel Fisher discriminant (KFD) in MDCT domain. The kernel method and the Fisher linear discriminant analysis (FLDA) are introduced to the proposed method. First, the clipping and non-clipping feature parameters are extracted using MDCT coefficients of audio signals. Next, the optimal projection vector and the...
In this paper, we propose a method to improve detecting the mispronunciation type of the non-native learners. In order to cope with the low-resource condition of non-native speech and the difference of native and non-native speech, the following efforts are made: 1) train acoustic model with the low-resource non-native data; 2) introduce the articulatory-based tandem feature; 3) pool auxiliary native...
This paper develops a method to evaluate the accuracy of permanent scatterers selection by using the proportion from the number of lines in the initial PS network to the number of lines in the refined PS network. Taking advantage of this method, two main PSC selection methods: amplitude dispersion method and coherence method were tested using 20 Envisat ASAR acquisitions. The results show that, when...
This paper proposes a novel linear transceiver design using lattice reduction algorithms. We propose a novel transceiver optimization problem that aims to minimize the sum of mean square errors (MSEs) of information symbols in a lattice-reduced domain. To alleviate the high complexity of solving the optimization problem, we devise an alternating algorithm to find a sub-optimal solution with low complexity...
To support spatial scalability, the scalable extension of H.264/AVC (SVC) uses video cropping or uniform scaling to downscale the original higher-resolution (HR) sequence to a lower resolution (LR) one. Both operations, however, will cause critical visual information loss in the retargeted frames. The content-adaptive spatial scalability SVC coders (CASS-SVC) use non-homogeneous scaling to avoid critical...
Lattice reduction aided decoding has been successfully used for signal detection in multiinput and multioutput (MIMO) systems and many other wireless communication applications. In this paper, we propose a novel enhanced Jacobi (short as EJacobi) method for lattice basis reduction. To assess the performance of the new EJacobi method, we compared it with the LLL algorithm, a widely used algorithm in...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.