In Distributed Video Coding (DVC), the encoder compresses frames at source rates that depend on the statistical dependency between the Wyner-Ziv (WZ) and side information frames. A key issue addressed in this paper is providing the encoder with a mechanism to determine the source rate to use when encoding a WZ frame. One possible solution is a feedback approach; the encoder...
In the minimum description length (MDL) and Bayesian criteria, we construct description length of data z^n = z_1 … z_n of length n such that the length divided by n almost converges to its entropy rate as n → ∞, assuming Z_i is in a finite set A. In model selection, if we knew the true conditional probability P(z^n|F) of z^n ∈ A^n given each F, we would choose F such that the posterior probability P(F|z^n...
This paper presents a multi-camera motion capture system aiming to provide caregivers with timely access to the patient's health status through mobile communication devices. The major components include video capture, object detection, video coding and transmission, error concealment, and video analysis. Our contribution is twofold. First, several novel ideas are developed, including fast object detection,...
The biggest challenge in image set compression is how to efficiently remove the set redundancy among images as well as the redundancy inside a single image. Unlike all previous schemes, this paper is the first to propose a generic image set compression scheme that removes the set redundancy based on local features in addition to luminance values. The SIFT (Scale Invariant Feature...
Audio source separation consists of recovering different unknown signals, called sources, by filtering their observed mixtures. In music processing, most mixtures are stereophonic songs and the sources are the individual signals played by the instruments, e.g. bass, vocals, guitar, etc. Source separation is often achieved through classical generalized Wiener filtering, which is controlled by parameters...
Detecting abnormal events in crowded scenes remains challenging due to the diversity of events defined by various applications. Among the many application situations, motion analysis for event representation is suited for crowded scenes. In this paper, we propose a novel abnormal event detection method via likelihood estimation of dynamic-texture motion representation, called Structural Multi-scale...
Bandwidth-limited channels demand the transmission of the per-pixel depth maps with the texture data to provide immersive 3D video services that allow arbitrary 3D viewpoint reconstruction. This auxiliary depth data offers geometric information, which together with the multi-view and epipolar geometries, can be exploited during 3D video coding to calculate geometric positions for the search areas...
In this paper, we propose a scalable multiview video coding (MVC) algorithm based on wavelet pyramids and the Set Partitioning in Hierarchical Trees (SPIHT) codec. This algorithm first uses the Mallat pyramid method to decompose the original images into subbands of different scales, and then follows the usual phase matching method to conduct disparity estimation in coarse-to-fine order. To solve...
The intra coding algorithm in High Efficiency Video Coding employs up to 35 directional prediction modes. To reduce the intra encoding complexity, we propose a candidate mode selection algorithm based on analyzing the textures of the source image block. Considering the fine difference between the neighboring prediction directions, we devise a fixed-point arithmetic based edge detector,...
In 3D video systems, depth maps supply geometry information which is used to generate virtual views. The coding of the depth maps can be improved by considering distortion of synthesized views instead of depth map distortion. Therefore, this paper proposes a novel metric for depth coding, which quantifies the impact of the depth distortion on the fidelity of synthesized views, without banding with...
Most work in automatic facial expression analysis seeks to detect discrete facial actions. Yet, the meaning and function of facial actions often depend in part on their intensity. We propose a part-based, sparse representation for automated measurement of continuous variation in AU intensity. We evaluated its effectiveness in two publicly available databases, CK+ and the soon-to-be-released Binghamton...
Cell biology is characterised by low molecule numbers and coupled stochastic chemical reactions with intrinsic noise permeating and dominating the interactions between molecules. Recent work [9] has shown that in such environments there are hard limits on the accuracy with which molecular populations can be controlled and estimated. These limits are predicated on a continuous diffusion approximation...
Digital spectral analysis of DNA sequences using AR models has long been shown to be superior to classical Fourier Transform techniques. Here the authors apply a special case of the all-pole model, using Prony's method, to DNA sequences from various chromosomes for Power Spectral Density (PSD) estimation in order to identify protein-coding regions. A quaternary mapping method comprising real and imaginary...
Video forensics is an emerging discipline that aims at inferring information about the processing history undergone by a digital video in a blind fashion. In this work we introduce a new forensic footprint and, based on it, propose a method for detecting whether a video has been encoded twice; if this is the case, we also estimate the size of the Group Of Pictures (GOP) employed during the first...
We consider state estimation for a discrete-time system over a lossy network. To improve the estimation performance, instead of the standard approach of sending the current measurement data, we send a linear combination of the current measurement and the measurement collected at the previous time, a method called linear temporal coding. We consider the case when the packet arrival...
An efficient coding algorithm for depth map images and videos, based on view synthesis distortion estimation, is proposed in this work. We first analyze how a depth error is related to a disparity error and how the disparity vector error affects the energy spectral density of a synthesized color video in the frequency domain. Based on the analysis, we propose an estimation technique to predict the...
This paper introduces a camera surveillance system for wireless communications. The system contains three major modules: PTU (pan-tilt unit) camera control for surveillance video capture, cross-layer control for data compression and transmission, and error concealment for video quality enhancement. Our contribution is twofold. First, a system design for data collection and transmission over wireless...
TCP is the ubiquitous transport protocol in the Internet. However, in a wireless ad-hoc environment where links are unreliable, TCP causes a number of performance issues. The key reason behind this is that TCP considers all packet losses to be due to congestion and reduces its send rate, which is not necessarily appropriate in a lossy ad-hoc environment. In prior work, we have designed Loss Tolerant...
This paper investigates the effect of reducing the resolution of estimated depth maps that are used for inter-view motion and residual prediction in a multiview extension of HEVC. The investigated HEVC extension includes disparity compensated prediction similar to the MVC extension of H.264/AVC. For further increasing the coding efficiency, the obtained disparity vectors are used to estimate depth...
H.264 is an up-to-date video coding standard with wide applications. To improve the segmentation efficiency for H.264 video, a novel object segmentation approach using background estimation is proposed in this paper. First, preprocessing that includes spatiotemporal filtering and motion field accumulation is used to remove the noisy motion vectors and obtain the dense motion field. Then...