Video surveillance systems have enabled the monitoring of complex events in places such as airports, banks, streets, schools, and industrial sites. Because video cameras acquire massive amounts of multimedia data, traditional visual inspection by human operators is a tedious and time-consuming task whose performance is affected by fatigue and stress. A challenge is to develop...
In this work we address the problem of analyzing video sequences and representing meaningful space-time points of interest. We base our work on the 3D shearlet transform. In particular, we exploit the relation between coefficients with similar shearings to build a local representation which proves highly informative for understanding the local spatio-temporal characteristics of the points that...
This work analyses the actual throughput of the Discrete Sine Transform (DST) stage in a realistic HEVC encoder, which executes the rate-distortion optimization algorithm to achieve high compression quality. Then, a low complexity DST factorization, where all the integer multiplications are substituted with add-and-shift operations, is exploited to design an efficient 1D-DST core. The proposed 1D-DST...
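As a rough illustration of what replacing integer multiplications with add-and-shift operations can look like, the sketch below applies the idea to the 4-point integer DST matrix used by HEVC. It is not the factorization proposed in the abstract above, which is not given here; the helper names (`mul29`, `mul55`, `mul74`, `dst4_shift_add`, `dst4_reference`) are hypothetical, and the scaling and rounding stage of a real encoder is omitted.

```python
# Forward 4-point integer DST matrix used by HEVC for 4x4 intra luma residuals.
DST4 = [
    [29, 55, 74, 84],
    [74, 74, 0, -74],
    [84, -29, -74, 55],
    [55, -84, 74, -29],
]


def mul29(x):
    """29 * x using shifts and adds: 32 - 4 + 1."""
    return (x << 5) - (x << 2) + x


def mul55(x):
    """55 * x using shifts and adds: 64 - 8 - 1."""
    return (x << 6) - (x << 3) - x


def mul74(x):
    """74 * x using shifts and adds: 64 + 8 + 2."""
    return (x << 6) + (x << 3) + (x << 1)


def dst4_shift_add(s):
    """1D 4-point DST with every constant multiplication expanded to add-and-shift.

    Uses the identity 84 = 29 + 55 so that only three constants are needed.
    Output is unscaled (no rounding shift), unlike a full encoder stage.
    """
    c0 = s[0] + s[3]
    c1 = s[1] + s[3]
    c2 = s[0] - s[1]
    c3 = mul74(s[2])
    return [
        mul29(c0) + mul55(c1) + c3,
        mul74(s[0] + s[1] - s[3]),
        mul29(c2) + mul55(c0) - c3,
        mul55(c2) - mul29(c1) + c3,
    ]


def dst4_reference(s):
    """Plain matrix-vector product, used only to check the shift-add version."""
    return [sum(row[i] * s[i] for i in range(4)) for row in DST4]
```

Comparing `dst4_shift_add` against `dst4_reference` on a few residual vectors, including negative samples, is a quick way to confirm that the two agree.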
Soft errors resulting from encoding video sequences on unreliable hardware can create significant artifacts in decoded video sequences, contributing to extreme video quality degradation. Modern systems are required to operate under increasingly challenging constraints, including smaller feature sizes and lower operating voltage, increasing the likelihood of soft errors in the video encoding hardware...
This paper presents a new static caption detection method that uses the census transform (CT) and motion vector (MV) for frame rate up-conversion. The proposed method splits a frame into several blocks and detects the static regions using CT and MV. CT is used to consider the spatio-temporal consistency of the texture and MV is used to remove the non-static regions containing moving objects. Next,...
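For readers unfamiliar with the census transform, a minimal textbook sketch follows. It assumes a 3x3 window and a plain 2D list of intensities; the window size, the block splitting, and the way CT is combined with motion vectors in the method above are not stated in the abstract, so none of that is modelled. The function names `census_transform_3x3` and `hamming` are hypothetical.

```python
def census_transform_3x3(img):
    """Return an 8-bit census code per pixel for a 2D list of intensities.

    Each bit records whether a neighbour is darker than the centre pixel,
    scanning the 3x3 window row by row. Border pixels are left as 0.
    """
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            center = img[y][x]
            code = 0
            for dy in (-1, 0, 1):
                for dx in (-1, 0, 1):
                    if dy == 0 and dx == 0:
                        continue  # skip the centre pixel itself
                    bit = 1 if img[y + dy][x + dx] < center else 0
                    code = (code << 1) | bit
            out[y][x] = code
    return out


def hamming(a, b):
    """Number of differing bits between two census codes."""
    return bin(a ^ b).count("1")
```

Because the code depends only on intensity ordering, adding a constant brightness offset to a frame leaves every census code unchanged, which is one reason a small Hamming distance between co-located codes in consecutive frames can serve as a texture-consistency cue.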
The rich choice of hierarchical partitioning modes within the coding tree unit (CTU) in the High Efficiency Video Coding (HEVC/H.265) standard is the main reason for both its higher coding efficiency and its high computational complexity. The single most time-consuming process in H.265/HEVC is motion estimation (ME). In this paper, to accelerate the ME process, we propose an efficient parallel...
In this paper, we develop a real-time pedestrian leg detection and tracking system that targets the lower part of the human body. In the leg detection procedure, we evaluate two kinds of classifiers, based on multilayer perceptrons (MLP) and support vector machines (SVM), to determine whether pedestrian legs appear in a frame of the video sequence captured by a single webcam equipped...
Content structure is an important aspect of video understanding. In this paper, we demonstrate that knowledge about the structure can improve the performance of content analysis operations such as feature extraction, shot transition, shot duration and activity. We propose two concepts aimed at improving the performance of existing video shot detection methods. First, we have used...
In this paper, a video watermarking scheme is proposed to embed the watermark with the depth sequences of the respective video frames for multi-view video plus depth (MVD) based 3D video sequences. To make the scheme invariant to the 3D-HEVC compression attack, motion compensated temporal filtering (MCTF) is applied to the video sequences to find motion-coherent connected pixels. Scale-invariant feature...
High Efficiency Video Coding (HEVC) is the latest video coding standard and has achieved 50% better compression performance than prior video standards, targeting 2K, 4K, and 8K video resolutions. HEVC adopts a flexible quad-tree structure, resulting in 60% of inter prediction complexity. A fast coding unit decision algorithm is proposed which can reduce the coding tree unit (CTU) inter...
Video denoising systems aim to remove noise within each frame. Most video denoising systems employ spatial filtering or wavelet decomposition, or exploit temporal coherence through motion estimation. Unfortunately, these denoising systems distort edges, lines and curves. The drawback of temporal correlation based schemes is that they suffer from aperture problems in optical...
In this paper, a novel approach for target tracking in FLIR (Forward Looking Infra-red) imagery is presented. Generally, IR (infra-red) signatures of targets are more prominent than background and clutter, and this contrast is commonly used as a cue for detecting targets and initializing tracking algorithms. But in the case of small targets with poor SNR, detection based on this feature alone...
Conventional video denoising algorithms rely on either a strenuous motion estimation step or a three-dimensional wavelet transform. However, these video denoising schemes result in videos with jittery edges and curves. The limitation of motion estimation based schemes is that they suffer from aperture problems in optical flow and from lighting variations. Yue M. Lu and Minh N. Do introduced...
As environment identification is a vital necessity for blind people, a visual substitution system based on video analysis is a solution to their problem. This paper focuses on the assessment and integration of the local dissimilarity map in video processing. A Real Value Local Dissimilarity Map is built for grayscale images in order to get an excellent detection of similar frames. The elimination...
A selective video encryption method based on the manipulation of transform skip signal and sign bin is proposed for the HEVC standard. The basic performance of the proposed selective video encryption method is evaluated in terms of perceptual inspection, outline detection and sketch attack using various classes of test video sequences. Preliminary results show that the proposed method provides quality...
The quad-tree-structured Transform Unit (TU) helps High Efficiency Video Coding improve its coding efficiency. However, this coding efficiency comes at the cost of increased computational complexity. In this paper, based on the quantized coefficients of the TU, we propose an early termination for the quad-tree-structured TU encoding process. If the quantized coefficients...
When designing hardware-accelerated video encoding systems, it is fundamental to determine the maximum throughput needed by each subsystem so that the design can optimize the cost-performance tradeoff. One of the key modules in video coding is the 2D transform operation which is typically subject to heavy optimization efforts. This work investigates the tradeoff between the computational power spent...
This paper proposes a framework for recognizing human actions from depth video sequences by designing a novel feature descriptor based on Depth Motion Maps (DMMs), the Contourlet Transform (CT) and Histograms of Oriented Gradients (HOGs). First, CT is applied to the generated DMMs of a depth video sequence, and then HOGs are computed for each contourlet sub-band. Finally, the concatenation of these...
In this paper, we propose a robust obstacle detection approach that leverages the Weighted Hough Transform (WHT) in combination with temporal information correlation from stereo video sequences. First, to model the road surface or obstacles in the video, rather than using a simple threshold on the binarized frame sequence, we propose to adopt WHT to extract the linear relation from the v-disparity map...
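As background for the v-disparity map mentioned above, a minimal sketch of how such a map is commonly built from a dense disparity image follows. It assumes integer disparities stored in a 2D list and covers only the histogram construction; the Weighted Hough Transform step and the temporal correlation described in the abstract are not reproduced. The function name `v_disparity` and the parameter `max_disp` are hypothetical.

```python
def v_disparity(disp, max_disp):
    """Build a v-disparity map from a 2D list of integer disparities.

    Row v of the result is a histogram of the disparities found in image
    row v, so the output has len(disp) rows and max_disp + 1 columns.
    Disparities outside [0, max_disp] are ignored (treated as invalid).
    """
    hist = [[0] * (max_disp + 1) for _ in disp]
    for v, row in enumerate(disp):
        for d in row:
            if 0 <= d <= max_disp:
                hist[v][d] += 1
    return hist
```

In such a map a planar road surface tends to appear as a slanted line, since its disparity changes roughly linearly with the image row, while a vertical obstacle tends to appear as a near-vertical segment; this is why line extraction is a natural next step.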
Screen content videos are gaining popularity due to rapid advances in cloud and multimedia technologies, which in turn requires highly efficient screen content compression. A recent standard, namely SCC, is under development in JCT-VC, the Joint Collaborative Team on Video Coding between ISO/IEC and ITU-T. In SCC, the most efficient new coding tool is Intra block copy (Intra BC). In this...