In video transcoding, pre-encoded frames may be arbitrarily dropped to freely adjust the video to meet the network and client requirements. Since transcoding is carried out in real time, incoming motion vectors are reused to reduce the transcoding latency. In this paper, we propose a new motion vector composition scheme for arbitrarily dropping any frame from an incoming video bit-stream comprising I,...
We propose a new DCT-based homogeneous video transcoding architecture that relies on both quality and temporal reduction techniques. The frame layer control is driven by a new indicator, the jerkiness, which represents the user's perception of the movement that affects a video stream. The proposed transcoder can meet the constraints of real-time communication and has been extensively tested under...
Motivated by the need for efficient indexing structures adapted to real applications in video databases, we present a new indexing structure named Kpyr. In Kpyr, we use a clustering algorithm to partition the data space into sub-spaces, to each of which we apply the Pyramid technique (S. Berchtold, et al., 1998). We thus reduce the search space concerned by a query and improve performance. We show that our...
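As a rough illustration of the Pyramid technique cited in the abstract above, the sketch below maps a point in the unit hypercube to a one-dimensional pyramid value (pyramid number plus height), following the commonly described formulation of Berchtold et al. It is not the Kpyr structure itself: the clustering step is omitted, and the function name `pyramid_value` is a hypothetical choice made for this example.

```python
def pyramid_value(point):
    """Map a point in [0, 1]^d to a 1-D pyramid value (illustrative sketch).

    The unit hypercube is split into 2*d pyramids that meet at the centre
    (0.5, ..., 0.5). The value is the pyramid number plus the point's
    height, i.e. its distance from the centre along the dominant dimension.
    """
    d = len(point)
    # Dimension in which the point lies farthest from the centre.
    j_max = max(range(d), key=lambda j: abs(0.5 - point[j]))
    # Pyramids 0..d-1 lie below the centre, d..2d-1 above it.
    i = j_max if point[j_max] < 0.5 else j_max + d
    height = abs(0.5 - point[j_max])
    return i + height


# Usage: two 2-D points that fall into different pyramids.
print(pyramid_value([0.25, 0.5]))
print(pyramid_value([0.5, 0.75]))
```

The resulting values can be stored in any ordinary one-dimensional index such as a B+-tree; how Kpyr combines this with clustering is described in the paper, not here.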
We propose a quad-tree scheme for obtaining sub-pixel estimates of interframe motion in the frequency domain. Our scheme is based on phase correlation and uses motion compensated prediction error to control the partition of a parent block to four children quadrants. This criterion guarantees a monotonic decrease of the motion compensated prediction error with an increasing number of iterations making...
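For readers unfamiliar with phase correlation, which the abstract above builds on, here is a minimal NumPy sketch of the basic integer-pixel form. It assumes two equally sized grayscale blocks related by a circular shift, and it does not implement the quad-tree partition or the sub-pixel estimation proposed in the paper. The function name `phase_correlation_shift` is hypothetical.

```python
import numpy as np


def phase_correlation_shift(ref, cur):
    """Estimate the integer shift (dy, dx) with cur ~ np.roll(ref, (dy, dx)).

    Illustrative sketch only: integer-pixel accuracy, circular shifts.
    """
    f_ref = np.fft.fft2(ref)
    f_cur = np.fft.fft2(cur)
    # Normalised cross-power spectrum keeps only the phase difference.
    cross = f_cur * np.conj(f_ref)
    cross /= np.maximum(np.abs(cross), 1e-12)
    # Its inverse transform peaks at the displacement.
    surface = np.fft.ifft2(cross).real
    peak = np.unravel_index(np.argmax(surface), surface.shape)
    # Peaks past the midpoint correspond to negative shifts.
    return tuple(int(p) if p <= n // 2 else int(p) - n
                 for p, n in zip(peak, surface.shape))


# Usage: recover a known circular shift of a random block.
rng = np.random.default_rng(0)
block = rng.random((32, 32))
moved = np.roll(block, (3, -5), axis=(0, 1))
print(phase_correlation_shift(block, moved))
```

Sub-pixel accuracy is usually obtained by interpolating around the correlation peak; the specific method used by the authors is not given in the truncated abstract.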
Video annotation is typically performed by classifying video elements according to some pre-defined ontology of the video content domain. Ontologies are defined by establishing relationships between linguistic terms that specify domain concepts at different abstraction levels. However, although linguistic terms are appropriate to distinguish event and object categories, they are inadequate when they...
In this paper, we propose a hybrid architecture to integrate peer-to-peer (P2P) streaming approaches with content distribution networks (CDNs). We further utilize multiple description (MD) coding in this architecture to address challenging issues in P2P approaches such as low peer upstream bandwidth and low quality assurance. The proposed schemes take advantage of the high availability of CDNs, the...
Streaming high quality audio/video (AV) from home media sources to TV sets over a wireless local area network (WLAN) is a challenging problem because of the fluctuating bandwidth caused by interference. Our approach is to adjust the video bit-rate dynamically in order to improve the experienced audiovisual quality. The effectiveness of rate adaptation depends on the accurate and timely estimation...
In a baseball game, an event is defined as the portion of a video clip between two pitches, and a play is defined as a batter finishing his plate appearance. A play is a concatenation of many events, and a baseball game is formed by a series of plays. In this paper, only the event that happens on the last pitch of a plate appearance is detected. It is then semantically classified to represent the corresponding...
Traditionally, artistic color concepts play an important role in the analysis of artworks, and provide valuable domain knowledge to guide the analysis and accurate retrieval of paintings. This paper presents an automated approach to analyzing and representing artistic color concepts such as color temperature, color palette and color contrasts in the paintings domain. The color concept definitions rely...
Our previous research shows that the use of multiple sources of information based on intrinsic AV features and external knowledge helps to detect events in soccer video. To make the system scalable, we process each source of information independently before fusing the detection results. The fusion of results is vital to success under this architecture. However, this fusion problem is unique in...
Video management research has largely been ignoring the increased attractiveness of using camera-equipped mobile phones for the production of short home video clips, mostly considering them as additional channels for video consumption. The CANDELA project, which is part of the European ITEA program, focuses on the integration of video content analysis with advanced retrieval, mobile, networked delivery,...
Video is about to conquer the Internet. Real-time delivery of video content is technically possible to any desktop and mobile device, even with modest connections. The main problem hampering massive (re)usage of video content today is the lack of effective content-based tools that provide semantic access. In this contribution, we discuss systems for both video analysis and video retrieval that facilitate...
Recent video coding standards such as H.264 offer the flexibility to select reference frames during motion estimation for predicted frames. In this paper, by tracking loss compensation during distortion minimization, we improve upon an earlier proposal to jointly select reference frame, level of QoS and transmission path for each video frame in a multi-path streaming scenario. An algorithm that efficiently...
In many novel application scenarios such as smart rooms or sensing rooms, visual sensors (such as cameras) need to know which visual actuators (such as displays) are visible to them. Often only parts of a display are visible from a camera. Therefore, a novel algorithm for precise visibility determination is presented. The algorithm makes the assumption that the displays are active, i.e., they can be...
A more efficient coding scheme for H.264 that heuristically assigns macroblock partition types for video foreground and background coding is proposed. High visual quality of foreground regions is retained while low bit-rate background coding is achieved. More importantly, the encoding time is reduced significantly owing to the elimination of the exhaustive searches over all partition types during the...
Traditional design and test of complex multimedia systems involves a large number of test vectors and is a difficult and time-consuming task. The simulation times are prohibitively long on current desktop computers. Driving actual design scenarios and timing burst behavior which produce real-time effects is difficult to do with current simulation environments. This paper describes a rapid emulation...
Multiview video compression is important to image-based 3D video applications. In this paper, we propose a novel neighbor-based multiview video compression scheme. It is essentially an MPEG2-like block-based scheme. In particular, a method to decide the stream encoding order is presented. The resulting stream encoding order can better decorrelate spatial redundancies among multiple video streams...
Unlike video coding, video deinterlacing relies heavily on the correctness of motion. To obtain more reliable motion, we propose a new motion search criterion that imposes constraints on the motion diversity among neighboring blocks, and improve the symmetric ME method by dynamic block splitting and single direction ME. To further improve the visual quality, an adaptive deinterlacing algorithm based...
A novel system is described that significantly enhances the usefulness of handwritten notes taken during a presentation by creating a multimedia document that includes scanned images of handouts, personal notes, and links to a multimedia recording of the presentation. Notes are linked to the e-presentation media with automatic content analysis without any special notes capture device. Layout segmentation...
With the spread of digital cameras, shooting photos has become an everyday affair. However, there are few methods or systems to manage photos simply, and a huge amount of photo data remains unorganized. Although one way to manage photos is to add appropriate words explaining their contents, it requires much time and effort to input such indexes manually...