The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
This paper presents a multi-channel/multi-speaker 3D audio-visual corpus for Mandarin continuous speech recognition and other fields, such as speech visualization and speech synthesis. This corpus consists of 24 speakers with about 18k utterances, about 20 hours in total. For each utterance, the audio streams were recorded by two professional microphones in near-field and far-field respectively, while...
Digital videos play an important role today: they are used for entertainment, for communication and for information exchange. Within the internet, there are many sources that provide video content in different ways. Videos can be streamed in real-time or downloaded for local usage. In all these scenarios, video content is exchanged using some sort of binary serialization format. Consequently, compatible...
As video gameplay recording and streaming is becoming very popular on the Internet, there is an increasing need for automatic classification solutions to help service providers with indexing the huge amount of content and users with finding relevant content. The automatic classification of gameplay videos into specific genres is not a trivial task due to their high content diversity. This paper address...
The paper discusses the design and implementation of a low-cost tool for analysis and design of applications involving video based networked interaction or teleoperation, such as remotely operated devices and collaborative environments. The tool is intended for providing an accurate measure of the video lag involved in the encoding and streaming video over packet networks, and for assessing the impact...
Mobile devices have been playing an important role in our daily lives. Therefore, to extend the standby time of mobile devices has become a focal topic to both industry and academia. Based on the brightness, contrast and motion analysis, this paper proposes a new backlight dimming and pixel compensation method for display power saving when playing video. First, we calculate the brightness and contrast...
Non-Photorealistic Rendering (NPR) for video sequences can be regarded as an extension of NPR for images. However, if the NPR sequence is obtained from a real video using an image processing pipeline, discontinuity problems have to be addressed, especially for the background region. In this paper, we propose to adopt Fast Video Segmentation algorithm to extract the common background in the video before...
QoS guarantees for real-time HD video transmission can be assured by proper resource allocation. The TFD option of the IP protocol can be used to convey signaling information about required resources for a given transmission. In this paper the new traffic description for TFD option, based on analysis of scene changes, is presented. The traffic description was carried out in two stages. First, scene...
Visual-based indoor localization have become a favored research area in recent years. It can be used inside a building where GPS signals are often not available. And due to its low deployment cost, visual-based indoor localization has been implemented in the complicated indoor environment. However, in order to increase the accuracy of indoor localization, the scale of image database should be as large...
In discovering nature of the Primary Language of the human brain introduced by J. von Neumann, we suggest that the Primary Language is the Language of Visual Streams. We investigate the Primary Language by researching major ancient algorithms that have been developed as a result of evolution of human intelligence and should be based directly on the Primary Language. One of them is the Algorithm of...
We present a mobile application that helps a group of collocated collaborators to design, pre-visualize and modify the furniture layout plan for academic exhibits, using Augmented Reality (AR). The users can define the dimensions of a virtual exhibition room and populate it with different types of suitable furniture for an exhibit. They can concurrently move around the virtual furniture and evaluate...
Este documento presenta el proceso de elaboración de un micromundo educativo, con el cual se pretende apoyar actividades de comprensión lectora y escucha de la lengua nam trik hablada en el resguardo de Totoró, departamento del Cauca-Colombia. Para este propósito, se describe la adaptación metodológica obtenida a partir del estudio de tres metodologías: (i) Metodología para la construcción de materiales...
As introduced by J. von Neumann in 1957 we continue investigate the structure of the Primary Language of the human brain. Our investigation is focused on rediscovering two major algorithms essential for development of humanity. They are Linguistic Geometry (LG), the algorithm for optimizing war fighting, and the Algorithm of Discovery (AD), the algorithm for inventing new algorithms. According to...
Two directions of development of intelligent real time video systems (technical vision systems) are considered in the report. First direction consists in increasing intellectuality of video systems at the cost of development of new information basis and dynamic models of video information perception processes, principles of control reading parameters of video information for reducing redundancy of...
BGM (background music) of a video plays an important role for making a video impressive. Although a large number of royalty-free music clips are available on the web, it is still difficult for amateur video creators to select appropriate music clips for their videos. In this paper, we propose a computational method for estimating the impression of a video from auditory and visual features of a video...
To increase the overall visual quality of the video services without increasing data rate, we developed an unequal protection technique called UEP3D (Unequal Error Protection 3D) based on a hierarchy of the video stream in different levels of importance,. The determination of levels of importance takes three classification criteria: pixel level, macroblock level and image level. At the end of the...
Video copy detection is still an open problem as current approaches are not able to carry out the detection with enough efficacy and efficiency. These are desirable features in modern video-based applications requiring real-time processing in large scale video databases and without compromising detection performance, especially when facing non-simulated video attacks. These characteristics are also...
Subjective studies showed that most HDR video tone mapping operators either produce disturbing temporal artifacts, or are limited in their local contrast reproduction capability. Recently, both these issues have been addressed by a novel temporally coherent local HDR tone mapping method, which has been shown, both qualitatively and through a subjective study, to be advantageous compared to previous...
In this paper, an enhanced mobility manager will be presented to overcome many drawbacks of Castalia's traditional mobility manager. The presented mobility manger can deal with paths rather than lines. This will allow users to simulate nodes moving with any possibilities within the simulation space. Additionally a 3D visualization engine will be integrated so that users can visualize their simulated...
Rapid development of up-to-date information technologies and the advent of the Web have accelerated the growth of digital media and, in particular, video collections. Due to semantic gap between the low-level video features and high-level interpretations lots of difficulties remain in the construction of video stream semantic structure. Relational model of video parsing has been proposed. Each frame...
This paper presents a lightweight video sensor node for moving object surveillance using region-of-interest (ROI) based coding and an on-line multi-parameter rate controller. The proposed ROI-based coding scheme determines ROI blocks, pre-processes non-ROI blocks using bit-truncation, and encodes all blocks using Motion JPEG. The on-line rate controller modulates the parameters of the ROI-based coding...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.