The latest High Efficiency Video Coding (HEVC) standard is increasingly used to generate video streams over the Internet. However, decoded HEVC video streams may suffer severe quality degradation, especially at low bit-rates. Thus, it is necessary to enhance the visual quality of HEVC videos at the decoder side. To this end, we propose in this paper a Decoder-side Scalable Convolutional Neural Network (DS-CNN)...
Recent advances in technology and the rapid growth of consumer electronics have made a tremendous amount of multimedia information available to the general population. Browsing through large collections of consumer videos and manually creating summaries can be tedious. Automatic summarization techniques will give the user an easy way to look up the important content of a collection of media and to browse media...
Recent advances in video understanding are enabling incredible developments in video search, summarization, automatic captioning and human computer interaction. Attention mechanisms are a powerful way to steer focus onto different sections of the video. Existing mechanisms are driven by prior training probabilities and require input instances of identical temporal duration. We introduce an intuitive...
Delivering a persuasive presentation is a challenging task for university students. One of the difficult parts is holding the audience's attention in critical situations, such as in the early morning or after the lunch break. In these situations, students pay less attention to speeches or presentations given in the classroom because of lapses in attention span such...
QR codes are increasingly being used as a mechanism to transmit one time passwords (OTPs) between devices for the purpose of authentication due to their convenience, low cost, and the ubiquity of consumer mobile devices. Existing practice typically utilizes a single QR code which is relatively easy to capture and relay to an offsite attacker or collaborator. We propose a mechanism using a stream of...
Real-time video delivery in Vehicle-to-Infrastructure (V2I) scenarios enables a variety of multimedia vehicular services. We conduct experiments with Dedicated Short Range Communications (DSRC) transceivers located in mutual proximity and exchanging Skype video call traffic. We demonstrate that the lack of coordination between the users both at the application as well as Medium Access Control...
Neural representations for object recognition are difficult to construct because vision operates in a high-dimensional space. This study aims to develop low-dimensional neural representations (“manifolds”) that could contain either rotation or viewpoint information. In our experiments, four rotating tools were used as visual stimuli and brain activity was recorded using functional magnetic resonance...
The World Wide Web (WWW) is a huge and rapidly growing information repository. Web pages are semi-structured data and contain a variety of information such as text, images, audio, video and other formats. Extracting information from web pages is time consuming and requires a correct approach; this paper presents an improvised algorithm in extracting...
Visual data are rich; they have opened vast analytics opportunities and are widely used in many applications. However, demanding computational and bandwidth requirements have prevented the data from being used in an economically efficient manner. A visual fog paradigm is needed for efficient processing of continuous video streams by collaboratively using things in the Internet...
Nowadays, in our fast-paced world, there are countless MOOCs on the Internet covering various topics, designed to broaden our knowledge. Online videos are among the most powerful tools for effective learning. Many case studies have been carried out to identify the qualities of a good educational video. Philip J. Guo, Juho Kim and Rob Rubin published an article (How Video Production...
Context-aware segmentation of laparoscopic and robot assisted surgical video has been shown to improve performance and perioperative workflow efficiency, and can be used for education and time-critical consultation. Modern pressures on productivity preclude manual video analysis, and hospital policies and legacy infrastructure are often prohibitive of recording and storing large amounts of data. In...
Laughter detection is an essential step towards effective human-computer interaction. This work primarily addresses the problem of laughter detection in a real-time environment. We utilize annotated audio and visual data collected from a Kinect sensor to identify discriminative features for audio and video, separately. We show how the features can be used with classifiers such as support vector...
There is an ongoing debate in the research community over the improved visual quality of UHD video in comparison to the still widely-deployed HD standard. It is the inspiration of many scientific studies, yet UHD displays and services are continuously spreading on the consumer market. This paper presents the results of a subjective paired-comparison test with both upscaled HD and UHD video sequences,...
This paper addresses vision-based tracking and landing of a micro-aerial vehicle (MAV) on a ground vehicle (GV). The camera onboard the MAV is mounted so that the optical axis is aligned with the downward-facing axis of the body-fixed frame. A novel supervised learning vision algorithm is proposed as the method to detect the ground vehicle in the image frame. A feedback linearization technique is...
In this study, an effective approach is proposed for detecting unattended objects in visual surveillance systems. In the proposed approach, foreground regions, determined by background subtraction algorithms applied to streamed video from a fixed camera, are labeled as moving regions. Moving blobs are used to extract information about events occurring in the environment from which the image...
A large amount of video content is created, broadcast, shared and stored every day by industry experts, beginners, and hobbyists. Video summaries aim at showcasing the semantics and content of a clip in reduced time and space to enable a quick overview of video clip relevance. This paper focuses on static summaries showing key frames from the video. The key frames are extracted by leveraging the...
Accurate prediction of vehicle ego-motion in real time is crucial for an autonomous driving system. In this paper, we formulate the problem of ego-motion classification as video event detection, and we propose an end-to-end deep model to address this problem. In this model, we utilize Convolutional Neural Networks (CNNs) to extract semantic visual features of each video frame, and employ a Long Short...
It is important to generate video summaries that are both interesting and representative for massive amounts of video. This work proposes a new method to generate dynamic video summaries using multiple features and image quality, without human involvement in any part of the procedure. Specifically, we first split a video into several video clips. Second, a set of features including visual attention, exposure of light, saturation,...
Whenever a video is digitized, compressed and transmitted across a network, some degradation may be introduced that affects the quality of the received video. Thus, it is essential to give the provider a feedback system through which it can be reported whether the quality of the transmitted video could be improved. This is an...
With the rapid development of the Internet and multimedia, streaming media processing and transmission face new challenges. On the one hand, the demand for quantity and quality of multimedia resources is growing, which requires matching data storage and transmission capabilities. On the other hand, restricted by the computer's processing power, heterogeneous network environment, network...