The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
In recent years, various approaches have been investigated towards blind image quality assessment (IQA) with high accuracy and low complexity. In this paper we develop a pre-saliency map based blind IQA method, which takes advantage of saliency information in prior of quality prediction for performance enhancement by two steps. 1) We split the image into patches and design a convolution neural network...
Text is the easiest means to record information but need not always be the best means for understanding a concept. In psychological theories, it is argued that when information is presented visually, it provides a better means to understand a concept. While techniques exist for generating text from a given image, the inverse problem that is to automatically fetch coherent images to represent a given...
In this paper, we describe the LFM-1b User Genre Profile dataset. It provides detailed information on musical genre preferences for more than 120,000 listeners and links to the LFM-1b dataset. We created the dataset by exploiting social tags, indexing them using two genre term sets, and aggregating the resulting annotated listening events on the user level. We foresee several applications of the dataset...
It is usual for a consumer to search a product based on its category and go to related kind of shop to buy a product, e.g. food in supermarket, a pencil from a stationary shop and etc. While it is not uncommon nowadays for a shop to sell various categories of goods at the same time, like a newspaper stand do sell toys, an accessory shop has stationary. However, consumer may not easily notice and purchase...
With the popularity of social networks, users can communicate with each other in a more convenient way. However, the increasing amount of data poses new challenges for the analysis of the social activities of the users. In this paper, we propose to visualize the heterogeneous information of user interactions in a social network in a three-dimensional way using the concept of solar systems. The target...
Considering the cultural background of users is known to improve recommender systems for multimedia items. In this work, we focus on music and analyze user demographics and music listening events in a large corpus (120,000 users, 109 events) from Last.fm to investigate whether similarity between countries in terms of cultural and socio-economic factors is reflected in music taste. To this end, we...
Although light field data provides abundant cues for depth estimation, light field depth estimation suffers from occlusion and uncertain edges. In this paper, we propose occlusion robust light field depth estimation using segmentation guided bilateral filtering. First, we calculate refocused images from light field data using digital refocusing. Second, we perform support vector machines (SVM) classification...
In this paper, we propose a new adaptation approach for viewport-adaptive streaming of 360-degree videos over the Internet. The proposed approach is able to systematically decide quality levels of tiles according to user head movements and network conditions by taking into account not only prediction errors but also user head movements in each adaptation interval. Experimental results show that the...
The JPEG committee (formally, ISO SC29 WG1) is currently standardizing a lightweight mezzanine codec for video over IP transport under the name JPEG XS. A particular challenging design constraint of this codec is multi-generation robustness, that is the necessity to minimize the error built-up under multiple re-compression cycles. In this paper, we discuss the sources of such errors, how they are...
The high demand of bandwidth from multimedia applications, specially video applications which consume the great majority of the Internet bandwidth, has caused a challenge for service providers and network operators. On the one hand, the allocation of bandwidth in a fair manner for multimedia users is necessary, so that the total utility of all users is maximized for higher quality of experience. On...
Motor imagery (MI) based on brain computer interfaces (BCIs) have been widely applied for upper limb motor rehabilitation. Due to the fact that a large number of disabled people need to restore or improve walking ability, it is also important to investigate the use of MI-based BCIs for lower limb motor rehabilitation. The brain activity of lower limb MI is more difficult to detect because of low reliability...
Recently, deep learning has enjoyed a great deal of success for computer vision problems due to its capability to model highly complex tasks, such as image classification, object detection, face recognition, among many others. Although these neural networks are nowadays very powerful, there is a huge amount of parameters (i.e. the model) that need to be learned and require considerable storage space...
Omnidirectional, also referred to as 360º, visual content provides an immersive experience since it allows users to view a visual scene from different directions. The overall content typically covers a full sphere, and omnidirectional videos or images are processed to obtain a projection on a 2D plane of a fraction of the sphere (aka viewport), which is shown to the user. Therefore, users can look...
This paper introduces an open-source HEVC video call application called Kvazzup. This academic proposal is the first HEVC-based end-to-end video call system with a user-friendly Graphical User Interface for call management. Kvazzup is built on the Qt framework and it makes use of four open-source tools: Kvazaar for HEVC encoding, OpenHEVC for HEVC decoding, Opus codec for audio coding, and Live555...
In this paper, we present a realtime natural interaction system by using Kinect sensor. It can stability and smoothly control hand like mouse by user's holding hands, and implement common mouse operations such as 'clicking', 'dragging' and 'dropping' so on. Our interaction system is made of several novel technique. It can identify the user's interaction with intent by detecting the engaged/disengaged...
This paper demonstrates the usage of Kvazaar open-source HEVC intra encoder in 4K real-time video encoding. In this setup, a raw 4K video is shot by an action camera, captured by an HDMI capture card, encoded in real-time by Kvazaar ultrafast preset on a 22-core Intel Xeon processor, sent to a laptop, and decoded by OpenHEVC decoder for playback. The encoding process is visualized on the fly by Kvazaar...
The subsequence-matching operation applied to motion capture data searches in long motion sequences to locate their parts that are similar to a query example. An effective and efficient implementation of such operation is valuable to increase reusability and findability of expensively recorded data in the past. This demonstration paper builds on recent advances in the field of motion-data processing...
In this work, we propose to derive the attribute specific similarity score for a pair of images using an existing parent deep model. As an example, given two facial images, we derive a similarity score for attributes like gender and complexion using an existing face recognition model. It is not always feasible to train a new model for each attribute, as training of deep neural network based model...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.