The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Many HCI researchers are keen in producing accessible computer applications for the users. However, not many of these findings focus on disable user especially the blind. Most of the technological advancement meant for the blind user is in the form of assistive technologies mainly screen reader. Blind user has highlighted the inefficiency of screen reader as well as other unobtainable and expensive...
The development of delay sensitive applications needs massive data storage and computing resources, especially in a typical cloud environment. The cloud computing paradigm provides a broad range of services viz. software, platform, and infrastructure for various applications (both real-time and non real-time) over the Internet. But, in the case of Infrastructure-as-a-Service (IaaS) cloud platform,...
For TCP-friendly multimedia applications, congestion control may bring about disadvantages such as the variability of sending rate and Round Trip Time although it helps to improve robustness of Internet. Aiming at alleviating this variability and further guaranteeing the user-perceived service quality in the aspect of network layer, this paper proposes a smooth adaptive adjustment mechanism for Random...
Obesity phenomenon has become a significant issue over the world. Obesity has various negative consequences that might impact not only the health but also the social and the economic issues. Current studies reveal the lack of patients' commitment to the doctors' instructions. In this paper, we propose a new cloud-based model with ultimate aim to monitor obese patients' health condition and behavior...
A novel photomosaic art with three-layer information is proposed in this paper. In addition to the over-arching image can be seen from a distance and a matrix of individual images when looked closely, a QR code can be accessed by taking a picture of the whole photomosaic using public QR code scanners. In the proposed scheme, a tile image classification procedure is carefully designed to dispatch appropriate...
Rate-constrained motion estimation (RCME) is considered to be the most time-consuming process of H.265/HEVC encoding. Massively parallel architectures, such as graphics processing units (GPUs), used in combination with a multi-core central processing unit (CPU), provide a promising computing platform to achieve fast encoding. However, the inherent dependencies in the process for deriving motion vector...
Users of video-sharing sites often search for derivative works of music, such as live versions, covers, and remixes. Audio and video content are both important for retrieval: “karaoke” specifies audio content (instrumental version) and video content (animated lyrics). Although YouTube's text search is fairly reliable, many search results do not match the exact query. We introduce an algorithm to classify...
Deep learning has led to many breakthroughs in machine perception and data mining. Although there are many substantial advances of deep learning in the applications of image recognition and natural language processing, very few work has been done in video analysis and semantic event detection. Very deep inception and residual networks have yielded promising results in the 2014 and 2015 ILSVRC challenges,...
Action recognition in still images is a challenging task in computer vision. Recent successes in deep feature-learning advance this research, employing robust and rich-semantic feature representation. However, the issue that recognition fails when two action images share similar contexts is long-standing. In this paper, we employ metric learning method to address within-class and between-class confusions...
Media playout buffer is widely employed by today's streaming media player to cope with short-term network variation and achieve continuous playout. However, the playout buffer inevitably introduces additional latency, affecting mobile live streaming experience. In this paper, we propose a novel adaptive playout buffer management approach to dynamically optimize the buffer latency while to a great...
In this paper we deal with two image-based object search tasks in the fashion domain, clothing attribute prediction and cross-domain shoe retrieval. Clothing attribute prediction is about describing the appearances of clothes via semantic attributes and cross-domain shoe retrieval aims at retrieving the same shoe items from online stores given a daily life shoe photo. We jointly solve these two problems...
Existing interactive systems suffer from low user engagement due to their passiveness and steep learning curve. To address these issues, this paper presents an interactive framework, Notify-and-Interact, which leverages the Bluetooth low energy (BLE) beacon infrastructure to notify and a smart-phone to interact, such that it transforms a passive interactive system into an active one. The proposed...
Quality of Experience (QoE) assessment of multimedia services is a challenging task and an understanding of how the user perceives quality at the physiological level would facilitate this. Physiological signals, such as the electroencephalogram (EEG), have shown promise in revealing the subject's emotion or attention in quality assessment and the correlation of this with media service quality. This...
The constrained local model (CLM) proposes a paradigm that the locations of a set of local landmark detectors are constrained to lie in a subspace, spanned by a shape point distribution model (PDM). Fitting the model to an object involves two steps. A response map, which represents the likelihood of locations for a landmark, is first computed for each landmark using local-texture detectors. Then,...
Deep learning is a popular method for monaural source separation, and especially for extracting a singing voice from a single-channel song. However, deep learning-based source separation ignores the geometrical structure of the input data. This work develops a novel approach to source separation that is based on non-negative matrix factorization (NMF) and deep recurrent neural networks (DRNN) with...
Hashing has been recognized as one of the most promising ways in indexing and retrieving high-dimensional data due to the excellent merits in efficiency and effectiveness. Nevertheless, most existing approaches inevitably suffer from the problem of “semantic gap”, especially when facing the rapid evolution of newly-emerging “unseen” categories on the Web. In this work, we propose an innovative approach,...
The popularity of multi-view panoramic videos has been considerably increased for producing Virtual Reality (VR) content, due to its immersive visual experience. We argue in this paper that PSNR is less effective in assessing visual quality of compressed panoramic videos than Sphere-based PSNR (S-PNSR), in which sphere-to-plain mapping of panoramic videos is considered. Thus, the conventional rate...
in idea is to express a ballot to allow voting for up to out the candidates and unlimited participants. The purpose of vote is to select more than one winner among candidates. Our result is complementary to the result by Sun peiyong ¡äs scheme, in the sense, their scheme is not amenable for large-scale electronic voting due to flaw of ballot structure. In our scheme the vote is split and hidden, and...
Delivering persuasive presentation is a challenging task for the university students. One of the difficult parts is grasping the attention of audiences during the critical situations, such as in the early morning or after the lunch break. At the situations, students are not paying more attention to any speeches or presentations occurred in the classroom because of some lapses of attention span such...
Multimedia clips, such as lecture recordings and screencasts, are increasingly used in both formal and informal learning contexts, such as flipped classroom, blended learning, MOOCs and mobile learning. In order to create effective educational multimedia applications, it is increasingly important to understand the factors contributing to the learning performance and learner experience. This paper...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.