In this paper, we address the challenging task of recovering the 3D human pose from a single monocular image, which may be a synthetic image or a real internet image. Retrieval and reconstruction of the articulated 3D pose are both prerequisites for analyzing people in images and videos. We address both tasks together and propose an efficient framework for search & retrieval...
Films seek to elicit emotions in viewers by infusing the story they tell with an affective character or tone - in a word, a mood. In content-based multimedia analysis, considerable effort has been made to develop methods to estimate film affect computationally. However, results have been hampered by a tendency to classify film scenes either by genre or not at all, while other potentially helpful classification...
To deal with the rigid template matching problem in real-world scenarios, we propose a novel iterative feature-pair updating framework that is also robust to high levels of outliers caused by background changes, complex nonrigid deformation, and partial occlusion. Given a template image and a target image, we first extract a set of corresponding feature-pairs as candidates. Then, we propose...
This paper extends the Stacked Denoising Autoencoder to build a deep neural network whose weights are initialized from the encoder's weights and which uses Dropout to reduce the error rate in the fine-tuning stage. The network is trained on student information from recent years and predicts the likelihood that students drop out during the semester...
With the development of digital photographic technology, the number of video files is growing rapidly, and there is great demand for automatic video semantic analysis in many scenarios, such as video semantic understanding, content-based analysis, and video retrieval. Shot boundary detection is a key basic technology and the first step in video analysis. However, recent methods are time-consuming and perform poorly in the...
Kotenseki is a collection of classical and ancient Japanese literature. It comprises image books that tell Japanese stories through comic drawings of different characters, such as humans, nature, and animals. To preserve them effectively for posterity, a search system is important. We propose an efficient CBIR system to help users easily access the information and have an enjoyable...
Collaborative filtering is widely used in recommender systems. When training data are extremely sparse, neighbor selection methods work ineffectively. To address this issue, this paper proposes a distributed representation model that represents users as low-dimensional vectors for neighbor selection by considering the chronological order of users' ratings. Experiments show that the proposed method...
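As a rough illustration of neighbor selection over low-dimensional user vectors, the sketch below ranks users by cosine similarity and keeps the top k. It assumes the vectors have already been learned; the abstract does not say how the model uses rating order or which similarity measure it applies, so `cosine`, `select_neighbors`, and the sample vectors are hypothetical.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)

def select_neighbors(user_vectors, target, k):
    """Return the ids of the k users most similar to `target`.

    `user_vectors` maps a user id to its (hypothetical) learned vector.
    Cosine similarity is an assumption, not the measure named in the abstract.
    """
    scored = [
        (cosine(user_vectors[target], vec), uid)
        for uid, vec in user_vectors.items()
        if uid != target
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [uid for _, uid in scored[:k]]

# Illustrative vectors only; not data from the paper.
vectors = {"a": [1.0, 0.0], "b": [0.9, 0.1], "c": [0.0, 1.0], "d": [-1.0, 0.0]}
neighbors = select_neighbors(vectors, "a", 2)
```

Because dense vectors can be compared even when two users share no rated items, this kind of selection still works on very sparse rating data.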
Accurate Human Epithelial-2 (HEp-2) cell image classification plays an important role in the diagnosis of many autoimmune diseases. However, the traditional approach requires experienced experts to identify cell patterns manually, which greatly increases the workload and is subject to the physician's subjective opinion. To address this, we propose a very deep residual network (ResNet) based framework...
In this paper, a new method of hand gesture recognition is proposed. First, the hand region is segmented based on depth information. Then the wavelet feature is calculated from the wavelet invariant moments of the hand region, and the distance feature is extracted by calculating the distance from the fingers to the hand centroid. Next, a feature vector which is composed of wavelet invariant moments...
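The distance feature mentioned in this abstract can be sketched as follows, assuming the hand region is given as a list of pixel coordinates and the fingertip positions are already known. The function names, the max-normalization step, and the sample coordinates are hypothetical; the abstract does not describe fingertip detection or any normalization.

```python
import math

def centroid(points):
    """Mean (x, y) position of a non-empty list of 2D points."""
    n = len(points)
    return (sum(x for x, _ in points) / n, sum(y for _, y in points) / n)

def fingertip_distances(hand_pixels, fingertips, normalize=True):
    """Euclidean distance from each fingertip to the hand-region centroid.

    Dividing by the largest distance (an assumption, not stated in the
    abstract) makes the feature independent of hand size in the image.
    """
    cx, cy = centroid(hand_pixels)
    distances = [math.hypot(x - cx, y - cy) for x, y in fingertips]
    if normalize and distances:
        longest = max(distances)
        if longest > 0:
            distances = [d / longest for d in distances]
    return distances

# Illustrative coordinates only: a 2x2 "hand" with centroid (1, 1).
hand = [(0, 0), (2, 0), (0, 2), (2, 2)]
tips = [(1, 4), (1, 3)]
raw = fingertip_distances(hand, tips, normalize=False)
scaled = fingertip_distances(hand, tips)
```

Here `raw` holds the distances in pixels and `scaled` holds the same distances divided by the longest one.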
The increasing demand for mobile applications has brought a large amount of mobile traffic. To meet users' requirements for high-quality video delivery, it is an urgent task to provide fair-quality video delivery for various users and situations. In this paper, we evaluate QoS and QoE characteristics and validate QoE unfriendliness in heterogeneous DASH content distribution to provide QoE-fair video...
In this paper, we propose a method for image selection using Web image search for automatic video biography authoring. In the proposed method, images are selected from the image search results considering their visual contents for inclusion in the video biography. Through evaluation, we confirmed the effectiveness of the proposed image selection method compared to a baseline method which simply selects...
The method proposed in this paper focuses on problems of motion detection and counting of very small moving objects in videos. Existing video processing methods need a defined form or a sufficient size of moving objects and do not provide accurate results in the case of very small moving objects. Many false detections can occur or many moving objects can be missed. To deal with these problems, reliability...
The computer mouse is the main interaction device for graphical user interfaces. Many attempts have been made to replace it or render it obsolete, ranging from more ergonomically shaped designs to pen-like devices or touch screens. While the latter have opened up the way for completely new interaction designs mainly on mobile devices, PC and laptop users still prefer the computer mouse as a pointing...
With the rapid growth of online content consumption, knowing end-users and having actionable content insights have become extremely important for any online content provider. Insights from user segment identification could help in developing content recommendations as well as in acquiring new content. For advertisers, identifying segments could assist in designing ad campaigns with greater target accuracy...
With the rapid advances in digital technology, multimedia documents have become ubiquitous. Analyzing this huge repository of multimedia documents requires organizing the documents efficiently. Multimedia document clustering groups multimedia documents that share common multimedia topics. An important step in multimedia document clustering is computing the similarity between multimedia...
Query suggestion plays a key role in improving the usability of image search. Textual Query Suggestion, widely used in existing text-based image retrieval engines, suggests a list of textual query terms based on the user's query input. This paper presents a color prediction system dedicated to interactive drawing-based image retrieval systems on mobile devices. Usually, such systems retrieve...
Venue photos, a new type of multimedia content, are proliferating on the Internet because users like to take photos and share with their friends which venues they visited and what impressed them there. Discovering a venue from a social photo is very useful for supplementing venue retrieval and recommendation. However, little research has focused on fine-grained venue discovery by leveraging multimodal...
The task of object tracking in rectangular videos has been addressed in recent years by many researchers, with each method proposing a solution for a specific challenge. Handling the variety of challenging situations in object tracking in 360-degree videos is still an unsolved problem that needs more attention. In the real world, the challenging situations include moving camera, high-resolution...
This paper proposes an adaptive sparse learning (ASL) framework to solve the multi-classification problem for neurodegenerative disease analysis. Specifically, we integrate the ideas of feature selection and subspace learning to construct a least-squares regression model. The principles of Fisher's linear discriminant analysis (LDA) and locality preserving projection (LPP) are incorporated to utilize...
Data retrieval plays a critical role in the development of multimedia applications. However, due to the exponential growth of multimedia data, high-speed and efficient indexing is becoming more difficult than ever. In this paper, we propose a novel approach to speed up the retrieval process by adopting a distributed computing paradigm through the Apache Spark framework. Utilizing search...