The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
MQ arithmetic coder has been adopted to achieve entropy coding in the latest image compression standard JPEG2000. which is a bit-level operation with intensive branch and feedback thus becomes a serious bottleneck of high speed JPEG2000. In this paper, an efficient fast algorithm for MQ coder was proposed, in which the renormalization process with BYTEOUT was performed in batch fashion instead of...
One basic observation for pedestrian detection in video sequences is that both appearance and motion information are important to model the moving people. Based on this observation, we propose a new kind of features, 3D Haar-like (3DHaar) features. Motivated by the success of Haar-like features in image based face detection and differential-frame based pedestrian detection, we naturally extend this...
This paper presents a system architecture and information retrieval strategy for multimedia content which exploits descriptive metadata as well as domain ontology. We propose a query processing model including a semantic ranking scheme which can retrieve multimedia objects semantically relevant to the user query and provide users with a search result categorized by concepts and ordered by their semantic...
With the proliferation of handheld devices, the demand of multimedia information retrieval on mobile devices has attracted more attention. A relevance feedback information retrieval process usually includes several rounds of query refinement. Each round incurs exchange of tens of images between the mobile device and the server. With limited wireless bandwidth, this process can incur substantial delay...
In this paper, we enhance group (or inter-destination) synchronization control, which adjusts the output timing among multiple destinations, for a remote drawing system using haptic media. Under the control, an instructor and a learner can draw figures while watching the identical virtual space. By subjective assessment, we demonstrate that Mean Opinion Score (MOS) of the instructor can be improved...
Annotation is the core item to enable efficient access to media. Thus, automatic annotation is a highly challenging problem. We present a novel solution to personal digital photograph annotation problem which annotates photo based on users' personal, and public information. We use timestamps of photo together with exact location name as pivots to search for more contextual metadata from users' emails,...
In recent years, some computer vision algorithms such as SIFT (scale invariant feature transform) have been employed in image similarity match to perform image-based search applications. However, with the increasing scale of image databases, centralized image retrieval system no longer provide adequate prompt search. In this paper, we design a scalable distributed architecture, which is analog to...
Recent researches have shown that one can use distributed hash table (DHT) to build scalable and robust distributed systems. In this paper, we propose an efficient hierarchical DHT-based method for addressing two important problems that are encountered while using DHTs in distributed multimedia information queries: multi-attribute query, and range query. The structure of our method consists of two...
The massive amount of multimedia information especially video available on the Web requires a more precise and interactive retrieval. Current operational video retrieval systems do not make use of the implicit visual features but rely only on textual metadata supplied by the user during uploading. This greatly affects the retrieval performance as the metadata may not be comprehensive or consistent...
Privacy protection issue introduces numerous challenges in the multimedia processing domain. In this paper, we propose an anonymization framework for audio clinical data. The HMM based keyword recognition technique is used to locate the predefined sensitive keywords, which are identified by the users or patients in advance. These keywords will then be substituted by the synthesized nominal words of...
We suggest a method for automatic identification of respiratory sounds, for example, identifying wheeze from normal breath sounds. Here we apply higher order moments over time and frequency planes. The method is based on the use of efficient fast Gabor spectrogram followed by our recursively measured instantaneous kurtosis and the sample entropy. The input signal is analyzed first by using a fast...
With the improvement of network bandwidth, multimedia services based on streaming live media have gained much attention recently, among which IPTV has become a hot topic. After emergence of Peer-to-Peer (P2P) technology, P2P based IPTV systems are deployed widely. However, there exists a huge challenge, which is how to manage content. Digital rights management (DRM) is such a system that includes...
Current studies on digital rights management (DRM) have focused on security and encryption as a means of solving the issue of illegal copying by purchasers. In this paper, we propose a scheme that can adapt one DRM's DRM content to another DRM's DRM content in PAV (portable audio & video) device environments. The proposed DRM content adaptation is for making one DRM system use another DRM's DRM...
Voice over IP (VoIP) has experienced tremendous growth in recent years due to its low cost and flexible service enhancement. However, it is vulnerable to security attack. The most popular solution to providing secure VoIP service is based on the advanced encryption standard (AES). The practice for AES-based solution is to adopt a common secret key negotiated during a VoIP call setup phase. This single...
Chromatic aberration is the phenomenon where light of different wavelengths fail to converge at the same position on the focal plane. There are two kinds of chromatic aberration: longitudinal aberration causes different wavelengths to focus at different distances from the lens while lateral aberration is attributed to different wavelengths focusing at different positions on the sensor. In this paper,...
Automatic image annotation is crucial for keyword-based image retrieval because it can be used to improve the textual description of images efficiently. For this purpose, many methods have been developed. Due to the restrictions of computational complexity and small training set, the image annotation methods are usually based on the probability of individual word, instead of the joint probability...
We present an interactive visualization, called table of video contents (TOVC), for browsing structured TV programs such as news, magazines or sports. In these telecasts, getting a good segmentation can be very time-consuming, especially in an annotating context. This visualization, connected with a classical media player, offers a very handy video browser. This system allows a global overview by...
In this paper, we introduce a new module, Codebook+, into a classical framework which combines bag-of-words image representation with probabilistic latent semantic analysis (pLSA) for unsupervised object categorization. This new module makes the framework less sensitive to the image sampling methods as well as improves its performance. In this module, we create a new codebook based on the discriminability...
During recent years, the quick development of computer techniques has witnessed the ever-increasing surveillance video data, which essentially pose great challenge on the data storage, management, analysis and even retrieval. Considering that most of the high volume of data is with no interest, we mainly investigate the problem of effectively and efficiently discovering segments-of-interest (SoI)...
Image annotation refinement is crucial to improve the performance of automatic image annotation, in which the estimation of word correlation is a key issue. Typically, the word co-occurrence information may be utilized to estimate the word correlation. However, this approach is not accurate enough because it equally treats any word pair co-occurring in the training data and cannot extract synonymy...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.