The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Similarity search in large multimedia databases is an important issue in nowadays multimedia environment. Multimedia objects such as music videos usually consist of multiple representations such as audio or video features. Since each representation may be of significantly different quality for a given multimedia object, similarity search methods could greatly benefit from taking these multiple representations...
In this paper, we propose a novel mechanism of hierarchically indexing soccer video using visual, audio and textual cues. Firstly, video is indexed with information from pure video and audio respectively. Then, video is segmented into physical shots based on visual features, and then identified as syntactic shots according to broadcasting rules. Audio is analyzed to get physical contents and then...
An active warden can significantly reduce the steganographic information capacity by slightly modifying some contents of suspected media. When using QIM-based embedding methods, the steganographic capacity in the presence of active attack can be obtained with a secure transform, an identical suitable quantization step and a corresponding length of embedded bits. It is shown that the maximum attainable...
This paper presents a SVM-based prediction approach for constructing personal recommendation system for TV programs. We have applied support vector machine (SVM) to personal prediction of online Internet electronic program guide (IEPG). Our basic idea is to combine SVM and feedback processing into our system, using user-watched histories as retraining data, to realize personal predictions. We evaluate...
This paper presents a new approach for finding camera shot transitions in the compressed stream of MPEG-1 and MPEG-2 videos. False transitions patterns on the compressed data are considered to be related to the choices made by the encoder, which has the freedom of making several decisions without violating the MPEG-1 and MPEG-2 standards. These choices contaminate the similarity metrics with patterns...
In this paper, a modeling methodology called BN_CPN based on colored Petri nets for business negotiation activities is proposed. With BN_CPN, a Web-based collaborative e-business negotiation system is modeled. Based on the analysis of the characteristics of business negotiation activities, through the expansion of the classical colored Petri net with the connotation of collaborative e-business negotiation,...
Summary form only given. Multimedia research over the past decade has resulted in the development of multimedia data managers that have achieved a reasonable level of sophistication. It is possible now to organize images and video data efficiently, model them reasonably well and perform searches on them based on selected features (query-by-example). The development of XML has facilitated the storage...
In this publication, we apply the concept of foveal imaging to appropriately direct the attention of the viewer during browsing raster image contents. The proposed approaches are based on properties of the human visual system and are rather effective. To meet requirements of mobile environments, we exploit properties of the new and flexible image coding standard JPEG2000 for creation and transmission...
In this paper, we propose a novel cross-media retrieval method. The most important feature of it is to integrate the multi-modal data seamlessly via a cross-reference graph, and then based on the graph, it is able to use improved personalized PageRank to calculate how close the media object associates with the query on semantic and content level. It is also able to adjust the cross-reference graph...
We discuss in this paper an adapted proxy working scheme for SMIL presentation delivery and show how to exploit the causal relations, the time requirements as well as the media characteristics, to deduce a semantic based request pattern. The latter is used to efficiently schedule the pre-fetch requests, as well as the send-back delivery, such that the imposed synchronizations constraints can be met...
Person identification is very important in the domain of multimedia news as it is often the focus of events in news stories and interest of searchers. However, this detection is impeded by the imprecise audio/visual analysis tools. In this paper, we describe a multimodal and multi-faceted approach to person-x detection in news video. We make use of multimodal features extracted from text, visual and...
Summary form only given. Techniques of computer and Internet are developing very fast, and users want to access their interested multimedia information from anywhere at anytime by using their most convenient digital equipments. Sports video always appeals to large audiences, and it becomes an important problem to automatically extracting useful semantic information from sports video to facilitate...
We use the concept of film pace, expressed through the audio, to analyze the broad level narrative structure of film. The narrative structure is divided into visual narration, action sections, and audio narration, plot development sections. We hypothesize that changes in the narrative structure signal a change in audio content, which is reflected by a change in audio pace. We test this hypothesis...
One of the ultimate challenges of computer vision is in video semantic understanding. Many efforts at detecting events in video have focused on structured sequences such as sports or news broadcasts. However even in seemingly freeform media such as feature films, there is an inherent structure and established production codes. Over the last century, film theorists have developed the principles of...
Summary form only given. The explosive growth of multimedia content on the Web has brought many new business opportunities to search engines. In this talk, the author describes our vision of multimedia search and discuss the monetization opportunities and technical challenges in front of us. He argues the importance of combining text, metadata, natural language, content-based analysis, and visualization...
Scalable encoding scheme enables the player or streaming server to adaptively change the playback rate of multimedia content. However, in scalable streaming of layer encoded content, sequential playback of content does not necessarily coincide with the sequential scan of a file. This property introduces another dimension of complexity in the scheduling of data block retrieval. In this work, we develop...
This paper introduces one passive prism based single-lens multi-ocular stereo image capture system. It only employs one real CCD camera with a pyramid-like multi-face glass prism. Each image captured by this system can be divided into three, four or more sub-images which can be taken as the images simultaneously captured by a group of virtual cameras generated by the prism. Hence this system can be...
This paper proposes new features for presenting and authoring MPEG-4 programs using the NCL language. On one hand, the language allows to define relationships among objects inside an MPEG-4 scene and external media objects, including objects in other MPEG-4 scenes. Relationships with several different semantics will also be allowed other than those defined by the standard. An NCL hypermedia formatter...
The greatest obstacle in developing distance learning system is the lack of real-time interaction. This paper provides a real-time interactive shared system for distance learning that combines the audio, video and seminar. The remote learners can study and exchange opinions with the instructor through this system in real time. After the test between Waseda University and Guilin University, we find...
In this paper, we emphasize salient experiences on peer to peer based live video streaming. First, we describe a two-layer overlay structure and a multi-sender based transmission algorithm to broadcast video programs. Although both numerical analysis and practical data reveal feasibility to support nearly 600 concurrent users in 500kbps streaming delivery, the tremendous workload imposed upon structure...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.