The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Face detection and landmark localization have been extensively investigated and are the prerequisite for many face applications, such as face recognition and 3D face reconstruction. Most existing methods achieve success on only one of the two problems. In this paper, we propose a coupled encoderdecoder network to jointly detect faces and localize facial key points. The encoder and decoder generate...
This study addresses an automatic approach to analyze the structure of large scale web videos based on visual and acoustic information. In our approach, video streams are macro-segmented via mining the duplicate sequences. Acoustic and visual information are both adopted for mining so as to avoid missing true-positive. Web videos contain severe visual and acoustic distortions, differing to TV data,...
This paper introduces our system competing in MSR-Bing Image Retrieval Challenge at ICME 2014. The task of the challenge is to rank images by their relevance to a given topic, by leveraging cues hidden in search engine's click log. With the successful trial in last year's challenge, search-based method is shown to be effective in this task. We reserve the basic idea of search-based method in our new...
Image search reranking has become a widely-used approach to significantly boost retrieval performance in the state-of-art content-based image retrieval system. Most of the methods merely rely on matching visual distances between query and initial results or among initial results to detect confident samples relevant to query. However, they may fail to rerank due to the existence of a huge gap between...
The state of the art in query expansion is mainly based on the spatial information. These methods achieve high performance, however, suffer from huge computation and memory.
The exponential growth of web videos brings content based copy detection into a crucial issue. Besides the image information, audio also plays an important role in copy detection. In this paper, the audio-based copy detection framework is introduced. Three contributions are presented: (1) the band energy difference based feature is improved by adding multi-scale information, which extends the candidate...
A novel word-based algorithm is presented to detect duplicate Picture in Picture (PiP) video sequences in this study. The conventional edge-based methods used to extract the PiP regions are not robust in the noise and blurring images. Bag of Words (BOW) model emphasizes words ambiguity and ignores spatial information. Without detecting the PiP regions and unlike the traditional word based approach,...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.