The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Discriminative correlation filters (DCF) have aroused great interests in visual object tracking in recent years due to the accuracy and computation efficiency. However, occlusion is still the main factor that affects performance. In this paper, a spatial-temporal consistent correlation filter utilizes the rich features extracted from a pre-trained convolutional neural network (CNN) is proposed to...
Visual tracking is a significant but challenging field in computer vision. Although considerable progress has been made in recent years, robust tracking in complicated scenes remains an open problem. Trackers get confused easily when similar objects appear or heavy clutter occurs due to indistinguishable features. In this work, a more effective feature extraction method based on convolutional neural...
Robust visual tracking is a significant but challenging task in computer vision. Deep convolutional neural networks have been proverbially applied to visual tracking in recent years by learning a genetic representation from numerous training images. However, the deep networks training is time-consuming. In this work, an efficient and robust tracking algorithm using a small single Convolutional Neural...
Bone texture characterization is important for differentiating osteoporotic and healthy subjects. Automated classification is however very challenging due to the high degree of visual similarity between the two types of images. In this paper, we propose to describe the bone textures by extracting dense sets of local descriptors and encoding them with the improved Fisher vector (IFV). Compared to the...
The text-to-scene conversion is such a process that converts the input text into 3D scenes automatically based on the natural language processing. The scene layout is the basic research content and the key section of the text-to-scene conversion. It realizes the automatic layout through identifying the information about scene layout from the input text and relying on a large database of 3D models...
An effective video coding should not only remove statistical redundancy but also take into account the characteristic of human visual system. Just noticeable difference (JND) represents the maximum distortion that cannot be perceived in a suitable viewing condition. In this paper, we introduce a foveated JND model combined with visual attention model. Though moving object always attracts viewer's...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.