The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
This paper proposes a new approach for analysing crowded video scenes. The proposed approach decomposes the scene motion dynamics into a graph of interconnected atomic elements of coherent motions named Motion Units (MUs). Different MUs cover scene's local regions with different size and shape, which can even overlap. MUs relationships are analysed to discover the scene entrances and exits. Dominant...
Numerous style transfer methods which produce artistic styles of portraits have been proposed to date. However, the inverse problem of converting the stylized portraits back into realistic faces is yet to be investigated thoroughly. Reverting an artistic portrait to its original photo- realistic face image has potential to facilitate human perception and identity analysis. In this paper, we propose...
Multi-object video segmentation and multi-object tracking are very similar in the aspect that both determine the locations and maintain the identities of the objects of interest (targets) in each frame of the video. Our approach takes advantage of this fact and uses the strengths of one task to improve the accuracy of the other. In our framework, the multi-object tracking and segmentation modules...
Provides an abstract for each of the keynote presentations and may include a brief professional biography of each presenter. The complete presentations were not made available for publication as part of the conference proceedings.
The use of autonomous drones in industrial inspection is gaining momentum with improvements in hardware and control. Considering the availability of historical data as drones gather information by regular sorties, a new opportunity for change detection is emerging for inspection and maintenance. In this paper, we propose a visual change detection framework using multi- scale super pixel approach....
The bone age of a child indicates the skeletal and biological maturity of an individual. The most commonly applied clinical methods for Bone Age Assessment (BAA) are based on the visual examination of ossification of individual bones in radiographs of the left hand and the wrist by comparing with standard hand atlas. This kind of method is highly subjective and the performance extremely depends on...
Digital and paper based documents co-exist in our daily lives. Seamless integration of information from both sources is crucial for efficient knowledge management. This paper address the algorithm that can handle the detection of document so that it can be captured easily to convert it into a digital form for automatic integration of relevant information in electronic work flows. It uses the deep...
Due to the superiority in handling label ambiguity, multiple instance learning (MIL) has been introduced into adaptive tracking-by-detection methods to alleviate drift and yields promising tracking performance. However, the MIL tracker assumes that all samples in a positive bag contribute equally to the bag probability, which ignores sample importance. To address this issue, in this paper we propose...
Robust visual object tracking against occlusions and deformations is still very challenging task. To tackle these issues, existing Convolutional Neural Networks (CNNs) based trackers either fail to handle them or can just run in low speed. In this paper, we present a realtime tracker which is robust to occlusions and deformations based on a Region-based, Multi-Scale Fully Convolutional Siamese Network...
Scene Text Recognition is an extremely useful but challenging task and has drawn much attention in recent years. The best of previous model is CNN-LSTM model with attention mechanism, and it can recognize the whole text without character-level segmentation and recognition. Compared with LSTM, Recurrent Highway Networks (RHN), as a popular architecture because of its capability of training deep structure,...
Real-time lane detection and tracking is one of the most reliable approaches to prevent road accidents by alarming the driver of the excessive lane changes. This paper addresses the problem of correct lane detection and tracking of the current lane of a vehicle in real-time. We propose a solution that is computationally efficient and performs better than previous approaches. The proposed algorithm...
Accurate early lung cancer detection is essential towards precision oncology and would effectively improve the patients' survival rate. In this work, we explore the lung cancer early detection capacity by learning from deep spatial lung features. A 3D CNN network architecture is constructed with segmented CT lung volumes as training and testing samples. The new model extracts and projects 3D features...
Despite impressive results in object classification, verification and recognition, most deep neural network based recognition systems become brittle when the view point of the camera changes dramatically. Robustness to geometric transformations is highly desirable for applications like wild life monitoring where there is no control on the pose of the objects of interest. The images of different objects...
The PCA dimensionality reduction algorithm for 2D data with the Laplacian noise model, i.e., L1-2DPCA, not only preserves the structural relation among 2D data, but also is robust for data outliers. The algorithm relies on the EM algorithm with great computational cost. In order to learn intrinsic information more consistently, this paper takes a view of manifold optimization for the model based on...
There is a continuous rise of the number of trains' passengers to transport, therefore a densification of traffic is studied. For safety reasons, it becomes necessary to know the wheel-rail contact condition, in traffic, as it affects driving variables as adherence. The latest determines passenger's safety and the proper functioning of train equipment but it is often deteriorated due to recurrent...
This paper describes a latent space understandable network: Self- excited Generative Adversarial Network (Self- ExGAN), a novel self- excited structure based on adversarial learning. Compared with the conventional generative adversarial networks, SelfExGAN consists of three components, which are en- coder (E), generator (G), and discriminator (D). Different from other papers which directly apply reconstruction...
In image retrieval, an effective dissimilarity (or similarity) measure is required to retrieve the perceptually similar images. Minkowski-type distance is widely used for image retrieval, however it has its limitation. It focuses on distance between image features and ignores the data distribution of the image features, which can play an important role in measuring perceptual similarity of images...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.