The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Histopathological analysis of tissues has been gaining a lot of interests recently, from developing computer algorithms to assist pathologists from cell detection and counting, to tissue classification and cancer grading. With the advent of whole slide imaging, the field of digital pathology has gained enormous popularity, and is currently regarded as one of the most promising avenues of diagnostic...
Traffic sign recognition is an important step for integrating smart vehicles into existing road transportation systems. In this paper, an NVIDIA Jetson TX1-based traffic sign recognition system is introduced for driver assistance applications. The system incorporates two major operations, traffic sign detection and recognition. Image color and shape based detection is used to locate potential signs...
From the empirical studies, it is quite difficult for the license plate recognition to perform 100% accuracy in a real world environment. Nevertheless, it is common that only a few characters are misread from a license plate recognition system. In this paper, license plate matching is used for vehicle re-identification. We evaluate several approximate string matching techniques to determine an applicable...
Monitoring phenology of agricultural plants is a critical understanding in precision agriculture. Vital improvements can be achieved with precise detection of phenological change of plants which would henceforth improve the timing for the harvest, pest control, yield prediction, farm monitoring, disaster warning etc. Many countries across the world have been developing initiatives to build national...
Multimedia semantic concept detection is one of the major research topics in multimedia data analysis in recent years. Disaster information management needs the assistance of multimedia data analysis to better utilize those disasterrelated information, which has been widely shared by people through the Internet. In this paper, a Feature Affinity based Multiple Correspondence Analysis and Decision...
In this paper, we propose a new person re-identification algorithm based on bi-directional superpixel earth mover's distance (BD-SP-EMD). To address the viewpoint change issue, the human body segmentation is first extracted based on background modeling and saliency maps. A bi-directional scheme is then applied to obtain the forward and backward SP-EMD distances. Based on these two distances, pedestrians...
In this paper, we propose a pedestrian attribute recognition approach and a CNN-based person re-identification framework enhanced by pedestrian attributes. The knowledge of person attributes can help video surveillance tasks like person re-identification as well as person search, semantic video indexing and retrieval to overcome viewpoint changes with their robustness to the inherent visual appearance...
Color is one of the attributes that play a role in identifying specific objects, color processing including the extraction of information about the spectral properties of the object's surface and look for the best similarity of a set of descriptions which have been known to do an introduction. Therefore, the classification is needed right fuji apples to obtain good quality fruit. Fuzzy model is one...
We propose a real-time algorithm for the generic classification of humans and objects in 3D scenes. The algorithm does not depend on color information and works with depth data alone, making it very flexible for a wide area of applications. Further, we will show that it is very resistant to occlusion and will give correct classification results even in cases, where only a fraction of a full human...
This paper proposes an insulator defect detection algorithm based on computer vision for helicopter aerial insulator imaging in complex backgrounds. The algorithm runs fast with high detection accuracy, which meets the requirements for detecting missing insulators. However, because the background of the insulator image acquired by aerial photography is complicated and there is more than one insulator...
The segmentation of some scenes can be better for its next step in the processing and analysis. In this paper, the Gaussian mixture model clustering is used to detect and segment the scenes in the sports competition. Firstly, the main color of the scene is extracted by the method of color space histogram, and it is used as a sample for local training. Then we use the expectation maximization algorithm...
Compositing is one of the most common operations in photo editing. To generate realistic composites, the appearances of foreground and background need to be adjusted to make them compatible. Previous approaches to harmonize composites have focused on learning statistical relationships between hand-crafted appearance features of the foreground and background, which is unreliable especially when the...
Improvements in color constancy have arisen from the use of convolutional neural networks (CNNs). However, the patch-based CNNs that exist for this problem are faced with the issue of estimation ambiguity, where a patch may contain insufficient information to establish a unique or even a limited possible range of illumination colors. Image patches with estimation ambiguity not only appear with great...
We present a method for synthesizing a frontal, neutral-expression image of a persons face, given an input face photograph. This is achieved by learning to generate facial landmarks and textures from features extracted from a facial-recognition network. Unlike previous generative approaches, our encoding feature vector is largely invariant to lighting, pose, and facial expression. Exploiting this...
Image matting is a fundamental computer vision problem and has many applications. Previous algorithms have poor performance when an image has similar foreground and background colors or complicated textures. The main reasons are prior methods 1) only use low-level features and 2) lack high-level context. In this paper, we propose a novel deep learning based algorithm that can tackle both these problems...
Transferring artistic styles onto everyday photographs has become an extremely popular task in both academia and industry. Recently, offline training has replaced online iterative optimization, enabling nearly real-time stylization. When those stylization networks are applied directly to high-resolution images, however, the style of localized regions often appears less similar to the desired artistic...
We propose a general framework called Network Dissection for quantifying the interpretability of latent representations of CNNs by evaluating the alignment between individual hidden units and a set of semantic concepts. Given any CNN model, the proposed method draws on a data set of concepts to score the semantics of hidden units at each intermediate convolutional layer. The units with semantics are...
Colorization is an ambiguous problem, with multiple viable colorizations for a single grey-level image. However, previous methods only produce the single most probable colorization. Our goal is to model the diversity intrinsic to the problem of colorization and produce multiple colorizations that display long-scale spatial co-ordination. We learn a low dimensional embedding of color fields using a...
Re-identification of people in surveillance footage must cope with drastic variations in color, background, viewing angle and a persons pose. Supervised techniques are often the most effective, but require extensive annotation which is infeasible for large camera networks. Unlike previous supervised learning approaches that require hundreds of annotated subjects, we learn a metric using a novel one-shot...
We propose split-brain autoencoders, a straightforward modification of the traditional autoencoder architecture, for unsupervised representation learning. The method adds a split to the network, resulting in two disjoint sub-networks. Each sub-network is trained to perform a difficult task – predicting one subset of the data channels from another. Together, the sub-networks extract features...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.