The intelligent analysis of video data is currently in wide demand because video is a major source of sensory data in our lives. Text is a prominent and direct source of information in video, yet recent surveys of text detection and recognition in imagery focus mainly on text extraction from scene images. This paper presents a comprehensive survey of text detection, tracking, and recognition...
In this paper, we examined the effectiveness of a deep convolutional neural network (DCNN) for the food photo recognition task. Food recognition is a kind of fine-grained visual recognition, which is a harder problem than conventional image recognition. To tackle this problem, we sought the best combination of DCNN-related techniques such as pre-training with the large-scale ImageNet data, fine-tuning...
This paper proposes two effective color local texture features, i.e., color local Gabor wavelets (CLGWs) and color local binary pattern (CLBP), for face recognition (FR). This method encodes the discriminative features by combining both color and texture information as well as its fusion approach. To make full use of both color and texture information, the opponent color texture features are used....
Considering the low accuracy of existing traffic sign recognition methods, a new traffic sign recognition method using histogram of oriented gradients - support vector machine (HOG-SVM) and grid search (GS) is proposed. First, the histogram of oriented gradients (HOG) is used to extract the features of traffic signs. Then the grid search technique is applied to optimize the parameters of support...
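The grid search step mentioned in the abstract above can be illustrated with a minimal, library-free sketch. It is not the authors' implementation: `grid_search`, `toy_score`, and the parameter names `C` and `gamma` are assumptions made for illustration, and `toy_score` is a hypothetical stand-in for the cross-validated accuracy of a classifier trained on HOG features.

```python
import itertools
import math


def grid_search(score_fn, param_grid):
    """Score every combination in param_grid and return the best one."""
    names = sorted(param_grid)
    best_params, best_score = None, -math.inf
    # Exhaustively walk the Cartesian product of all candidate values.
    for values in itertools.product(*(param_grid[n] for n in names)):
        params = dict(zip(names, values))
        score = score_fn(**params)
        if score > best_score:
            best_params, best_score = params, score
    return best_params, best_score


def toy_score(C, gamma):
    """Hypothetical score surface that peaks at C=10, gamma=0.01.

    In practice this would train a classifier with the given parameters
    and return its validation accuracy.
    """
    return -((math.log10(C) - 1) ** 2 + (math.log10(gamma) + 2) ** 2)


grid = {"C": [0.1, 1, 10, 100], "gamma": [1e-3, 1e-2, 1e-1]}
best_params, best_score = grid_search(toy_score, grid)
print(best_params)
```

The candidate values are spaced logarithmically, a common choice when the useful range of a parameter spans several orders of magnitude.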
A novel method using "color drop-out" for document images with "color shift" is proposed. Color shift phenomena sometimes occur in document images captured by a camera device or stand-type scanner. They adversely affect the binarization and character recognition processes, because they generate pseudo-color pixels in the scanned image that do not exist in the original document. To...
Text localization in natural scene images is an important prerequisite for many content-based image analysis tasks. In this paper, we propose a novel and effective approach to accurately localize scene text. First, maximally stable extremal regions (MSERs) are extracted as letter candidates. Second, after elimination of non-letter candidates by using geometric information, candidate regions are...
Digital watermarks provide the capability to insert additional information onto various media, such as still images, movies, and audio, by utilizing features of the content. Several methods that use features of the content, such as text or images, have already been proposed for printed documents. To overcome the disadvantages of the existing methods, we have proposed a new information hiding scheme...
It is difficult to analyze how humans perceive deliciousness when looking at served food, because many factors affect the judgment of deliciousness. Humans recognize and evaluate the deliciousness of food from many points of view. In this paper, we propose a method to extract the points of view and factors by which deliciousness is recognized. Images whose resolution was reduced to control the information in the image were...
This paper presents a novel design of visual servo control of a mobile manipulator for autonomous grasping of a target object. In this design, the scale-invariant feature transform (SIFT) algorithm is adopted to search for and recognize the object to grasp. The random sample consensus (RANSAC) algorithm is used to remove outliers and find the refined homography matrix between the database and current images. Robust...
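The outlier-removal idea behind RANSAC can be sketched in a few lines. The abstract above applies it to homography estimation between matched SIFT keypoints; to stay self-contained, the sketch below swaps in a much simpler model, a 2D line, so it illustrates the sample-and-vote loop only. The function name `ransac_line`, its parameters, and the sample data are all assumptions for illustration.

```python
import random


def ransac_line(points, iterations=200, threshold=0.1, seed=0):
    """Return the largest set of points consistent with a single line."""
    rng = random.Random(seed)
    best_inliers = []
    for _ in range(iterations):
        # 1. Draw a minimal sample: two points define a candidate line.
        (x1, y1), (x2, y2) = rng.sample(points, 2)
        # Line in implicit form a*x + b*y + c = 0.
        a, b = y2 - y1, x1 - x2
        norm = (a * a + b * b) ** 0.5
        if norm == 0:
            continue  # the two sampled points coincide
        c = -(a * x1 + b * y1)
        # 2. Collect points whose distance to the line is within threshold.
        inliers = [
            (x, y) for x, y in points
            if abs(a * x + b * y + c) / norm <= threshold
        ]
        # 3. Keep the candidate supported by the most points.
        if len(inliers) > len(best_inliers):
            best_inliers = inliers
    return best_inliers


# Ten points on y = 2x + 1 plus four outliers (illustrative data).
line_points = [(x, 2 * x + 1) for x in range(10)]
outliers = [(2, 9), (5, 0), (7, 3), (1, 8)]
inliers = ransac_line(line_points + outliers)
print(len(inliers))
```

For a homography, the minimal sample would be four point correspondences instead of two points, and the residual would be a reprojection error, but the loop structure is the same.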
The growing usage of mobile camera phones has led to a proliferation of mobile applications. Landmark recognition is one of the mobile applications that have gained more attention in recent years. The main idea of the application is that a user uses a camera phone to capture an image of a landmark or building, and the system then analyzes and identifies it and informs the user of the name of the captured...
Gaussian mixture modeling is a recent approach in texture analysis and is used to model image textures. Texture is modeled using a mixture of Gaussian distributions, which capture the local statistical properties of the texture. The mixture parameters are estimated using the expectation-maximization (EM) algorithm. This algorithm finds the maximum likelihood estimate of the parameters of an underlying distribution...
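As a rough illustration of the EM estimation described above, the following sketch fits a Gaussian mixture to one-dimensional data. It is a simplification under stated assumptions: real texture features are typically multi-dimensional, and the names `em_gmm_1d` and `gaussian_pdf`, the initial values, and the sample data are all hypothetical.

```python
import math


def gaussian_pdf(x, mean, var):
    """Density of a 1D Gaussian with the given mean and variance."""
    return math.exp(-(x - mean) ** 2 / (2 * var)) / math.sqrt(2 * math.pi * var)


def em_gmm_1d(data, means, variances, weights, iterations=50):
    """Run EM for a 1D Gaussian mixture, starting from the given parameters."""
    k = len(means)
    means, variances, weights = list(means), list(variances), list(weights)
    for _ in range(iterations):
        # E-step: responsibility of each component for each data point.
        resp = []
        for x in data:
            p = [weights[j] * gaussian_pdf(x, means[j], variances[j])
                 for j in range(k)]
            total = sum(p)
            resp.append([pj / total for pj in p])
        # M-step: re-estimate parameters from responsibility-weighted data.
        for j in range(k):
            nj = sum(r[j] for r in resp)
            weights[j] = nj / len(data)
            means[j] = sum(r[j] * x for r, x in zip(resp, data)) / nj
            var = sum(r[j] * (x - means[j]) ** 2
                      for r, x in zip(resp, data)) / nj
            variances[j] = max(var, 1e-6)  # floor to avoid a collapsed component
    return means, variances, weights


# Two well-separated clusters (illustrative data).
data = [0.8, 0.9, 1.0, 1.1, 1.2, 4.8, 4.9, 5.0, 5.1, 5.2]
means, variances, weights = em_gmm_1d(
    data, means=[0.0, 6.0], variances=[1.0, 1.0], weights=[0.5, 0.5]
)
print(means, weights)
```

EM only finds a local maximum of the likelihood, so the result depends on the initial parameters; the variance floor is one simple safeguard against degenerate solutions.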
Because pornographic videos often contain segments of reciprocating human body motion, a novel method of pornographic video detection is proposed. After calculating the optical flow field, we extract the feature points of the moving target, classify the optical flow directions of these points, and compute their statistics. Then we establish the optical flow...
This paper proposes a car recognition algorithm based on multiple contour features. First, the Canny operator is used to extract the edges of the image. Afterwards, an image pyramid is used to shrink the image so that the contour is single-edged and complete. Then, starting from a point on the contour, the whole contour is traversed to obtain the Fourier descriptors and direction ratio from the traversal sequence. At...
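The Fourier descriptor step named in the abstract above can be sketched as follows. This is one common formulation, not necessarily the one used in the paper: contour points are treated as complex numbers, and the normalization shown here (drop the zeroth coefficient, divide by the magnitude of the first) is an assumption, as are the function name `fourier_descriptors` and the sample contour. The sketch also assumes a counterclockwise traversal, for which the first coefficient is the dominant one.

```python
import cmath


def fourier_descriptors(contour, num_coeffs):
    """Normalized Fourier descriptor magnitudes of a closed contour.

    contour: list of (x, y) points in traversal order.
    """
    n = len(contour)
    if n < num_coeffs + 2:
        raise ValueError("contour too short for the requested descriptors")
    z = [complex(x, y) for x, y in contour]
    # Discrete Fourier transform of the complex boundary sequence.
    coeffs = [
        sum(z[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n)) / n
        for k in range(n)
    ]
    scale = abs(coeffs[1])
    if scale == 0:
        raise ValueError("first harmonic is zero; cannot normalize")
    # Skipping coeffs[0] removes translation, dividing by |coeffs[1]| removes
    # scale, and taking magnitudes removes rotation and start-point effects.
    return [abs(coeffs[k]) / scale for k in range(2, 2 + num_coeffs)]


# A square traversed counterclockwise (illustrative data).
square = [(0, 0), (1, 0), (2, 0), (2, 1), (2, 2), (1, 2), (0, 2), (0, 1)]
# The same shape scaled by 3 and shifted by (10, -4).
moved = [(3 * x + 10, 3 * y - 4) for x, y in square]

d_square = fourier_descriptors(square, 4)
d_moved = fourier_descriptors(moved, 4)
print(d_square)
```

Because the two contours differ only by scale and translation, their descriptors agree up to floating-point error, which is what makes such descriptors usable as shape features.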
We propose a novel building detection algorithm for processing high-resolution aerial images. Our algorithm exploits the building-shadow geometric relationship according to lighting models, making it suitable to detect buildings in a more general setting, possibly with irregular shapes. We use image segmentation to provide spatial support for both building and shadow detections. A novel confidence...
Mobile robots typically operate in environments where objects of interest are likely to appear as mixtures of colors and textures with complex outlines. To use color or multispectral imagery for identification and decision-making, systems that can quickly be trained by example to recognize such objects have distinct advantages. Two examples are shown of the use of WAY-2C, a system for color-based...
Traffic sign recognition is a technology which allows us to recognize signs in real time, typically in videos, or sometimes just (off-line) in photos. It is used for Driver Assistance Systems (DAS), road surveys, or the management of road assets (to improve road safety). In this paper, we propose a method for general traffic sign recognition (tested for the New Zealand road signs) which combines previously...
The application of geoagent to RS images is studied in this paper. Samples of carbonate rocks were scanned into rock images. By analysing these samples of carbonate rocks, a new arithmetic model of geoagent was chosen, and a standard curve of carbonate rocks can be obtained from the arithmetic model. RS images were divided into grids. Curves are computed by the arithmetic model in the grids. The standard curve...
At present, more and more traditional Chinese paintings (TCPs) have been digitized and exhibited on the Internet. How to effectively browse and retrieve these images has emerged as a hot topic. Most existing algorithms typically use global low-level visual features of color and texture to describe the content of traditional Chinese paintings, omitting the semantic information. As...
Enterprise brand image and product image are like twins in market competition. The enterprise brand image acts as a guide for the product image, and the product image acts as a support for the enterprise brand image; that is, they develop in coordination and influence each other. The enterprise can survive in the fierce competition only by giving consideration to both of them, placing equal emphasis on both of them...
After PCA pre-processing, rough set theory was introduced for the reduction of image feature attributes, and its application to attribute optimization of characteristic parameters was explored. The combination of these two methods was effective in reducing the unnecessary attributes. The novel algorithm could also decrease the complexity of CBIR's inner redundancy. The experimental result of attribute reduction...