Search results

Items from 1 to 20 out of 72 results

article

Text Detection, Tracking and Recognition in Video: A Comprehensive Survey

Xu-Cheng Yin, Ze-Yu Zuo, Shu Tian, Cheng-Lin Liu

IEEE Transactions on Image Processing > 2016 > 25 > 6 > 2752 - 2773

The intelligent analysis of video data is currently in wide demand because a video is a major source of sensory data in our lives. Text is a prominent and direct source of information in video, while the recent surveys of text detection and recognition in imagery focus mainly on text extraction from scene images. Here, this paper presents a comprehensive survey of text detection, tracking, and recognition...

chapter

Food image recognition using deep convolutional network with pre-training and fine-tuning

Keiji Yanai, Yoshiyuki Kawano

2015 IEEE International Conference on Multimedia & Expo Workshops (ICMEW) > 1 - 6

2015 IEEE International Conference on Multimedia & Expo Workshops (ICMEW)

In this paper, we examined the effectiveness of deep convolutional neural network (DCNN) for food photo recognition task. Food recognition is a kind of fine-grained visual recognition which is relatively harder problem than conventional image recognition. To tackle this problem, we sought the best combination of DCNN-related techniques such as pre-training with the large-scale ImageNet data, fine-tuning...

chapter

Face recognition by using Gabor and LBP

Priyanka V. Bankar, Anjali C. Pise

2015 International Conference on Communications and Signal Processing (ICCSP) > 45 - 48

2015 International Conference on Communications and Signal Processing (ICCSP)

This paper proposes two effective color local texture features, i.e., color local Gabor wavelets (CLGWs) and color local binary pattern (CLBP), for face recognition (FR).This method encodes the discriminative features by combining both color and texture information as well as its fusion approach. To make full use of both color and texture information, the opponent color texture features are used....

chapter

Traffic sign recognition using HOG-SVM and grid search

Chang Yao, Feng Wu, Hou-jin Chen, Xiao-li Hao, more

2014 12th International Conference on Signal Processing (ICSP) > 962 - 965

2014 12th International Conference on Signal Processing (ICSP 2014)

Considering the lower accuracy of existing traffic sign recognition methods, a new traffic sign recognition method using histogram of oriented gradient - support vector machine (HOG-SVM) and grid search (GS) is proposed. First, the histogram of oriented gradient (HOG) is used to extract the characteristics of traffic sign. Then the grid search technique is applied to optimize the parameters of support...

chapter

Color Drop-Out Binarization Method for Document Images with Color Shift

Minenobu Seki, Eisuke Asano, Tsukasa Yasue, Hiroto Nagayoshi, more

2013 12th International Conference on Document Analysis and Recognition > 123 - 127

2013 12th International Conference on Document Analysis and Recognition (ICDAR)

A novel method using "color drop-out" for document images with "color shift" is proposed. Color shift phenomena sometimes occur in document images captured by a camera device or stand type scanner. It adversely affects the binarization and character recognition processes, because it generates pseudo color pixels on scanned image, which do not exist on the original document. To...

chapter

Effective text localization in natural scene images with MSER, geometry-based grouping and AdaBoost

Xuwang Yin, Xu-Cheng Yin, Hong-Wei Hao, Khalid Iqbal

Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012) > 725 - 728

2012 21st International Conference on Pattern Recognition (ICPR)

Text localization in natural scene images is an important prerequisite for many content-based image analysis tasks. In this paper, we proposed a novel and effective approach to accurately localize scene texts. Firstly, Maximally stable extremal regions(MSER) are extracted as letter candidates. Secondly, after elimination of non-letter candidates by using geometric information, candidate regions are...

chapter

Information Hiding Based on the Artificial Fiber Pattern with Improved Robustness against Foregound Objects

Kitahiro Kaneda, Yuta Kito, Keiichi Iwamura

2011 Seventh International Conference on Intelligent Information Hiding and Multimedia Signal Processing > 322 - 329

2011 Seventh International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP)

Digital watermarks provide the capability to insert additional information onto various media, such as still images, movies, and audio, by utilizing features of the content. Several methods that use features of the content, such as text or images, have already been proposed for printed documents. To overcome the disadvantages of the existing methods, we have proposed a new information hiding scheme...

chapter

A deliciousness information extraction method by controlling of image information

S Nohara, K Kato, K Yamamoto, W Yoshimura, more

2011 17th Korea-Japan Joint Workshop on Frontiers of Computer Vision (FCV) > 1 - 6

2011 17th Korea-Japan Joint Workshop on Frontiers of Computer Vision (FCV 2011)

It is difficult to analyze how human feels deliciousness by seeing served food because many factors affect to judgment of deliciousness. Human recognizes and evaluates deliciousness of the food from many points of view. In this paper, we propose a method to extract the points of view and factors that recognize deliciousness. Images which are reduced resolution to control information of the image were...

chapter

Robust feature extraction and control design for autonomous grasping and mobile manipulation

Kai-Tai Song, Che-Hao Chang, Chia-How Lin

2010 International Conference on System Science and Engineering > 445 - 450

2010 International Conference on System Science and Engineering (ICSSE 2010)

This paper presents a novel design of visual servo control of a mobile manipulator for autonomous grasping of a target object. In this design, scale invariant feature transform (SIFT) algorithm is adopted to search and recognize the object to grasp. Random sample consensus (RANSAC) algorithm is used to remove outliers and find the refined homography matrix between database and current image. Robust...

chapter

A multi-scale learning approach for landmark recognition using mobile devices

Tao Chen, Zhen Li, Kim-Hui Yap, Kui Wu, more

2009 7th International Conference on Information, Communications and Signal Processing (ICICS) > 1 - 4

2009 7th International Conference on Information, Communications & Signal Processing (ICICS)

The growing usage of mobile camera phones has led to proliferation of many mobile applications. Landmark recognition is one of the mobile applications that are gaining more attention in recent years. The main idea of the application is that a user will use a camera phone to capture the image of a landmark or building and then the system will analyze, identify, and inform the user the name of the captured...

chapter

Texture analysis based on Gaussian mixture modeling

T. Sobha, S. Remya

2009 World Congress on Nature&Biologically Inspired Computing (NaBIC) > 1436 - 1440

2009 World Congress on Nature & Biologically Inspired Computing (NaBIC 2009)

Gaussian mixture modeling is a recent approach in texture analysis and is used to model image textures. Texture is modeled using a mixture of Gaussian distributions, which capture the local statistical properties of the texture. The mixture parameters are estimated using Expectation Maximization algorithm. This algorithm finds the maximum likelihood estimate of the parameters of an underlying distribution...

chapter

A Pornographic Videos Detection Method Based on Optical Flow Direction's Statistical Histogram

Zhi-yi Qu, Ying Liu, Yan-min Liu, Lin-na Zhang

2009 International Symposium on Computer Network and Multimedia Technology > 1 - 4

2009 International Symposium on Computer Network and Multimedia Technology (CNMT 2009)

Aimed at that there is often a video paragraph of human body's reciprocating motion in the pornographic videos, a novel method of pornographic videos detection is proposed. On the basis of calculating the optical flow field, we extract the characteristics points of the moving target, and classify optical flow direction of these points and get the statistics of them. Then we establish the optical flow...

chapter

Contour Based Car Recognition Algorithm

Yi Lu, Bo Cai, Dengyi Zhang

2009 International Symposium on Computer Network and Multimedia Technology > 1 - 4

2009 International Symposium on Computer Network and Multimedia Technology (CNMT 2009)

This paper proposes a car recognition algorithm based on multiple features of contour. Firstly, canny operator is used to get the edge of the image. Afterwards, image pyramid is used to shrink the image so that the contour will be single-edged and complete. Then, from a point in the contour, travel the whole contour to gain the Fourier descriptors and direction ratio from the traversal sequence. At...

chapter

Building recognition from aerial images combining segmentation and shadow

Keyan Ren, Hanxu Sun, Qingxuan Jia, Jianbo Shi

2009 IEEE International Conference on Intelligent Computing and Intelligent Systems > 4 > 578 - 582

2009 IEEE International Conference on Intelligent Computing and Intelligent Systems (ICIS 2009)

We propose a novel building detection algorithm for processing high-resolution aerial images. Our algorithm exploits the building-shadow geometric relationship according to lighting models, making it suitable to detect buildings in a more general setting, possibly with irregular shapes. We use image segmentation to provide spatial support for both building and shadow detections. A novel confidence...

chapter

When trees are not green: Recent developments in an off-the-shelf system for robust color and multispectral based recognition and robot control

R.K. McConnell

2009 IEEE International Conference on Technologies for Practical Robot Applications > 204 - 209

2009 IEEE International Conference on Technologies for Practical Robot Applications. TePRA 2009

Mobile robots typically operate in environments where objects of interest are likely to appear as mixtures of colors and textures with complex outlines. To use color or multispectral imagery for identification and decision-making, systems that can quickly be trained by example to recognize such objects have distinct advantages. Two examples are shown of the use of WAY-2C, a system for color-based...

chapter

General traffic sign recognition by feature matching

FeiXiang Ren, Jinsheng Huang, Ruyi Jiang, R. Klette

2009 24th International Conference Image and Vision Computing New Zealand > 409 - 414

2009 24th International Conference Image and Vision Computing New Zealand (IVCNZ 2009)

Traffic sign recognition is a technology which allows us to recognize signs in real time, typically in videos, or sometimes just (off-line) in photos. It is used for Driver Assistance Systems (DAS), road surveys, or the management of road assets (to improve road safety). In this paper, we propose a method for general traffic sign recognition (tested for the New Zealand road signs) which combines previously...

chapter

The Application of Geoagent in RS Image

Honghai Kuang, Junhua Chen

2009 International Conference on Artificial Intelligence and Computational Intelligence > 2 > 449 - 451

2009 International Conference on Artificial Intelligence and Computational Intelligence (AICI 2009)

The application of geoagent in RS image have been studied in the paper. Samples of carbonate rocks were scanned into rock images. By analysing these samples of carbonate rocks, a new arithmetic model of geoagent was chosed and a standard curve of carbonate rocks by the arithmetic model can be gotten. Rs images were divided into grids. There are curves by the arithmetic in grids. The standard curve...

chapter

An effective method to detect seal images from traditional Chinese paintings

Hong Bao, De Xu, Songhe Feng

2009 International Conference on Wireless Communications&Signal Processing > 1 - 4

2009 International Conference on Wireless Communications & Signal Processing

At present, more and more traditional Chinese paintings (TCPs) have been digitized and exhibited on the Internet. How to effectively brose and retrieve these images have emerged as a hot topic. Most existing algorithms typically use the global color and texture low-level visual features to describe the content of the traditional Chinese paintings, where the semantic information has been omitted. As...

chapter

Analysis on the relation between enterprise brand image and product image

Haohua Zheng, Mingye Wang

2009 IEEE 10th International Conference on Computer-Aided Industrial Design&Conceptual Design > 1826 - 1828

2009 IEEE 10th International Conference on Computer-Aided Industrial Design & Conceptual Design. E-Business, Creative Design, Manufacturing. (CAID&CD 2009)

Enterprise brand image and product image likes twins in market competition. The enterprise brand image acts as a guide for product image and the product image as a support for the enterprise brand image, that is, they coordinately develop and influence each other. The enterprise can survive in the fierce competition only by giving consideration to both of them, placing equal emphasis on both of them...

chapter

Application of Rough Set in Image's Feature Attributes Reduction

Sun Yingkai, Chen Hai

2009 2nd International Congress on Image and Signal Processing > 1 - 4

2009 2nd International Congress on Image and Signal Processing (CISP)

After PCA pre-processing, rough set theory was introduced in image's feature attributes reduction, and its application in characterized parameters' attribute optimization was explored. The combination of these two methods was effective in reducing the unnecessary attributes. The novel algorithm could also decrease the complexity of CBIR's inner redundancy. The experimental result of attribute reduction...

Data set:
ieee
Keywords:
IMAGE COLOR ANALYSIS
IMAGE RECOGNITION
DATA MINING
Publication language:
English

Publication date

Set your own date range

INFONA - science communication portal

Search results

Text Detection, Tracking and Recognition in Video: A Comprehensive Survey

Food image recognition using deep convolutional network with pre-training and fine-tuning

Face recognition by using Gabor and LBP

Traffic sign recognition using HOG-SVM and grid search

Color Drop-Out Binarization Method for Document Images with Color Shift

Effective text localization in natural scene images with MSER, geometry-based grouping and AdaBoost

Information Hiding Based on the Artificial Fiber Pattern with Improved Robustness against Foregound Objects

A deliciousness information extraction method by controlling of image information

Robust feature extraction and control design for autonomous grasping and mobile manipulation

A multi-scale learning approach for landmark recognition using mobile devices

Texture analysis based on Gaussian mixture modeling

A Pornographic Videos Detection Method Based on Optical Flow Direction's Statistical Histogram

Contour Based Car Recognition Algorithm

Building recognition from aerial images combining segmentation and shadow

When trees are not green: Recent developments in an off-the-shelf system for robust color and multispectral based recognition and robot control

General traffic sign recognition by feature matching

The Application of Geoagent in RS Image

An effective method to detect seal images from traditional Chinese paintings

Analysis on the relation between enterprise brand image and product image

Application of Rough Set in Image's Feature Attributes Reduction

Filter options

Publication date

Content availability

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options