The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
This paper presents a comparative study of two recent word spotting techniques ([1] and [2]) directly in the run-length compressed domain. The first technique is based on partial decompression and limited usage of OCR, and the second technique is completely decompression-less and OCR-less. Both the word spotting techniques use word bounding box ratio feature initially for matching words in the database...
Text in natural scenes provides many information for peoples and presents an essential tool to interact with their environment. Therefore, recognizing text existing in camera-captured images has become an important issue for many researches in the last decades. Currently, there isn't any available dataset of Arabic script text images in the wild. Since our aim is to help the research community in...
As vast amount of digital image data is stored by the advanced libraries, there is a requirement for an efficient query word searching methodologies which can make them accessible according to user's requirement. For their accurate retrieval, it is essential to understand their contents. Present technologies for optical character recognition (OCR) and image document analysis do not handle such documents...
Handwritten character recognition has been emerging topic studied in the last half century and shape up to the level which is sufficient to develop a technology driven application. Now the rapidly increase in the computation power, CR creates an increasing demand for new emerging applications, which require more advanced methodologies. The problem of character segmentation and its recognition in India...
Handwritten Character Recognition is the capability of a computer to receive and interpret handwritten input from paper documents, photographs, touch screens and other devices. In this paper we have introduced a new method for Hindi handwritten character segmentation. It consists of a novel approach segmentation line, word and character using depth first search on the distance metric of connected...
The aim of this paper is to develop a system that involves character recognition of Brahmi, Grantha and Vattezuthu characters from palm manuscripts of historical Tamil ancient documents, analyzed the text and machine translated the present Tamil digital text format. Though many researchers have implemented various algorithms and techniques for character recognition in different languages, ancient...
This paper is intended to support the preservation of national cultural asset, particularly for ancient symbols. By using image processing principle, an automatic system that can be designed and implemented to translate ancient manuscript documents. The system is composed of several phases, from scanning, preprocessing, segmentation, feature extraction and classification. Sample images of the document...
Vehicle Plate Recognition (VPR) algorithm in images and videos usually consists of the following three steps: 1) Region extraction of the plate (plate localization), 2) characters segmentation of the plate 3) Recognition of each character. This paper presents new methods for real-time plate recognition in each step. We used a Detector for the Blue Area (DBA) to locate the plate, Averaging of White...
It is thought that a large quantity of data improve quality of recognition. A large database, however, is not easy to obtain. The hardest task is labeling (also known as ground truthing), which usually requires human intervention. Since labeling by human is laborious and costly, labeling without human (automatic labeling) or minimization of human intervention (semi-automatic labeling) are ideal scenarios...
Automatic Number Plate Recognition or ANPR is a mass surveillance method that uses optical character recognition on images to read the number plates on vehicles. This system is designed with a neural network which is trained to recognize all the characters that can be found in an Indian Standard High Security Number Plate and is implemented using MATLAB.
This paper presents a new method for reconstructing degraded, or broken, digits. The proposed method uses inertia based techniques to exam the digit's stroke where degradation may be found and then extrapolate the stroke in order to reconstruct the digit. The main goal is to create strokes that remain as natural as possible, maintaining digit integrity. Experiments were performed using an artificially...
In this paper, an approach of hand-printed English character recognition is developed based on Fuzzy theory. The approach has two main functions, which are feature extraction and pattern recognition. Applying the feature extraction method carries out the feature properties that is used in identification process. For the pattern recognition, the Fuzzy theory is adopted to deal with the fuzzy patterns...
This paper introduces a pair of online and offline Chinese handwriting databases, containing samples of isolated characters and handwritten texts. The samples were produced by 1,020 writers using Anoto pen on papers for obtaining both online trajectory data and offline images. Both the online samples and offline samples are divided into six datasets, three for isolated characters (DB1.0-C1.2) and...
This paper presents a conditional random field (CRF) model for aligning online handwritten Chinese/Japanese text lines (character strings) with the corresponding transcripts. The CRF model is defined on a lattice which contains all possible segmentation hypotheses. The feature functions characterize the shape and context dependences of characters, including the scores of character recognition and...
Research towards Indian handwritten document analysis achieved increasing attention in recent years. In pattern recognition and especially in handwritten document recognition, standard databases play vital roles for evaluating performances of algorithms and comparing results obtained by different groups of researchers. For Indian languages, there is a lack of standard database of handwritten texts...
Calligraphic data entry is accelerated by generating, with a feature-based character classifier, an ordered list of reference candidate labels for each character image. The improvement of labeling throughput depends on the top-N accuracy of the classifier, which in turn is a function of the available already-labeled patterns. Experiments on a database of 13,351 ancient calligraphic characters indicate...
License plate detection and recognition is one of the most important aspects of applying computer techniques towards intelligent transportation systems. Detecting the accurate location of a license plate from a vehicle image is the most crucial step of a license plate detection system. This paper describes a proposing of a new region-based license plate detection method based on a symbol analysis...
Handwritten character recognition is the key technique in correcting assignment system as well as development of aided instruction software. Considering the disparity in distribution of the pixel, we propose a distribution-based algorithm for handwritten character recognition. Based on the theory of Image Segmentation, the centroid of a character can be found. Around this centroid, the image is divided...
This research is conducted to accommodate the needs of visually impaired people through an intelligent system, which reads textual information on papers and produces corresponding voice. Indonesian Automated Document Reader (I-ADR) is operated via a voice-based user interface to scan a document page. Textual information from the scanned page is then extracted using Optical Character Recognition (OCR)...
India is a multilingual and multi-script country where a line of a bilingual document page may contain text words both in regional language and in English. Recognition of documents containing multi-scripts is really a challenging task, which needs more effort of the OCR designers for improving the accuracy rate. This paper presents a Bilingual OCR system for printed Malayalam and English text. Here...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.