The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
In this paper, we propose a novel approach for writer identification using codebook generation based on text skeletonization.Unlike other schemes, the skeleton in this approach is segmented at its junction pixels into elementary graphic units called graphemes. The codebook is generated by clustering the graphemes according to their distributions into a predefined grid. This method has been evaluated...
Optical Character Recognition alludes to the methodology of taking images or photos of letters or typewritten content and changing over them into information that a machine can easily interpret, e.g. organizations and libraries taking physical duplicates of books, magazines, or other old printed material and utilizing OCR to put them into computers. Segmentation is the indispensable and most difficult...
This paper describes the use of a novel A path-planning algorithm for performing line segmentation of handwritten documents. The novelty of the proposed approach lies in the use of a smart combination of simple soft cost functions that allows an artificial agent to compute paths separating the upper and lower text fields. The use of soft cost functions enables the agent to compute near-optimal separating...
Bangla handwritten character recognition is one of the complex works because of the wide variation of the Bangla character. In this paper we proposed a new approach for extracting the features of Bangla handwritten characters and then recognition of those characters using artificial neural network has done. For the feature extraction process we have used a row and column basis segmentation process...
Offline writer identification is widely applied in various research areas. Forensic document analysis is a typical motivation in which researchers strive for the best possible performance using limited amount of information. This paper is concerned with offline, text-dependent writer identification. Three techniques are analyzed and evaluated on different levels of analysis using different scripts...
Strokes are the most natural way of describing the character formation; although most of the researcher uses transform based features for recognition of offline text. The stroke base features are very popular in online handwritten text recognition because it is easy to identify stroke and its sequence by tracing pen tip, whereas it is difficult to obtain the same information in the offline text. The...
This paper presents an innovative technique to recognize Handwritten Articles. Proposed system is called "Panhinda". The target user group for this application would be the people who are involved with a lot of paper work on a daily basis. The proposed Character Recognition system was implemented with the capability of extracting the content of an image where the mentioned content is a hand...
Writer recognition based on peculiarity of hand-writing is an important aspect of any forensic analysis. We present an approach for selecting best discriminative primitives for writer recognition. After selecting the primitives we also propose a hybrid system by combining both writer recognition and handwriting recognition for improved accuracy. We have also validated the performance of selected primitives...
Although there are some reports on offline Tamil isolated handwritten character recognition, to our knowledge there is only two reports on Tamil off-line handwritten word recognition. Also no city name dataset is available for Tamil script. In this paper we present a Tamil offline city name dataset, we developed, and propose a scheme for recognition. Because of the different writing style of various...
Most of the algorithms proposed for text line detection are designed to process binary images as input. For severely degraded documents, binarization often introduces significant noise and other artifacts. In this work we present a novel method designed to detect text lines directly in gray scale images. The method consists of two stages. Potential characters are detected in the first stage. This...
Recognition of Bangla compound characters has rarely got attention from researchers. This paper deals with segmentation and recognition of online handwritten Bangla cursive text containing basic and compound characters and all types of modifiers. Here, at first, we segment cursive words into primitives. Next primitives are recognized. A primitive may represent a character/compound character or a part...
This paper presents a robust lexicon reduction technique using segment descriptors for Arabic handwritten text. The method segments an Arabic word into graphemes and adaptively generates a descriptor of the presence/absence of dots in those segments. The segmentation algorithm is based on the characteristic of Arabic script, which indicates predictable segmentations of Arabic characters. This in turn...
Segmentation of handwritten Bangla script is one of the most critical areas of the Optical Character Recognition System. Paying attention on the various writing style of different individuals we propose an efficient scheme to segment unconstrained handwritten Bangla script into lines, words and characters. At First for Line Segmentation, we divide the whole script into column segment. These segments...
An automated system capable of recognizing responses for questionnaires and entering them into the database will be very useful in many subjects. Entering data manually is time consuming. Thus, the purpose of the research is to automate the manual data entry process. Through this research, a new clustering method to cluster printed and handwritten words, and character recognition method to identify...
As is well known, good segmentation is one reason for high accuracy of character recognition; this paper proposes and investigates a new technique for segmentation of handwritten Arabic scripts. A new Arabic heuristic segmenter (AHS) has been implemented. The AHS employs three new features to locate a Prospected Segmentation Point (PSP) based on shape of the word image, first, remove the punctuation...
Arabic Pattern recognition can be regarded as a problem of classification, where different patterns are presented and be needed to classify into specified classes. One way to improve the recognition rates of pattern recognition tasks is to improve the accuracy of individual classifiers, and another is to apply ensemble of classifiers methods. The advantage of dynamic ensemble selection vs dynamic...
Segmentation of unconstrained handwritten word into different zones (upper middle and lower) and characters is more difficult than that of printed documents. This is mainly because of variability in inter-character distance, skew, slant, size and curved like handwriting. Sometimes components of two consecutive characters may be touched or overlapped and this situation complicates the segmentation...
Text lines in free-style handwritten documents are often curved, touch or overlap with each other, which presents a challenge for text line segmentation. In this paper, we proposed a novel text line segmentation method that utilizes the advantages of algorithms in both the small scale and large scale. A path is dynamically detected between each pair of neighboring text lines to separate them. During...
Although for postal automation there are many pieces of work towards street name recognition on non-Indian languages, to the best of our knowledge there is no work on street name recognition on Indian languages. In this paper we proposed a scheme for recognition of Indian street name written in Bangla script. Because of the writing style of different individuals some of the characters in a street...
For historical documents, available transcriptions typically are inaccurate when compared with the scanned document images. Not only the position of the words and sentences are unknown, but also the correct image transcription may not be matched exactly. An error-tolerant alignment is needed to make the document images amenable to browsing and searching in digital libraries. In this paper, we propose...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.