The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
To reduce the classification errors of online handwritten Japanese character recognition, we propose a method for confusing characters discrimination with little additional costs. After building confusing sets by cross validation using a baseline quadratic classifier, a logistic regression (LR) classifier is trained to discriminate the characters in each set. The LR classifier uses subspace features...
The hierarchical nature of Chinese characters has inspired radical-based recognition, but radical segmentation from characters remains a challenge. We previously proposed a radical-based approach for on-line handwritten Chinese character recognition, which incorporates character structure knowledge into integrated radical segmentation and recognition, and performs well on characters of left-right...
This paper presents a text query-based method for keyword spotting from online Chinese handwritten documents. The similarity between a text word and handwriting is obtained by combining the character similiarity scores given by a character classifier. To overcome the ambiguity of character segmentation, multiple candidates of character patterns are generated by over-segmentation, and sequences of...
The HMM-based segmentation-free strategy for Chinese handwriting recognition has the advantage of training without annotation of character boundaries. However, the recognition performance has been limited by the small number of string samples. In this paper, we explore two techniques to improve the performance. First, Delta features are added to the static ones for alleviating the conditional independence...
The alignment of text line images with text transcript is a crucial step of handwritten document annotation. Handwritten text alignment is prone to errors due to the difficulty of character segmentation and the variability of character shape, size and position. In this paper, we propose to incorporate the geometric context of character strings to improve the alignment accuracy for offline handwritten...
The splitting of touching characters remains a challenge in over-segmentation, which is crucial to the performance of integrated segmentation-recognition of handwritten character strings. In this paper, we propose a new method based on contour analysis for touching character splitting in Chinese handwriting. To reliably locate splitting points on the contour of touching pattern, we pair upper and...
Chinese handwriting recognition remains a challenge. Research works have reported very high accuracies on neatly handwritten characters yet the performance on unconstrained handwriting remains quite low. To promote the recognition technology, new databases of unconstrained handwriting have been constructed for academic research and benchmarking. This paper reports the contest results of online and...
This paper presents a radical-based on-line handwritten Chinese character recognition method, which integrates appearance-based radical recognition and geometric context into a principled framework using a character-radical dictionary to guide radical segmentation and recognition during path search. To solve the connection between radicals, we detect corner points to extract sub-strokes. Based on...
This paper describes a method of online handwritten Japanese text recognition by improved path evaluation. Based on a theoretical ground, the method evaluates the likelihood of candidate segmentation paths by combining scores of character pattern size, inner gap, character recognition, single and pair character position, candidate segmentation point and linguistic context, with the weight parameters...
Text line segmentation in unconstrained handwritten documents remains a challenge because handwritten text lines are multi-skewed and not obviously separated. This paper presents a new approach based on the variational Bayes (VB) framework for text line segmentation. Viewing the document image as a mixture density model, with each text line approximated by a Gaussian component, the VB method can automatically...
This paper describes a publicly available database, CASIA-OLHWDB1, for research on online handwritten Chinese character recognition. This database is the first of our series of online/offline handwritten characters and texts, collected using Anoto pen on paper. It contains unconstrained handwritten characters of 4,037 categories (3,866 Chinese characters and 171 symbols) produced by 420 persons, and...
This paper describes an online handwritten Japanese character string recognition system based on conditional random fields, which integrates the information of character recognition, linguistic context and geometric context in a principled framework, and can effectively overcome the variable length of candidate segmentation. For geometric context, we employ both unary and binary feature functions,...
Annotating the regions, text lines and characters of document images is an important, but tedious and expensive task. A ground-truthing tool may largely alleviate the human burden in this process. This paper describes an automated recognition-based tool GTLC for finding the best alignment between the text transcript and the connected components of unconstrained handwritten document image. The alignment...
This paper describes a system for handwritten Chinese text recognition integrating language model. On a text line image, the system generates character segmentation and word segmentation candidates, and the candidate paths are evaluated by character recognition scores and language model. The optimal path, giving segmentation and recognition result, is found using a pruned dynamic programming search...
Directional features are preferred in off-line Chinese character recognition due to their superior performance. This paper proposes an enhanced four plane feature (en-FPF) within a segmentation-free recognition framework. First, the directional planes are strengthened by replenishing salient pixels. Second, the method to count perpendicular strokes are renewed. In experiments of realistic Chinese...
The accuracy of handwritten Chinese character recognition can be improved by pair discrimination of similar characters. In this paper, we propose a new method for combining the baseline classifier with incomplete pair discriminators to better exploit their complementariness. The outputs of the baseline classifier and pair discriminators are transformed to two-class probabilities, which are then fused...
This paper proposes a new radical-based approach for online handwritten chinese character recognition. The approach is novel in three respects: statistical classification of radicals, over-segmentation of characters into candidate radicals, and lexicon-driven recognition of characters. Currently, we have applied the approach to Chinese characters of left-right structure and are extending to other...
Text line extraction from unconstrained handwritten documents is a challenge because the text lines are often skewed and curved and the space between lines is not obvious. To solve this problem, we propose an approach based on minimum spanning tree (MST) clustering with new distance measures. First, the connected components of the document image are grouped into a tree by MST clustering with a new...
In this paper, we propose a linear discriminant analysis (LDA)-based compound distance measure for discriminating similar characters in handwritten Chinese character recognition. The previous compound Mahalanobis function (CMF) is shown to be a special case of the proposed method. On finding similar character pairs by cross-validation using a baseline classifier, LDA is applied to each similar pair,...
This paper describes an online handwritten Japanese character string recognition system integrating scores of geometric context, character recognition, and linguistic context. We give a string evaluation criterion for better integrating the multiple scores while overcoming the effect of string length variability. For measuring geometric context, we propose a statistical method for modeling both single-...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.