Search results

Items from 1 to 4 out of 4 results

chapter

Integrating Geometric Context for Text Alignment of Handwritten Chinese Documents

Fei Yin, Qiu-Feng Wang, Cheng-Lin Liu

2010 12th International Conference on Frontiers in Handwriting Recognition > 7 - 12

2010 12th International Conference on Frontiers in Handwriting Recognition (ICFHR 2010)

The alignment of text line images with text transcript is a crucial step of handwritten document annotation. Handwritten text alignment is prone to errors due to the difficulty of character segmentation and the variability of character shape, size and position. In this paper, we propose to incorporate the geometric context of character strings to improve the alignment accuracy for offline handwritten...

chapter

Computer Assisted Transcription of Text Images: Results on the GERMANA Corpus and Analysis of Improvements Needed for Practical Use

Verónica Romero, Alejandro H Toselli, Enrique Vidal

2010 20th International Conference on Pattern Recognition > 2017 - 2020

2010 20th International Conference on Pattern Recognition (ICPR 2010)

We present a study of the application of Computer Assisted Transcription of Text Images (CATTI) to a task which is much closer to real applications than other tasks previously studied. The new task consists in the transcription of a new publicly available historic handwritten document, called GERMANA. A detailed analysis of the main factors influencing the system performance are exposed and some strategies...

chapter

Adaptive Correction of Errors from Segmented Digital Ink Texts in Chinese Based on Context

Xi-Wen Zhang, Wei-Hua An, Yong-Gang Fu

2010 Second International Conference on Information Technology and Computer Science > 25 - 35

2010 2nd International Conference on Information Technology and Computer Science (ITCS 2010)

Digital ink texts in Chinese can neither be converted into users' desired layouts nor be recognized until their characters, lines, and paragraphs are correctly extracted. There are many errors in automatically segmented digital ink texts in Chinese because they are free forms and mixed with other languages, as well as their Chinese characters have small gaps and complex structures. Paragraphs, lines,...

chapter

Monothetic separation of Telugu, Hindi and English text lines from a multi script document

M.C. Padma, P.A. Vijaya

2009 IEEE International Conference on Systems, Man and Cybernetics > 4870 - 4875

2009 IEEE International Conference on Systems, Man and Cybernetics. SMC 2009

In a multi-script multi-lingual environment, a document may contain text lines in more than one script/language forms. It is necessary to identify different script regions of the document in order to feed the document to the OCRs of individual language. With this context, this paper proposes to develop a monothetic algorithmic model to identify and separate text lines Telugu, Hindi and English scripts...

Filter options

Data set:
ieee
Keywords:
CONTEXT
FEATURE EXTRACTION
TEXT ANALYSIS
DOCUMENT IMAGE PROCESSING

Publication date

Set your own date range

Keywords

HANDWRITTEN CHARACTER RECOGNITION (2)
HANDWRITTEN DOCUMENT (2)
HIDDEN MARKOV MODELS (2)
IMAGE SEGMENTATION (2)
TRAINING (2)
ADAPTIVE ERROR CORRECTION (1)
ALIGNMENT ACCURACY (1)
ANNOTATION (1)
BETWEEN-CHARACTER RELATIONSHIPS (1)
CHARACTER RECOGNITION (1)
CHARACTER RECOGNIZER (1)
CHARACTER SEGMENTATION (1)
CHARACTER STRINGS (1)
CHINESE TEXT (1)
COLOR (1)
COMPUTER ASSISTED TRANSCRIPTION (1)
CONTEXT APPROACH (1)
CORRECTION (1)
DATA MINING (1)
DIGITAL INK TEXT (1)
DIGITAL INK TEXT SEGMENTATION (1)
ENGLISH TEXT LINE (1)
ERBIUM (1)
GEOMETRIC CONTEXT (1)
GEOMETRIC FEATURES (1)
GEOMETRIC MODELS (1)
GEOMETRY (1)
GERMANA CORPUS (1)
HANDWRITING RECOGNITION (1)
HANDWRITTEN DOCUMENT ANNOTATION (1)
HANDWRITTEN TEXT ALIGNMENT (1)
HANDWRITTEN TEXT IMAGE RECOGNITION (1)
HUMANS (1)
INK (1)
INTERACTIVE PREDICTIVE FRAMEWORK (1)
KNOWLEDGE BASED SYSTEMS (1)
MATHEMATICAL MODEL (1)
MONOTHETIC ALGORITHM (1)
MONOTHETIC CLASSIFIER (1)
MONOTHETIC SEPARATION (1)
MULTI SCRIPT DOCUMENT (1)
MULTI-SCRIPT MULTI-LINGUAL DOCUMENT (1)
MULTILINGUAL DOCUMENT (1)
NATURAL LANGUAGE PROCESSING (1)
OBJECT EXTRACTION (1)
OBJECT RECOGNITION (1)
OFFLINE HANDWRITTEN CHINESE DOCUMENTS (1)
OPTICAL CHARACTER RECOGNITION (1)
OPTICAL CHARACTER RECOGNITION SOFTWARE (1)
PEN GESTURE (1)
PIXEL (1)
SCRIPT IDENTIFICATION (1)
SCRIPT/LANGUAGE FORM (1)
SEGMENTATION (1)
SHAPE (1)
SINGLE CHARACTERS (1)
STATISTICAL ANALYSIS (1)
STATISTICAL MODELS (1)
TEXT IMAGES (1)
TEXT LINE IMAGES (1)
TEXT TRANSCRIPT (1)
UNCONSTRAINED HANDWRITTEN CHINESE TEXT LINES (1)
VISUALIZATION (1)
VOCABULARY (1)
WRITING (1)
more

INFONA - science communication portal

Search results

Integrating Geometric Context for Text Alignment of Handwritten Chinese Documents

Computer Assisted Transcription of Text Images: Results on the GERMANA Corpus and Analysis of Improvements Needed for Practical Use

Adaptive Correction of Errors from Segmented Digital Ink Texts in Chinese Based on Context

Monothetic separation of Telugu, Hindi and English text lines from a multi script document

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options