Search results for: Huaigu Cao

Items from 1 to 20 out of 32 results

chapter

Combining deep learning and language modeling for segmentation-free OCR from raw pixels

Stephen Rawls, Huaigu Cao, Ekraam Sabir, Prem Natarajan

2017 1st International Workshop on Arabic Script Analysis and Recognition (ASAR) > 119 - 123

2017 1st International Workshop on Arabic Script Analysis and Recognition (ASAR)

We present a simple yet effective LSTM-based approach for recognizing machine-print text from raw pixels. We use a fully-connected feed-forward neural network for feature extraction over a sliding window, the output of which is directly fed into a stacked bi-directional LSTM. We train the network using the CTC objective function and use a WFST language model during recognition. Experimental results...

chapter

Document Image Quality Assessment Using Discriminative Sparse Representation

Xujun Peng, Huaigu Cao, Prem Natarajan

2016 12th IAPR Workshop on Document Analysis Systems (DAS) > 227 - 232

2016 12th IAPR Workshop on Document Analysis Systems (DAS)

The goal of document image quality assessment (DIQA) is to build a computational model which can predict the degree of degradation for document images. Based on the estimated quality scores, the immediate feedback can be provided by document processing and analysis systems, which helps to maintain, organize, recognize and retrieve the information from document images. Recently, the bag-of-visual-words...

chapter

Document image OCR accuracy prediction via latent Dirichlet allocation

Xujun Peng, Huaigu Cao, Prem Natarajan

2015 13th International Conference on Document Analysis and Recognition (ICDAR) > 771 - 775

2015 13th International Conference on Document Analysis and Recognition (ICDAR)

Optical character recognition (OCR) accuracy of document images is an important factor for the success of many document processing and analysis tasks, especially for unconstraint captured document images. Although several document image OCR capability assessment methods are proposed, they mostly model the problem based on the empirically defined rules of image degradation, which cause the existing...

article

Integrating natural language processing with image document analysis: what we learned from two real-world applications

Jinying Chen, Huaigu Cao, Premkumar Natarajan

International Journal on Document Analysis and Recognition (IJDAR) > 2015 > 18 > 3 > 235-247

Automatically accessing information from unconstrained image documents has important applications in business and government operations. These real-world applications typically combine optical character recognition (OCR) with language and information technologies, such as machine translation (MT) and keyword spotting. OCR output has errors and presents unique challenges to late-stage processing. This...

chapter

Progress in the Raytheon BBN Arabic Offline Handwriting Recognition System

Huaigu Cao, Prem Natarajan, Xujun Peng, Krishna Subramanian, more

2014 14th International Conference on Frontiers in Handwriting Recognition > 555 - 560

2014 14th International Conference on Frontiers in Handwriting Recognition (ICFHR)

This paper presents the most recent progress and state of the art result obtained from BBN's Arabic offline handwriting recognition research. Our system is based a left-to-right hidden Markov model and integrates discriminative learning methods including discriminative MPE and n-best rescoring using the scores of glyph classifiers (SVM, DNN) and the RNNLM. Arabic-related features for n-best rescoring...

chapter

Applications of Recurrent Neural Network Language Model in Offline Handwriting Recognition and Word Spotting

Nan Li, Jinying Chen, Huaigu Cao, Bing Zhang, more

2014 14th International Conference on Frontiers in Handwriting Recognition > 134 - 139

2014 14th International Conference on Frontiers in Handwriting Recognition (ICFHR)

The recurrent neural network language model (RNNLM) is a discriminative, non-Markovian model that can capture long-span word history in natural language. It has been proved to be successful in automatic speech recognition and machine translation. In this work, we applied RNNLM to the n-best rescoring stage of the state-of-the-art BBN Byblos OCR (optical character recognition) system for handwriting...

chapter

Confusion Network Based Recurrent Neural Network Language Modeling for Chinese OCR Error Detection

Jinying Chen, Yue Wu, Huaigu Cao, Prem Natarajan

2014 22nd International Conference on Pattern Recognition > 1266 - 1271

2014 22nd International Conference on Pattern Recognition (ICPR)

This paper presents a new framework for OCR error detection, which uses a conditional random field model to combine rich features from multiple sources, including confusion networks (c-nets), lexical local context and recurrent neural network language model (RNNLM)1. We propose a novel, efficient method for computing character-level c-net based RNNLM scores by using dynamic programming and c-net partial...

chapter

Text detection and recognition in natural scenes and consumer videos

Arpit Jain, Xujun Peng, Xiaodan Zhuang, Pradeep Natarajan, more

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1245 - 1249

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We propose an end-to-end system for text detection and recognition in natural scenes and consumer videos. Maximally Stable Extremal Regions which are robust to illumination and viewpoint variations are selected as text candidates. Rich shape descriptors such as Histogram of Oriented Gradients, Gabor filter, corners and geometrical features are used to represent the candidates and classified using...

chapter

Text Classification via iVector Based Feature Representation

Shengxin Zha, Xujun Peng, Huaigu Cao, Xiandan Zhuang, more

2014 11th IAPR International Workshop on Document Analysis Systems > 151 - 155

2014 11th IAPR International Workshop on Document Analysis Systems (DAS)

In this paper, we address the problem of text classification: classifying modern machine-printed text, handwritten text and historical typewritten text from degraded noisy documents. We propose a novel text classification approach based on iVector, a newly developed concept in speaker verification. To a given text line, the iVector is a fixed-length feature vector representation, transformed from...

chapter

Exploiting Stroke Orientation for CRF Based Binarization of Historical Documents

Xujun Peng, Huaigu Cao, Krishna Subramanian, Rohit Prasad, more

2013 12th International Conference on Document Analysis and Recognition > 1034 - 1038

2013 12th International Conference on Document Analysis and Recognition (ICDAR)

We present a novel binarization method that is especially effective on historical documents with the following characteristics: (a) the documents contain free-form cursive handwritten text with significant but consistent slant, (b) scanning artifacts resulting in the text and background pixels not having uniform intensity even within the same page, and (c) pages containing significant amount of bleeds...

chapter

Detecting OOV Names in Arabic Handwritten Data

Jinying Chen, Rohit Prasad, Huaigu Cao, Premkumar Natarajan

2013 12th International Conference on Document Analysis and Recognition > 994 - 998

2013 12th International Conference on Document Analysis and Recognition (ICDAR)

This paper presents a novel approach to detect Arabic OOV names from OCR'ed handwritten documents. In our approach, OOV names are searched for using approximate string match on character consensus networks (cnets). The retrieved regions are re-ranked using novel features representing the quality of the match and the likelihood of the detected region to be an OOV name. Our features that encode word...

chapter

Applying Discriminatively Optimized Feature Transform for HMM-based Off-Line Handwriting Recognition

Jin Chen, Bing Zhang, Huaigu Cao, Rohit Prasad, more

2012 International Conference on Frontiers in Handwriting Recognition > 219 - 224

2012 International Conference on Frontiers in Handwriting Recognition (ICFHR)

Feature extraction is an important step in off-line handwriting recognition systems to represent raw handwriting in a low-dimensional, tractable feature space. Traditionally, linear feature transforms such as Principle Component Analysis (PCA), Linear Discriminative Analysis (LDA) are commonly used. The assumptions they make, however, usually cannot be satisfied in practice and thus the best performance...

chapter

Local Segmentation of Touching Characters Using Contour Based Shape Decomposition

Le Kang, David Doermann, Huaigu Cao, Rohit Prasad, more

2012 10th IAPR International Workshop on Document Analysis Systems > 460 - 464

2012 10th IAPR International Workshop on Document Analysis Systems (DAS)

We propose a contour based shape decomposition approach that provides local segmentation of touching characters. The shape contour is linearized into edge lets and edge lets are merged into boundary fragments. The connection cost between boundary fragments is obtained by considering local smoothness, connection length and a stroke-level property called the Same Stroke Rate. Samples of connections...

chapter

Document recognition and translation system for unconstrained Arabic documents

Huaigu Cao, Jinying Chen, Jacob Devlin, Rohit Prasad, more

Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012) > 318 - 321

2012 21st International Conference on Pattern Recognition (ICPR)

We describe an end-to-end system for translating real-world Arabic field documents that contain a mix of handwritten and printed content into English. These documents are extremely challenging to recognize due to presence of noise, poor image capture quality, and variations in writing style, writing device, font, layout, genre, etc. Furthermore, no off-the-shelf machine translation (MT) engine is...

chapter

Extracting information from handwritten content in census forms

Huaigu Cao, Krishna Subramanian, Xujun Peng, Jinying Chen, more

Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012) > 306 - 309

2012 21st International Conference on Pattern Recognition (ICPR)

In this paper, we describe our approach for extracting salient information from US census form images. These forms present several challenges including variations in individual form templates, skew, writing device, writing style, etc. We describe an innovative registration algorithm that is robust to scale variations for segmenting the input image into cells. Following registration, the borders of...

chapter

Automated image quality assessment for camera-captured OCR

Xujun Peng, Huaigu Cao, Krishna Subramanian, Rohit Prasad, more

2011 18th IEEE International Conference on Image Processing > 2621 - 2624

2011 18th IEEE International Conference on Image Processing (ICIP 2011)

Camera-captured optical character recognition (OCR) is a challenging area because of artifacts introduced during image acquisition with consumer-domain hand-held and Smart phone cameras. Critical information is lost if the user does not get immediate feedback on whether the acquired image meets the quality requirements for OCR. To avoid such information loss, we propose a novel automated image quality...

chapter

Graph Clustering-Based Ensemble Method for Handwritten Text Line Segmentation

Vasant Manohar, Shiv N. Vitaladevuni, Huaigu Cao, Rohit Prasad, more

2011 International Conference on Document Analysis and Recognition > 574 - 578

2011 International Conference on Document Analysis and Recognition (ICDAR)

Handwritten text line segmentation on real-world data presents significant challenges that cannot be overcome by any single technique. Given the diversity of approaches and the recent advances in ensemble-based combination for pattern recognition problems, it is possible to improve the segmentation performance by combining the outputs from different line finding methods. In this paper, we propose...

chapter

Text Extraction from Video Using Conditional Random Fields

Xujun Peng, Huaigu Cao, Rohit Prasad, Premkumar Natarajan

2011 International Conference on Document Analysis and Recognition > 1029 - 1033

2011 International Conference on Document Analysis and Recognition (ICDAR)

In this paper, we describe an approach to extract text from broadcast videos. Candidate blocks are detected based on edge extraction results. Corners and geometrical features are used for the purpose of initial classification which is carried out by using a support vector machine (SVM). Considering the spatial inter-dependencies of different regions in the image, we propose a novel conditional random...

chapter

OCR-Driven Writer Identification and Adaptation in an HMM Handwriting Recognition System

Huaigu Cao, Rohit Prasad, Prem Natarajan

2011 International Conference on Document Analysis and Recognition > 739 - 743

2011 International Conference on Document Analysis and Recognition (ICDAR)

We present an OCR-driven writer identification algorithm in this paper. Our algorithm learns writer-specific characteristics more precisely from explicit character alignment using the Viterbi algorithm and shows significant reduction of close-set writer identification error rates, compared with the GMM-based method. With writers' identities retrieved, we improve the performance of handwriting recognition...

chapter

Handwritten and Typewritten Text Identification and Recognition Using Hidden Markov Models

Huaigu Cao, Rohit Prasad, Prem Natarajan

2011 International Conference on Document Analysis and Recognition > 744 - 748

2011 International Conference on Document Analysis and Recognition (ICDAR)

We present a system for identification and recognition of handwritten and typewritten text from document images using hidden Markov models (HMMs) in this paper. Our text type identification uses OCR decoding to generate word boundaries followed by word-level handwritten/typewritten identification using HMMs. We show that the contextual constraints from the HMM significantly improves the identification...

Publication date

Set your own date range

INFONA - science communication portal

Search results for: Huaigu Cao

Combining deep learning and language modeling for segmentation-free OCR from raw pixels

Document Image Quality Assessment Using Discriminative Sparse Representation

Document image OCR accuracy prediction via latent Dirichlet allocation

Integrating natural language processing with image document analysis: what we learned from two real-world applications

Progress in the Raytheon BBN Arabic Offline Handwriting Recognition System

Applications of Recurrent Neural Network Language Model in Offline Handwriting Recognition and Word Spotting

Confusion Network Based Recurrent Neural Network Language Modeling for Chinese OCR Error Detection

Text detection and recognition in natural scenes and consumer videos

Text Classification via iVector Based Feature Representation

Exploiting Stroke Orientation for CRF Based Binarization of Historical Documents

Detecting OOV Names in Arabic Handwritten Data

Applying Discriminatively Optimized Feature Transform for HMM-based Off-Line Handwriting Recognition

Local Segmentation of Touching Characters Using Contour Based Shape Decomposition

Document recognition and translation system for unconstrained Arabic documents

Extracting information from handwritten content in census forms

Automated image quality assessment for camera-captured OCR

Graph Clustering-Based Ensemble Method for Handwritten Text Line Segmentation

Text Extraction from Video Using Conditional Random Fields

OCR-Driven Writer Identification and Adaptation in an HMM Handwriting Recognition System

Handwritten and Typewritten Text Identification and Recognition Using Hidden Markov Models

Filter options

Publication date

Publication type

Keywords

Data set

Journal

INFONA - science communication portal

Search results for: Huaigu Cao

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Data set

Journal

Reporting an error / abuse

Sending the report failed

Accessibility options