Search results for: J. van Beusekom

Items from 1 to 6 out of 6 results

chapter

Recognition Driven Page Orientation Detection

Y. Rangoni, F. Shafait, J. van Beusekom, T.M. Breuel

2009 16th IEEE International Conference on Image Processing (ICIP) > 1989 - 1992

2009 16th IEEE International Conference on Image Processing (ICIP 2009)

In document image recognition, orientation detection of the scanned page is necessary for the following procedures to work correctly as they assume that the text is well oriented. Several methods have been proposed, but most of them rely on heuristics of the script such as the graphical asymmetry between ascenders and descenders for Roman script. The literature shows that as soon as this assumption...

chapter

Automated Ground Truth Data Generation for Newspaper Document Images

T. Strecker, J. van Beusekom, S. Albayrak, T.M. Breuel

2009 10th International Conference on Document Analysis and Recognition > 1275 - 1279

2009 10th International Conference on Document Analysis and Recognition (ICDAR)

In document image understanding, public datasets with ground-truth are an important part of scientific work. They are not only helpful for developing new methods, but also provide a way of comparing performance. Generating these datasets, however, is time consuming and cost-intensive work, requiring a lot of manual effort. In this paper we both propose a way to semi-automatically generate ground-truthed...

chapter

Background variability modeling for statistical layout analysis

F. Shafait, J. van Beusekom, D. Keysers, T.M. Breuel

2008 19th International Conference on Pattern Recognition > 1 - 4

ICPR 2008 19th International Conference on Pattern Recognition

Geometric layout analysis plays an important role in document image understanding. Many algorithms known in literature work well on standard document images, achieving high text line segmentation accuracy on the UW-III dataset. These algorithms rely on certain assumptions about document layouts, and fail when their underlying assumptions are not met. Also, they do not provide confidence scores for...

chapter

Structural Mixtures for Statistical Layout Analysis

F. Shafait, J. van Beusekom, D. Keysers, T.M. Breuel

2008 The Eighth IAPR International Workshop on Document Analysis Systems > 415 - 422

2008 The Eighth IAPR International Workshop on Document Analysis Systems (DAS)

A key limitation of current layout analysis methods is that they rely on many hard-coded assumptions about document layouts and can not adapt to new layouts for which the underlying assumptions are not satisfied. Another major drawback of these approaches is that they do not return confidence scores for their outputs. These problems pose major challenges in large scale digitization efforts where a...

chapter

Example-Based Logical Labeling of Document Title Page Images

J. van Beusekom, D. Keysers, F. Shafait, T.M. Breuel

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007) > 2 > 919 - 923

2007 9th International Conference on Document Analysis and Recognition

This paper presents a flexible and effective example- based approach for labeling title pages which can be used for automated extraction of bibliographic data. The labels of interest are "title", "author", "abstract" and "affiliation". The method takes a set of labeled document layouts and a single unlabeled document layout as input and finds the best matching...

chapter

Distance measures for layout-based document image retrieval

J. van Beusekom, D. Keysers, F. Shafait, T.M. Breuel

Second International Conference on Document Image Analysis for Libraries (DIAL'6) > 11 pp. - 242

Second International Conference on Document Image Analysis for Libraries

Most methods for document image retrieval rely solely on text information to find similar documents. This paper describes a way to use layout information for document image retrieval instead. A new class of distance measures is introduced for documents with Manhattan layouts, based on a two-step procedure: First, the distances between the blocks of two layouts are calculated. Then, the blocks of one...

Filter options

Publication date

Set your own date range

Keywords

DOCUMENT IMAGE PROCESSING (5)
IMAGE MATCHING (3)
IMAGE SEGMENTATION (3)
LAYOUT (3)
ALGORITHM DESIGN AND ANALYSIS (2)
BOOKS (2)
COMPUTATIONAL MODELING (2)
LAYOUT ANALYSIS (2)
MATHEMATICAL MODEL (2)
OPTICAL CHARACTER RECOGNITION SOFTWARE (2)
PAGE SEGMENTATION (2)
STATISTICAL ANALYSIS (2)
STATISTICAL LAYOUT ANALYSIS (2)
TEXT ANALYSIS (2)
TRAINING (2)
AUTOMATED GROUND TRUTH DATA GENERATION (1)
AUTOMATIC GROUND-TRUTH (1)
AUTOMATIC LAYOUT (1)
BACKGROUND VARIABILITY MODELING (1)
BEST DISTANCE MEASURE (1)
BLOCK DISTANCES (1)
COMPARISON (1)
CONFIDENCE SCORE (1)
COST-INTENSIVE WORK (1)
DATA ANALYSIS (1)
DATA MINING (1)
DATASET (1)
DISTANCE MEASURES (1)
DOCUMENT HANDLING (1)
DOCUMENT IMAGE (1)
DOCUMENT IMAGE RECOGNITION (1)
DOCUMENT IMAGES (1)
DOCUMENT LAYOUT (1)
DOCUMENT LAYOUTS (1)
DOCUMENT TITLE PAGE IMAGE LABELING (1)
ENGINES (1)
EQUATIONS (1)
EXAMPLE-BASED LOGICAL LABELING (1)
GAUSSIAN DISTRIBUTION (1)
GEOMETRIC LAYOUT ANALYSIS (1)
GEOMETRY (1)
GRAPHICAL ASYMMETRY (1)
HUMAN INTERVENTION (1)
IMAGE ORIENTATION ANALYSIS (1)
IMAGE RETRIEVAL (1)
IMAGING (1)
LARGE-SCALE DIGITALIZATION PROCESS (1)
LAYOUT INFORMATION (1)
LAYOUT-BASED DOCUMENT IMAGE RETRIEVAL (1)
MANHATTAN DISTANCE (1)
MANHATTAN LAYOUT (1)
MANHATTAN LAYOUTS (1)
MARG DATABASE (1)
MARG DATASET (1)
MEDIA (1)
MINIMUM WEIGHT EDGE COVER MATCHING (1)
MULTIVARIATE GAUSSIAN DISTRIBUTION (1)
NEAREST NEIGHBOR CLASSIFIER (1)
NEWSPAPER DOCUMENT IMAGE (1)
OBJECT DETECTION (1)
OPTICAL CHARACTER RECOGNITION (1)
PAGE LAYOUT (1)
PROBABILISTIC MATCHING ALGORITHM (1)
PUBLIC DATASET (1)
PUBLISHING (1)
RECOGNITION DRIVEN PAGE ORIENTATION DETECTION (1)
RENDERING (COMPUTER GRAPHICS) (1)
ROBUSTNESS (1)
ROMAN SCRIPT (1)
SOCIETIES (1)
STRUCTURAL MIXTURE MODEL (1)
TEXT LINE SEGMENTATION (1)
TIME CONSUMING (1)
USA COUNCILS (1)
UW-1 DATASET (1)
UW-III DATASET (1)
XML (1)
XML FILE (1)
more

INFONA - science communication portal

Search results for: J. van Beusekom

Recognition Driven Page Orientation Detection

Automated Ground Truth Data Generation for Newspaper Document Images

Background variability modeling for statistical layout analysis

Structural Mixtures for Statistical Layout Analysis

Example-Based Logical Labeling of Document Title Page Images

Distance measures for layout-based document image retrieval

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options