Search results

Items from 1 to 8 out of 8 results

article

Weighting scheme for image retrieval based on bag-of-visual-words

Lei Zhu, Hai Jin, Ran Zheng, Xiaowen Feng

IET Image Processing > 2014 > 8 > 9 > 509 - 518

Inspired by the success of bag-of-words in text retrieval, bag-of-visual-words and its variants are widely used in content-based image retrieval to describe visual content. Various weighting schemes have also been proposed to integrate different yet complementary visual-words. However, most of these weighting schemes tend to use a fixed weight for every visual-word extracted from the query image, which...

chapter

Converting printed Sinhala documents to formatted editable text

S Ajward, N Jayasundara, S Madushika, R Ragel

2010 Fifth International Conference on Information and Automation for Sustainability > 138 - 143

2010 5th International Conference on Information and Automation for Sustainability (ICIAfS)

Digitizing printed documents is always a challenge faced by the computing society. Digitization of text not only allows users to easily modify and reprint printed documents, but is also a present-day need because of the word-search capability now widely available. Converting a printed document into a stream of characters using OCR (optical character recognition) techniques is a widely...

chapter

Word segmentation in a document image using spectral partitioning

V Manikandan, V Venkatachalam, M Kirthiga, K Harini, more

2010 IEEE International Conference on Computational Intelligence and Computing Research > 1 - 4

2010 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC 2010)

State-of-the-art document segmentation algorithms employ ad hoc solutions which use some document properties and iteratively segment the document image. These solutions need to be adapted frequently and sometimes fail to perform well for complex scripts. This calls for a generalized solution that achieves a one-shot segmentation that is globally optimal. This paper describes one such solution based on...

chapter

A Full-Text Search System for Images of Hand-Written Cursive Documents

Hajime Imura, Yuzuru Tanaka

2010 12th International Conference on Frontiers in Handwriting Recognition > 640 - 645

2010 12th International Conference on Frontiers in Handwriting Recognition (ICFHR 2010)

We propose a full-text search technique for image-scanned documents that does not recognize individual characters. The system is as fast as a full-text search of machine-readable documents. Such a system is important when working with historical handwritten manuscripts. The proposed method works independently of differences in language and font because it uses a new pseudo-coding scheme based on the...

chapter

Efficient Transcript Mapping to Ease the Creation of Document Image Segmentation Ground Truth with Text-Image Alignment

N Stamatopoulos, G Louloudis, B Gatos

2010 12th International Conference on Frontiers in Handwriting Recognition > 226 - 231

2010 12th International Conference on Frontiers in Handwriting Recognition (ICFHR 2010)

One of the major issues in document image processing is the efficient creation of ground truth in order to be used for training and evaluation purposes. Since a large number of tools have to be trained and evaluated in realistic circumstances, we need to have a quick and low cost way to create the corresponding ground truth. Moreover, the specific need for having the correct text correlated with the...

chapter

Fusion of Word Spotting and Spatial Information for Figure Caption Retrieval in Historical Document Images

K. Khurshid, C. Faure, N. Vincent

2009 10th International Conference on Document Analysis and Recognition > 266 - 270

2009 10th International Conference on Document Analysis and Recognition (ICDAR)

We present a method for figure caption detection by employing a fusion of several information sources. The evaluation is performed on documents gathered from the collection of the historical medical digital library Medic@. A method based on perceptual grouping simultaneously segments the vertical and horizontal text lines in a page. Spatial relationships between the text lines and the graphics are...

chapter

Using Linguistic Information to Classify Portuguese Text Documents

T. Goncalves, P. Quaresma

2008 Seventh Mexican International Conference on Artificial Intelligence > 94 - 100

2008 Seventh Mexican International Conference on Artificial Intelligence (MICAI)

This paper examines the role of various linguistic structures on text classification applying the study to the Portuguese language. Besides using a bag-of-words representation where we evaluate different measures and use linguistic knowledge for term selection, we do several experiments using syntactic information representing documents as strings of words and strings of syntactic parse trees. To...

chapter

Bayesian blind separation of mixed text patterns

Feng Su, Shijie Cai, A. Mohammad-Djafari

2008 International Conference on Audio, Language and Image Processing > 1373 - 1378

2008 International Conference on Audio, Language and Image Processing

In this paper we consider the problem of unsupervised separation of mixed text patterns based on blind source separation models. We propose a hierarchical Markov random field model for the source patterns, which enforces piece-wise regularity on both labels and intensities of image pixels. We also present a hierarchical Bayesian BSS framework, in which the unknown sources and labels are estimated...

Filter options

Keywords:
WORD PROCESSING
DOCUMENT IMAGE PROCESSING
