Search results

Items from 1 to 11 out of 11 results

chapter

Keyword Matching in Historical Machine-Printed Documents Using Synthetic Data, Word Portions and Dynamic Time Warping

T. Konidaris, B. Gatos, S.J. Perantonis, A. Kesidis

2008 The Eighth IAPR International Workshop on Document Analysis Systems > 539 - 545

2008 The Eighth IAPR International Workshop on Document Analysis Systems (DAS)

In this paper we propose a novel and efficient technique for finding keywords typed by the user in digitised machine-printed historical documents using the dynamic time warping (DTW) algorithm. The method uses word portions located at the beginning and end of each segmented word of the processed documents and try to

chapter

Keyword Extraction Using Language Network

Jianyi Liu, Jinghua Wang

2007 International Conference on Natural Language Processing and Knowledge Engineering > 129 - 134

2007 IEEE International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE '07)

In this paper, we introduced language network and described three kinds of networks. Keyword extraction is an important technology in many areas of document processing. In particularly, a keyword extraction algorithm based on language network and PageRank is proposed. Firstly a semantic network for a single document

chapter

Words Clustering Based on Keywords Indexing from Large-scale Categorization Corpora

Liu Hua

2009 Fifth International Conference on Information Assurance and Security > 1 > 407 - 410

2009 Fifth International Conference on Information Assurance and Security (IAS)

Keywords are indexed automatically for large-scale categorization corpora. Indexed keywords of more than 20 documents are selected as seed words, thus overcoming subjectivity of selecting seed words in clustering; at the same time, clustering is limited to particular category corpora and keywords indexed feature

chapter

An Information Extraction Model for Unconstrained Handwritten Documents

S Thomas, C Chatelain, L Heutte, T Paquet

2010 20th International Conference on Pattern Recognition > 3412 - 3415

2010 20th International Conference on Pattern Recognition (ICPR 2010)

In this paper, a new information extraction system by statistical shallow parsing in unconstrained handwritten documents is introduced. Unlike classical approaches found in the literature as keyword spotting or full document recognition, our approach relies on a strong and powerful global handwriting model. A entire

chapter

Probabilistic model for a distributed feature selection method

Z. Berenyi, I. Vajk

2009 3rd International Workshop on Soft Computing Applications > 27 - 32

2009 3rd International Workshop on Soft Computing Applications (SOFA 2009)

attributes must be shared to have at every node a more accurate estimation of the global classifier. When expanding the knowledge of the local classifiers, to reduce costs, the network traffic should be kept to a minimum. We propose a probabilistic model for a keyword selection method which makes a more thorough analysis

chapter

Statistical Sentence Extraction for Information Distillation

D. Hakkani-Tur, G. Tur

2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '7 > 4 > IV-1 - IV-4

2007 IEEE International Conference on Acoustics, Speech, and Signal Processing

both manual and automatic transcriptions, for non-English documents, we use automatic translations. In this work, we use AdaBoost, a discriminative classification method with both lexical and semantic features. The results indicate 11%-13% relative improvement over a baseline keyword-spotting-based approach. We also show

chapter

A Graph-Based Approach for Multi-folder Email Classification

S Chakravarthy, A Venkatachalam, A Telang

2010 IEEE International Conference on Data Mining > 78 - 87

2010 10th IEEE International Conference on Data Mining (ICDM 2010)

This paper presents a novel framework for multi-folder email classification using graph mining as the underlying technique. Although several techniques exist (e.g., SVM, TF-IDF, n-gram) for addressing this problem in a delimited context, they heavily rely on extracting high-frequency keywords, thus ignoring the

chapter

A keyphrase based approach to interactive meeting summarization

K. Riedhammer, B. Favre, D. Hakkani-Tur

2008 IEEE Spoken Language Technology Workshop > 153 - 156

2008 IEEE Workshop on Spoken Language Technology. SLT 2008

Rooted in multi-document summarization, maximum marginal relevance (MMR) is a widely used algorithm for meeting summarization (MS). A major problem in extractive MS using MMR is finding a proper query: the centroid based query which is commonly used in the absence of a manually specified query, can not significantly outperform a simple baseline system. We introduce a simple yet robust algorithm to...

chapter

Text classification in the Turkish marketing domain for context sensitive ad distribution

Melih Engin, T. Can

2009 24th International Symposium on Computer and Information Sciences > 105 - 110

2009 24th International Symposium on Computer and Information Sciences (ISCIS)

performance in terms of accuracy and speed on text documents expressed as keyword root features.

chapter

A Context-Awareness for Mechanical Maintenance

Kyeong-Jin Oh, Jin-Guk Jung, Geun-Sik Jo

2011 International Conference on Information Science and Applications > 1 - 5

2011 International Conference on Information Science and Applications (ICISA 2011)

contextual information and keywords extracted from documents. For our experiments, we preprocessed hundreads of TASKs in the aircraft's maintenance manual and made several cases for context. Our experiments showed that our proposing system could provide information related with context.

chapter

Clustering WSDL Documents to Bootstrap the Discovery of Web Services

Khalid Elgazzar, Ahmed E Hassan, Patrick Martin

2010 IEEE International Conference on Web Services > 147 - 154

2010 IEEE International Conference on Web Services (ICWS)

user's request, the user has to construct the request using the keywords that best describe the user's objective and match correctly with the Web Service name or location. Clustering Web services based on function similarities would greatly boost the ability of Web services search engines to retrieve the most relevant Web

Filter options

Keywords:
DOCUMENT HANDLING
FEATURE EXTRACTION

Publication date

Set your own date range

Content availability

Available (10)
None (1)

Keywords

DATA MINING (5)
ACCURACY (3)
EQUATIONS (3)
CLASSIFICATION ALGORITHMS (2)
CLUSTERING ALGORITHMS (2)
INDEXING (2)
INFORMATION EXTRACTION (2)
INFORMATION RETRIEVAL (2)
INTERNET (2)
LEARNING (ARTIFICIAL INTELLIGENCE) (2)
NATURAL LANGUAGE PROCESSING (2)
PATTERN CLASSIFICATION (2)
TRAINING (2)
ABSTRACTS (1)
ADABOOST (1)
AIRCRAFT (1)
AIRCRAFT MAINTENANCE (1)
ARTIFICIAL INTELLIGENCE (1)
AUDIO FORMAT (1)
AUTOMATIC TRANSCRIPTIONS (1)
BASELINE KEYWORD-SPOTTING-BASED APPROACH (1)
BOOTSTRAP (1)
CATEGORIZATION CORPORA (1)
CENTROID BASED QUERY (1)
CISTR CORPUS (1)
CLASSIFICATION PROBLEM (1)
CLUSTERING (1)
CLUSTERING WSDL DOCUMENTS (1)
COMPUTER BOOTSTRAPPING (1)
CONTEXT (1)
CONTEXT SENSITIVE AD DISTRIBUTION (1)
CONTEXT-AWARE SYSTEMS (1)
DATA STRUCTURES (1)
DATABASES (1)
DIGITAL INFORMATION (1)
DIGITISED MACHINE-PRINTED HISTORICAL DOCUMENTS (1)
DISCRIMINATIVE CLASSIFICATION METHOD (1)
DISTANCE MEASUREMENT (1)
DISTRIBUTED FEATURE SELECTION METHOD (1)
DISTRIBUTED MOBILE ENVIRONMENT (1)
DOCUMENT CLASSIFIER (1)
DOCUMENT PROCESSING (1)
DOCUMENT SOURCES (1)
DOMANIAL WORDS (1)
DYNAMIC TIME WARPING (1)
ELECTRONIC MAIL (1)
ESTIMATION (1)
FASTENERS (1)
FEATURE EXTRACTION TECHNIQUES (1)
FREQUENCY ESTIMATION (1)
FULL DOCUMENT RECOGNITION (1)
GRAPH BASED APPROACH (1)
GRAPH MINING (1)
GRAPH REPRESENTATION (1)
GRAPHICAL USER INTERFACE (1)
GRAPHICAL USER INTERFACES (1)
GRAPHS (1)
HANDWRITING RECOGNITION (1)
HEURISTIC ALGORITHMS (1)
HIDDEN MARKOV MODELS (1)
HIGH LEVEL LANGUAGES (1)
HISTORICAL DOCUMENTS (1)
HISTORICAL MACHINE-PRINTED DOCUMENTS (1)
HUMANS (1)
IMAGE SEGMENTATION (1)
INFORMATION DISTILLATION (1)
INFORMATION EXTRACTION MODEL (1)
INFORMATION SHARING (1)
INFORMATION TECHNOLOGY (1)
INTERACTIVE MEETING SUMMARIZATION (1)
INTERACTIVE SYSTEMS (1)
KERNEL (1)
KEYPHRASE BASED SYSTEM (1)
KEYWORD EXTRACTION (1)
KEYWORD EXTRACTION ALGORITHM (1)
KEYWORD GENERATION (1)
KEYWORD MATCHING (1)
KEYWORD SELECTION METHOD (1)
KEYWORD SPOTTING (1)
KEYWORDS INDEXING (1)
LANGUAGE NETWORK (1)
LANGUAGE UNDERSTANDING (1)
LARGE-SCALE CATEGORIZATION (1)
LEXICAL FEATURES (1)
LINEAR KERNEL CLASSIFIERS (1)
MACHINE LEARNING (1)
MAINTENANCE ENGINEERING (1)
MANUAL TRANSCRIPTIONS (1)
MARKETING DATA PROCESSING (1)
MATERIALS (1)
MATHEMATICAL MODEL (1)
MAXIMUM MARGINAL RELEVANCE (1)
MECHANICAL ENGINEERING (1)
MECHANICAL MAINTENANCE (1)
MEETING SUMMARIZATION (1)
METEOROLOGY (1)
MINIMISATION (1)
MMR ALGORITHM (1)
more

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options