Search results

Items from 1 to 20 out of 57 results

chapter

Keyword spotting in degraded document using mixed OCR and word shape coding

Yong Xia, Guangri Quan, Yongdong Xu, Yushan Sun

2010 IEEE International Conference on Intelligent Computing and Intelligent Systems > 3 > 411 - 414

2010 IEEE International Conference on Intelligent Computing and Intelligent Systems (ICIS 2010)

This paper presents a new way for keyword spotting in degraded imaged document. Two prevalent word indexing, OCR and word shape coding, are combined compactly based on the recognition confidence evaluation. The basic procedures are as follows. First, OCR candidates are used for OCR indexing. Second, a new stoke

chapter

A modified approach to keyword extraction based on word-similarity

Meng Wenchao, Liu Lianchen, Dai Ting

2009 IEEE International Conference on Intelligent Computing and Intelligent Systems > 3 > 388 - 392

2009 IEEE International Conference on Intelligent Computing and Intelligent Systems (ICIS 2009)

Two keyword-extraction ways are usually used, one is simply using the information from exactly single word like word frequency and TF.IDF, the other is based on the relationship between words. The relationship is usually described as word similarity which derives from a corpus (WordNet, HowNet) or man-made thesaurus

chapter

Keyword Extraction Using Word Co-occurrence

C Wartena, R Brussee, W Slakhorst

2010 Workshops on Database and Expert Systems Applications > 54 - 58

2010 21st International Conference on Database and Expert Systems Applications

A common strategy to assign keywords to documents is to select the most appropriate words from the document text. One of the most important criteria for a word to be selected as keyword is its relevance for the text. The tf.idf score of a term is a widely used relevance measure. While easy to compute and giving quite

chapter

Hot keyword identification for extracting web public opinion

Zhiqi Fang, Yue Ning, Tingshao Zhu

5th International Conference on Pervasive Computing and Applications > 116 - 121

2010 5th International Conference on Pervasive Computing and Applications (ICPCA 2010)

Internet is becoming an increasingly important platform for ordinary life and work. It is expected that keyword extraction can help people quickly find hot spots on the web, since keywords in a document provide important information about the content of the document. In this paper, we propose to use text clustering

chapter

A new keyword spotting approach

H. Bahi, N. Benati

2009 International Conference on Multimedia Computing and Systems > 77 - 80

2009 International Conference on Multimedia Computing and Systems (ICMCS'09)

Keyword spotting is the task of identifying the occurrences of certain desired keywords in an arbitrary speech signal. Keyword spotting has many applications one of them is telephone routing. In particular, we consider a big company which receives thousands of telephone calls daily. We are interested with the

chapter

Automatic Chinese Keyword Extraction Based on KNN for Implicit Subject Extraction

Zhang Qingguo, Zhang Chengzhi

2008 International Symposium on Knowledge Acquisition and Modeling > 689 - 692

2008 International Symposium on Knowledge Acquisition and Modeling (KAM)

In this paper, a method of automatic Chinese keyword extraction based on KNN is proposed. Firstly, it preprocesses the document by vector space model. Secondly, it constructs a set of candidate keywords based on KNN method and the labeled dataset. Finally, it post-processes on candidate keywords by the character of

chapter

News Keyword Extraction for Topic Tracking

Sungjick Lee, Han-Joon Kim

2008 Fourth International Conference on Networked Computing and Advanced Information Management > 2 > 554 - 559

2008 Fourth International Conference on Networked Computing and Advanced Information Management (NCM)

This paper presents a keyword extraction technique that can be used for tracking topics over time. In our work, keywords are a set of significant words in an article that gives high-level description of its contents to readers. Identifying keywords from a large amount of on-line news data is very useful in that it can

chapter

Contextual Ranking of Keywords Using Click Data

U. Irmak, V. von Brzeski, R. Kraft

2009 IEEE 25th International Conference on Data Engineering > 457 - 468

2009 IEEE 25th International Conference on Data Engineering. ICDE 2009

The problem of automatically extracting the most interesting and relevant keyword phrases in a document has been studied extensively as it is crucial for a number of applications. These applications include contextual advertising, automatic text summarization, and user-centric entity detection systems. All these

chapter

Image Clustering Using Visual and Text Keywords

R. Agrawal, Changhua Wu, W.I. Grosky, F. Fotouhi

2007 International Symposium on Computational Intelligence in Robotics and Automation > 49 - 54

IEEE International Symposium on Computational Intelligence in Robotics and Automation, 2007

In classical image classification approaches, low-level features have been used. But the high dimensionality of feature spaces poses a challenge in terms of feature selection and distance measurement during the clustering process. In this paper, we propose an approach to generate visual keyword and combine both visual

chapter

Improved algorithm for keywords extraction from documents without corpus

Jing Chen, Jianfeng Wu

2009 IEEE 10th International Conference on Computer-Aided Industrial Design&Conceptual Design > 2339 - 2341

2009 IEEE 10th International Conference on Computer-Aided Industrial Design & Conceptual Design. E-Business, Creative Design, Manufacturing. (CAID&CD 2009)

In this paper, an algorithm for extracting keywords without corpus is described. We use the co-occurrence information of the words and the biases of distribution to extract the more important words based on the most frequently appearing words so called reference words. Firstly, the most frequently terms are chosen

chapter

Study on question answering system for biomedical domain

Bo Xu, Hongfei Lin, Baoyan Liu

2009 IEEE International Conference on Granular Computing > 626 - 629

2009 IEEE International Conference on Granular Computing (GrC 2009)

This paper focuses on setting up a question-answering oriented biomedical domain, and it applies several different approaches to the different processing phases. Firstly, it uses shallow parser to identify the types of questions and extract the keywords, and the keywords are expanded with UMLS for the purpose of

chapter

Music Information Retrieval System Using Lyrics and Melody Information

Tao Wang, Dong-Ju Kim, Kwang-Seok Hong, Jeh-Seon Youn

2009 Asia-Pacific Conference on Information Processing > 2 > 601 - 604

2009 Asia-Pacific Conference on Information Processing, APCIP

the userpsilas acoustic signal from a singing voice and retrieves the music information using both lyrics and melody information. The lyrics recognition module uses a keyword spotting system based on text-content of the lyrics by an HMM comparison engine. The melody recognition module extracts pitch and MFCC features

chapter

HMM-based Word Spotting in Handwritten Documents Using Subword Models

A Fischer, A Keller, V Frinken, H Bunke

2010 20th International Conference on Pattern Recognition > 3416 - 3419

2010 20th International Conference on Pattern Recognition (ICPR 2010)

Handwritten word spotting aims at making document images amenable to browsing and searching by keyword retrieval. In this paper, we present a word spotting system based on Hidden Markov Models (HMM) that uses trained subword models to spot keywords. With the proposed method, arbitrary keywords can be spotted that do

chapter

Development of Retrieval Methods for RESTful Web Services Using Semantic Technologies

Seung-Jun Cha, Yun-Jeong Choi, Kyu-Chul Lee

2010 IEEE/ACIS 9th International Conference on Computer and Information Science > 912 - 917

2010 IEEE/ACIS 9th International Conference on Computer and Information Science (ICIS 2010)

With the advent of Web 2.0, RESTful web services are becoming increasingly popular to emphasize the web as platform. There are already many RESTful web services and the number of services is increasing rapidly. Thus, it can be difficult to find specific services using keyword based retrieval. To solve this problem, a

chapter

Building Concept Network-Based User Profile for Personalized Web Search

Han-joon Kim, Sungjick Lee, Byungjeong Lee, Sooyong Kang

2010 IEEE/ACIS 9th International Conference on Computer and Information Science > 567 - 572

2010 IEEE/ACIS 9th International Conference on Computer and Information Science (ICIS 2010)

FCA, a session interest concept is defined as a pair of extent and intent where the extent covers a set of documents selected by the user among the search results and the intent covers a set of keyword features extracted from the selected documents. And, in order to make a concept network grow, we need to calculate the

chapter

An Information Extraction Model for Unconstrained Handwritten Documents

S Thomas, C Chatelain, L Heutte, T Paquet

2010 20th International Conference on Pattern Recognition > 3412 - 3415

2010 20th International Conference on Pattern Recognition (ICPR 2010)

In this paper, a new information extraction system by statistical shallow parsing in unconstrained handwritten documents is introduced. Unlike classical approaches found in the literature as keyword spotting or full document recognition, our approach relies on a strong and powerful global handwriting model. A entire

chapter

An Image Retrieval Approach Based on Composite Features and Graph Matching

M.A. Helala, M.M. Selim, H.H. Zayed

2009 Second International Conference on Computer and Electrical Engineering > 1 > 466 - 473

2009 Second International Conference on Computer and Electrical Engineering (ICCEE 2009)

where our approach is tested on images retrieved from Google keyword based image search engine. The results show that a combination of our approach as a local image descriptor with another global descriptor outperforms other approaches.

chapter

A Latent Semantic Analysis Based Method of Getting the Category Attribute of Words

Zongli Jiang, Changdong Lu

2009 International Conference on Electronic Computer Technology > 141 - 146

2009 International Conference on Electronic Computer Technology. ICECT 2009

Current search engines have two problems, losing useful information and including useless information. These two problems are aroused by the keyword matching retrieval model, which is adopted by almost all search engines. We introduce the conception of category attribute of a word. According to the category attribute

chapter

Text mining for chat message analysis

S.C. Hui, Yulan He, Haichao Dong

2008 IEEE Conference on Cybernetics and Intelligent Systems > 411 - 416

2008 IEEE Conference on Cybernetics and Intelligent Systems

provide simple message analysis features such as browsing and simple keyword-based searching of the recorded messages. In this paper, we propose a system, called IMAnalysis, that supports intelligent chat message analysis using text mining techniques. The IMAnalysis system provides functions on chat message retrieval, social

chapter

Semi-supervised Chinese compound word extraction based on HMM

Hui He, Bo Chen, Jun Guo

2008 7th World Congress on Intelligent Control and Automation > 2077 - 2081

2008 7th World Congress on Intelligent Control and Automation

classification/clustering as features. Also, this approach can be applied in keyword recommendation system in advertisement for different kinds of advertisers because of its expansibility and versatility.

Keywords:
FEATURE EXTRACTION
INFORMATION RETRIEVAL

Publication date

Set your own date range

Publication type

book (55)
article (2)

Keywords

DATA MINING (33)
TEXT ANALYSIS (20)
INTERNET (14)
SEMANTICS (10)
ACCURACY (9)
DATABASES (9)
HIDDEN MARKOV MODELS (9)
INFORMATION EXTRACTION (9)
TRAINING (9)
SEARCH ENGINES (8)
IMAGE RETRIEVAL (7)
KEYWORD EXTRACTION (7)
MACHINE LEARNING (7)
ONTOLOGIES (7)
PATTERN CLASSIFICATION (7)
MUSIC (6)
WEB PAGES (6)
ALGORITHM DESIGN AND ANALYSIS (5)
LEARNING (ARTIFICIAL INTELLIGENCE) (5)
ONTOLOGIES (ARTIFICIAL INTELLIGENCE) (5)
SUPPORT VECTOR MACHINES (5)
CLASSIFICATION ALGORITHMS (4)
COMPUTATIONAL MODELING (4)
CONTEXT (4)
HTML (4)
IMAGE COLOR ANALYSIS (4)
NATURAL LANGUAGE PROCESSING (4)
PATTERN CLUSTERING (4)
QUERY PROCESSING (4)
SEARCH ENGINE (4)
VISUALIZATION (4)
BOOTSTRAPPING (3)
BUILDINGS (3)
FILTERING (3)
HANDWRITING RECOGNITION (3)
HISTOGRAMS (3)
KEYWORD SPOTTING (3)
LIBRARIES (3)
MATHEMATICAL MODEL (3)
MATRIX DECOMPOSITION (3)
MULTIMEDIA COMMUNICATION (3)
MUSIC INFORMATION RETRIEVAL (3)
SINGULAR VALUE DECOMPOSITION (3)
SUPPORT VECTOR MACHINE (3)
TAGGING (3)
WEB SITES (3)
WORD PROCESSING (3)
ABSTRACTS (2)
AUDIO SIGNAL PROCESSING (2)
BAYES METHODS (2)
CBIR (2)
CLUSTERING ALGORITHMS (2)
CO-OCCURRENCE (2)
COMPUTERS (2)
CONTENT-BASED RETRIEVAL (2)
CORRELATION (2)
DICTIONARIES (2)
DIGITAL LIBRARIES (2)
DOCUMENT HANDLING (2)
ELECTRONIC MAIL (2)
ELECTRONIC PUBLISHING (2)
ENCYCLOPEDIAS (2)
EQUATIONS (2)
FEATURE SELECTION (2)
FULL-TEXT RETRIEVAL (2)
GAMES (2)
GAUSSIAN MIXTURE MODEL (2)
GRAPH THEORY (2)
HANDWRITTEN DOCUMENTS (2)
HIDDEN MARKOV MODEL (2)
HIERARCHICAL NAVIGATION FUNCTION (2)
HYPERMEDIA MARKUP LANGUAGES (2)
IMAGE EDGE DETECTION (2)
IMAGE REPRESENTATION (2)
INDEXING (2)
INFORMATION ANALYSIS (2)
INFORMATION RETRIEVAL SYSTEMS (2)
K-NEAREST NEIGHBOR (2)
KERNEL (2)
KNOWLEDGE ACQUISITION (2)
LARGE SCALE INTEGRATION (2)
MEASUREMENT (2)
NATURAL LANGUAGES (2)
NAVIGATION (2)
NIOBIUM (2)
ONTOLOGY (2)
OWL (2)
PATTERN MATCHING (2)
PROBABILITY DISTRIBUTION (2)
RESOURCE DESCRIPTION FRAMEWORK (2)
SEMANTIC WEB (2)
SHALLOW PARSING MODEL (2)
SILICON (2)
SPEECH PROCESSING (2)
STATISTICAL ANALYSIS (2)
TAXONOMY (2)
TEXT CATEGORIZATION (2)
TEXT CLASSIFICATION (2)
more

INFONA - science communication portal

Search results

Keyword spotting in degraded document using mixed OCR and word shape coding

A modified approach to keyword extraction based on word-similarity

Keyword Extraction Using Word Co-occurrence

Hot keyword identification for extracting web public opinion

A new keyword spotting approach

Automatic Chinese Keyword Extraction Based on KNN for Implicit Subject Extraction

News Keyword Extraction for Topic Tracking

Contextual Ranking of Keywords Using Click Data

Image Clustering Using Visual and Text Keywords

Improved algorithm for keywords extraction from documents without corpus

Study on question answering system for biomedical domain

Music Information Retrieval System Using Lyrics and Melody Information

HMM-based Word Spotting in Handwritten Documents Using Subword Models

Development of Retrieval Methods for RESTful Web Services Using Semantic Technologies

Building Concept Network-Based User Profile for Personalized Web Search

An Information Extraction Model for Unconstrained Handwritten Documents

An Image Retrieval Approach Based on Composite Features and Graph Matching

A Latent Semantic Analysis Based Method of Getting the Category Attribute of Words

Text mining for chat message analysis

Semi-supervised Chinese compound word extraction based on HMM

Filter options

Publication date

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options