Search results

Items from 1 to 11 out of 11 results

chapter

Chinese Automatic Text Summarization Based on Keyword Extraction

Jiang Xiao-yu

2009 First International Workshop on Database Technology and Applications > 225 - 228

2009 First International Workshop on Database Technology and Applications, DBTA

In order to over the shortcoming of the incomprehensive of summarization, a new lexical-chain-based keywords extraction and automatic summarization algorithm from Chinese texts based on the unknown word recognition using co-occurrence of neighbor words is proposed in this paper, and an algorithm for constructing

chapter

A corpus-based approach for keyword identification using supervised learning techniques

J. TeCho, C. Nattee, T. Theeramunkong

2008 5th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology > 1 > 33 - 36

2008 5th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON)

This paper presents a corpus-based approach for extracting keywords from a text written in a language that has no word boundary. Based on the concept of Thai character cluster, a Thai running text is preliminarily segmented into a sequence of inseparable units, called TCCs. To enable the handling of a large-scaled

chapter

A document comparison approach using hybrid keyword and structured full text vocabulary searches

K Boonsuk, P Sophatsathit

2011 3rd International Conference on Computer Research and Development > 1 > 252 - 257

2011 3rd International Conference on Computer Research and Development (ICCRD 2011)

This paper proposes a systematic full text search on document using a combined keyword and structural similarity of documents under consideration. The approach operates in two steps. The first step uses a set of designated keywords to acquire potential desired documents by means of an open source tool. The second step

chapter

Keyword Extraction Using Word Co-occurrence

C Wartena, R Brussee, W Slakhorst

2010 Workshops on Database and Expert Systems Applications > 54 - 58

2010 21st International Conference on Database and Expert Systems Applications

A common strategy to assign keywords to documents is to select the most appropriate words from the document text. One of the most important criteria for a word to be selected as keyword is its relevance for the text. The tf.idf score of a term is a widely used relevance measure. While easy to compute and giving quite

chapter

Theses cluster based on bilingual and synonymous keyword sets using mutual information

Chung-Yi Huang, Rung-Ching Chen

2009 International Conference on Machine Learning and Cybernetics > 5 > 2999 - 3004

2009 Eighth International Conference on Machine Learning and Cybernetics (ICMLC)

Searching published papers is a required activity for the researching process. Since articles are presented in various languages, it makes precise queries hard to achieve. In this paper, we propose an automatic theses clustering method based on bilingual and synonymous keyword sets which includes Chinese and English

chapter

An improved method of keywords extraction based on short technology text

Jun Wang, Lei Li, Fuji Ren

Proceedings of the 6th International Conference on Natural Language Processing and Knowledge Engineering(NLPKE-2010) > 1 - 6

2010 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE 2010)

Keywords are the critical resources of information management and retrieval, automatic text classification and clustering. The keywords extraction plays an important role in the process of constructing structured text. Current algorithms of keywords extraction have matured in some ways. However the errors of word

chapter

LJParser: LING-JOIN web search & text mining development platform

Yiwei Wang

2010 4th International Universal Communication Symposium > 407

2010 4th International Universal Communication Symposium (IUCS 2010)

word segmentation and pas tagging, language modeling and term translation, text clustering, text categorization, text summarization, keywords identification in a single document and duplication detection. The application can invoke any module of LJParser in Windows and Linux using any language including C, C# and Java

chapter

A Novel Approach to Improve the Accuracy of Web Retrieval

Vitaly Klyuev, Vladimir Oleshchuk

2010 5th International Conference on Future Information Technology > 1 - 5

2010 5th International Conference on Future Information Technology (FutureTech)

when the sentence is analyzed. The goal is to put each noun and verb of the sentence on the right place on the tree. Taking this information into account, it is possible to solve the ambiguity problem for the query keywords and create the indicative summaries taking into account query words, and semantically related

chapter

Study on question classification approach mixing multiple semantic characteristics together

LiGuo Duan, YanQin Niu, JunJie Chen

2011 3rd International Conference on Computer Research and Development > 1 > 354 - 357

2011 3rd International Conference on Computer Research and Development (ICCRD 2011)

This article proposes such a question classification approach that integrates multiple semantic features. It is aimed at these two questions in Chinese question classification models: inaccurate semantic information extraction and too slow processing speed caused by too high Eigenvector dimension. With the help of HowNet and the support vector machine and syntactic and semantic information of question...

chapter

Marine literature categorization based on minimizing the labelled data

Wei Zhang, Qiuhong Wang, Ye Deng, Ranran Du

Proceedings of the 6th International Conference on Natural Language Processing and Knowledge Engineering(NLPKE-2010) > 1 - 6

2010 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE 2010)

attribute labels to them. It can greatly boost the efficiency of text processing. For building up two views, we split features into two parts, each of which can form an independent view. One view is made up of the feature set of abstract, and the other is made up of the feature sets of title, keywords, creator and department

chapter

Single Word Term Extraction Using a Bilingual Semantic Lexicon-Based Approach

Hongying Zan, Guocheng Duan, Ming Fan

Third International Conference on Natural Computation (ICNC 2007) > 5 > 451 - 456

2007 3rd International Conference on Natural Computation

semantic lexicon for domain-specific term extraction. The experimental results show that our approach can get high precision in legal field. Keywords: automatic term recognition, bilingual seeds set, Chinese concept dictionary, legal terminology, single word term.

Filter options

Keywords:
TEXT ANALYSIS
WORD PROCESSING

Publication date

Set your own date range

Keywords

DATA MINING (5)
SEMANTICS (4)
ACCURACY (3)
ALGORITHM DESIGN AND ANALYSIS (3)
DICTIONARIES (3)
FEATURE EXTRACTION (3)
INFORMATION RETRIEVAL (3)
CLASSIFICATION ALGORITHMS (2)
LEARNING (ARTIFICIAL INTELLIGENCE) (2)
NATURAL LANGUAGE PROCESSING (2)
PATTERN CLASSIFICATION (2)
SEARCH ENGINES (2)
SUPPORT VECTOR MACHINES (2)
TRAINING (2)
ABSTRACTS (1)
ACTIVE-LEARNING (1)
AMBIGUITY PROBLEM (1)
ARTIFICIAL NEURAL NETWORKS (1)
AUTOMATIC SUMMARIZATION (1)
AUTOMATIC TERM RECOGNITION (1)
AUTOMATIC THESES CLUSTERING METHOD (1)
BAYES METHODS (1)
BAYESIAN METHODS (1)
BILINGUAL AND SYNONYMOUS KEYWORD (1)
BILINGUAL GLOSSARY (1)
BILINGUAL KEYWORD (1)
BILINGUAL SEMANTIC LEXICON (1)
CHINESE AUTOMATIC TEXT SUMMARIZATION (1)
CHINESE KEYWORDS EXTRACTION (1)
CHINESE QUESTION CLASSIFICATION MODEL (1)
CHINESE SINGLE WORD TERM EXTRACTION (1)
CHINESE WORD SEGMENTATION (1)
CO-OCCURRENCE (1)
CO-TRAINING (1)
COMPUTERS (1)
CONTEXT (1)
CONTEXTUAL MATCHING (1)
CORRELATION (1)
COTRAINING CATEGORIZATION METHOD (1)
COWS (1)
CYBERNETICS (1)
DATABASE MANAGEMENT SYSTEMS (1)
DATABASES (1)
DICTIONARY-BASED METHODS (1)
DIGITAL DICTIONARY (1)
DISTRIBUTIONAL HYPOTHESIS (1)
DOCUMENT CLUSTERING (1)
DOCUMENT COMPARISON APPROACH (1)
DOCUMENT TEXT (1)
EIGENVALUES AND EIGENFUNCTIONS (1)
EIGENVECTOR DIMENSION (1)
EMPLOYMENT (1)
ENTROPY (1)
ERROR PROBLEM (1)
EXTRACTION (1)
FACTUAL QUESTION SENTENCE CLASSIFICATION (1)
FULL TEXT SEARCH (1)
GLOSSARIES (1)
GRAMMARS (1)
HOWNET (1)
HOWNET KNOWLEDGE DATABASE (1)
IMPROVED METHOD (1)
INFORMATION ENTROPY (1)
INTERROGATIVE (1)
KEYWORD EXTRACTION (1)
KEYWORD EXTRACTION ALGORITHM (1)
KEYWORD IDENTIFICATION (1)
KEYWORD SEARCH (1)
KEYWORD SET (1)
KEYWORD SIMILARITY (1)
KEYWORDS (1)
KEYWORDS EXTRACTION (1)
KEYWORDS EXTRACTION METHOD (1)
KEYWORDS IDENTIFICATION (1)
LABELLED DATA MINIMIZATION (1)
LANGUAGE MODELING (1)
LANGUAGE TRANSLATION (1)
LEXICAL CHAIN (1)
LEXICAL-CHAIN-BASED KEYWORDS EXTRACTION (1)
LIBRARIES (1)
LING-JOIN WEB SEARCH (1)
LINUX (1)
LITERATURE (1)
LJPARSER (1)
MACHINE LEARNING (1)
MANUALS (1)
MARINE LITERATURE CATEGORIZATION (1)
MARKOV PROCESSES (1)
MATHEMATICAL MODEL (1)
MEDICAL TEXT (1)
MIDDLEWARE (1)
MULTIPLE LANGUAGE SEARCH (1)
MULTIPLE SEMANTIC FEATURE INTEGRATION (1)
MUTUAL INFORMATION (1)
NAIVE BAYES (1)
NAMED ENTITIES (1)
NATURAL LANGUAGE UNDERSTANDING (1)
NEW WORDS DETECTION (1)
more

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options