Wyniki wyszukiwania

Pozycje od 1 do 17 spośród 17 wyników

rozdział

Keyword Extraction from Documents Using a Neural Network Model

Taeho Jo, Malrey Lee, Thomas Gatton

2006 International Conference on Hybrid Information Technology > 2 > 194 - 197

2006 International Conference on Hybrid Information Technology

A document surrogate is usually represented in a list of words. Because not all words in a document reflect its content, it is necessary to select important words from the document that relate to its content. Such important words are called keywords and are selected with a particular equation based on Term Frequency

rozdział

A modified approach to keyword extraction based on word-similarity

Meng Wenchao, Liu Lianchen, Dai Ting

2009 IEEE International Conference on Intelligent Computing and Intelligent Systems > 3 > 388 - 392

2009 IEEE International Conference on Intelligent Computing and Intelligent Systems (ICIS 2009)

Two keyword-extraction ways are usually used, one is simply using the information from exactly single word like word frequency and TF.IDF, the other is based on the relationship between words. The relationship is usually described as word similarity which derives from a corpus (WordNet, HowNet) or man-made thesaurus

artykuł

Keyword Extraction and Clustering for Document Recommendation in Conversations

Maryam Habibi, Andrei Popescu-Belis

IEEE/ACM Transactions on Audio, Speech, and Language Processing > 2015 > 23 > 4 > 746 - 759

This paper addresses the problem of keyword extraction from conversations, with the goal of using these keywords to retrieve, for each short conversation fragment, a small number of potentially relevant documents, which can be recommended to participants. However, even a short fragment contains a variety of words

rozdział

Hot keyword identification for extracting web public opinion

Zhiqi Fang, Yue Ning, Tingshao Zhu

5th International Conference on Pervasive Computing and Applications > 116 - 121

2010 5th International Conference on Pervasive Computing and Applications (ICPCA 2010)

Internet is becoming an increasingly important platform for ordinary life and work. It is expected that keyword extraction can help people quickly find hot spots on the web, since keywords in a document provide important information about the content of the document. In this paper, we propose to use text clustering

rozdział

An improved keyword extraction method using graph based random walk model

M.R. Islam, M.R. Islam

2008 11th International Conference on Computer and Information Technology > 225 - 229

2008 11th International Conference on Computer and Information Technology (ICCIT)

Keywords can be considered as condensed versions of documents, which can play important role in some text processing tasks such as text indexing, summarization and categorization. However, there are many digital documents especially on the Internet that do not have a list of assigned keywords. Assigning keywords to

rozdział

Automatic Chinese Keyword Extraction Based on KNN for Implicit Subject Extraction

Zhang Qingguo, Zhang Chengzhi

2008 International Symposium on Knowledge Acquisition and Modeling > 689 - 692

2008 International Symposium on Knowledge Acquisition and Modeling (KAM)

In this paper, a method of automatic Chinese keyword extraction based on KNN is proposed. Firstly, it preprocesses the document by vector space model. Secondly, it constructs a set of candidate keywords based on KNN method and the labeled dataset. Finally, it post-processes on candidate keywords by the character of

rozdział

News Keyword Extraction for Topic Tracking

Sungjick Lee, Han-Joon Kim

2008 Fourth International Conference on Networked Computing and Advanced Information Management > 2 > 554 - 559

2008 Fourth International Conference on Networked Computing and Advanced Information Management (NCM)

This paper presents a keyword extraction technique that can be used for tracking topics over time. In our work, keywords are a set of significant words in an article that gives high-level description of its contents to readers. Identifying keywords from a large amount of on-line news data is very useful in that it can

rozdział

Using Citation-KNN for Automatic Keyword Assignment

Chengzhi Zhang, Hongjiao Xu

2009 International Conference on Electronic Commerce and Business Intelligence > 131 - 134

2009 International Conference on Electronic Commerce and Business Intelligence, ECBI

Currently, the automatic keywords extraction method can only extract keywords appeared in the articles and it cannot extract the implicit keyword which does not appear in the articles. It is a difficult work to extract implicit keywords in an article in the task of automatic keywords extraction. This work can also be

rozdział

Extracting Keywords of Web Users' Interests and Visualizing their Routine Visits

T. Murata, K. Saito

2006 9th International Conference on Control, Automation, Robotics and Vision > 1 - 6

2006 9th International Conference on Control, Automation, Robotics and Vision

Analyzing users' Web log data and extracting their interests of Web-watching behaviors are important and challenging research topics of Web usage mining. Users visit their favorite sites and sometimes search new sites by performing keyword search on search engines. Users' Web-watching behaviors can be regarded as a

rozdział

Suggesting biomedical topics for unseen research articles based on MeSH descriptors

Chae-Gyun Lim, Byeong-Soo Jeong, Ho-Jin Choi

2015 International Conference on Big Data and Smart Computing (BIGCOMP) > 51 - 54

2015 International Conference on Big Data and Smart Computing (BigComp)

Due to the huge number of research articles in the biomedical domain, it becomes more and more important to develop methods to find relevant articles of our specific research interests. Keyword extraction is a useful method to find important topics from documents and summarize their major information. Unfortunately

rozdział

Study on question answering system for biomedical domain

Bo Xu, Hongfei Lin, Baoyan Liu

2009 IEEE International Conference on Granular Computing > 626 - 629

2009 IEEE International Conference on Granular Computing (GrC 2009)

This paper focuses on setting up a question-answering oriented biomedical domain, and it applies several different approaches to the different processing phases. Firstly, it uses shallow parser to identify the types of questions and extract the keywords, and the keywords are expanded with UMLS for the purpose of

rozdział

Intelligent information mining from veterinary clinical records and open source repository

P. Tangtulyangkul, T.S. Hocking, Chun Che Fung

TENCON 2009 - 2009 IEEE Region 10 Conference > 1 - 6

TENCON 2009. 2009 IEEE Region 10 Conference

utilizes text-mining, Web service technologies and domain knowledge, in order to extract keywords, to retrieve related records from an external source, and to filter the extracted keywords list. This study meets a practical challenge encountered at the School of Veterinary and Biomedical Sciences at Murdoch University. The

rozdział

An information arrangement technique for a text classification and summarization based on a summarization frame

S. Tsuchiya, E. Yoshimura, H. Watabe

2009 International Conference on Natural Language Processing and Knowledge Engineering > 1 - 5

2009 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE)

can be expected to be achieved in a QA system. Sentences are classified according to the content. Each classification is classified into a more detailed field. Important keywords are extracted from the sentences classified into the field. Moreover, the extracted keywords are classified into common and peculiar word for

rozdział

Building Concept Network-Based User Profile for Personalized Web Search

Han-joon Kim, Sungjick Lee, Byungjeong Lee, Sooyong Kang

2010 IEEE/ACIS 9th International Conference on Computer and Information Science > 567 - 572

2010 IEEE/ACIS 9th International Conference on Computer and Information Science (ICIS 2010)

FCA, a session interest concept is defined as a pair of extent and intent where the extent covers a set of documents selected by the user among the search results and the intent covers a set of keyword features extracted from the selected documents. And, in order to make a concept network grow, we need to calculate the

rozdział

Detection of Verbatim or Partial Duplication from Multiple Source Documents Using Data Mining Techniques and Case-Based Reasoning Methodologies

C Chaudhuri, A Chaudhuri

2011 Second International Conference on Emerging Applications of Information Technology > 129 - 132

Second International Conference on Emerging Applications of Information Technology (EAIT 2011)

. A third technique involves extraction of keywords and storing them in a properly indexed base. These then can serve the dual purpose of providing solutions to Lazy Learning classification for automatic subject-wise archiving and formation of relevant word sequences for detection of plagiarism using Association Rule

rozdział

News Contents Recommendation Model Based on Feedback of Web Usage

Ping Ni, Jianxin Liao, Xiaomin Zhu, Keyan Ren

2009 WRI World Congress on Computer Science and Information Engineering > 4 > 431 - 435

2009 WRI World Congress on Computer Science and Information Engineering, CSIE

In this paper, reclassification for the current classification through K-means would be implemented based on the feedback of Web usage mining in order to improve the accuracy of news recommendation and convergence of classification. It could extract most relative keywords and eliminate the disturbance of multi-vocal

rozdział

Study on similarity of simple questions based on the catering field

Qijun Dong, Shuicai Shi, Hongwei Wang, Xueqiang Lv

2009 International Conference on Natural Language Processing and Knowledge Engineering > 1 - 5

2009 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE)

The similarity between sentences is a theoretical basis and key technology to the question answering system. The method presented in this paper is as follows. Firstly, the dependency question sets are obtained and the key words are extracted from the major components of the question sentences and the target question form the related libraries, and then the candidate question sets are obtained through...

Opcje filtrowania

Słowa kluczowe:
DATA MINING
KEYWORD EXTRACTION
INFORMATION RETRIEVAL

Data publikacji

Ustaw własny zakres dat

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania

Dodaj adresata

Anulowanie wysłania wiadomości

Czy na pewno chcesz anulować wysłanie wiadomości?

Wyślij wiadomość

Opcje filtrowania

Data publikacji

Ustawianie zakresu dat

Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.

Dostępność treści

Typ publikacji

Słowa kluczowe

Zgłaszanie błędu / nadużycia

Nieudane wysłanie zgłoszenia

Ułatwienia dostępu