Search results

chapter

Discovering “interesting” keyword patterns in Hadith chapter documents

Zuraini Zainol, Puteri N. E. Nohuddin, Mohd T. Hamid Jaymes, Syahaneim Marzukhi

2016 International Conference on Information and Communication Technology (ICICTM) > 104 - 108

2016 International Conference on Information and Communication Technology (ICICTM)

True Muslims around the world believe in Al Quran and Hadith. Al Quran is the principal religious text of Islam, a revelation from Allah. Hadith is also one of the fundamental sources of Islamic references and guidance for the Muslims after the Holy Book, Al-Quran. Hadith is referred to a report, statement, act, story, narration or discourse. Hadith, originally in Arabic, covers a wide range of issues...

chapter

Latent semantic analysis and keyword extraction for phishing classification

Gaston L'Huillier, Alejandro Hevia, Richard Weber, Sebastian Rios

2010 IEEE International Conference on Intelligence and Security Informatics > 129 - 131

2010 IEEE International Conference on Intelligence and Security Informatics (ISI 2010)

Phishing email fraud has been considered as one of the main cyber-threats over the last years. Its development has been closely related to social engineering techniques, where different fraud strategies are used to deceit a naïve email user. In this work, a latent semantic analysis and text mining methodology is proposed for the characterisation of such strategies, and further classification using...

chapter

A probabilistic relational model for keyword extraction

Alireza Pakgohar, Mohadeseh Khalili

2012 International Conference on Statistics in Science, Business and Engineering (ICSSBE) > 1 - 5

2012 International Conference on Statistics in Science, Business and Engineering (ICSSBE2012)

depend probabilistically both on other properties of that object and on properties of related objects. In this paper an attempt is made to heed keywords extraction. The keywords are not only essential for academic papers but also important for web page retrieval, text mining, and document classification. In this paper, a C

chapter

Totally automated keyword extraction

Tayfun Pay

2016 IEEE International Conference on Big Data (Big Data) > 3859 - 3863

2016 IEEE International Conference on Big Data (Big Data)

We develop and analyze an unsupervised and domain-independent method for extracting keywords from single documents. Our approach differs from the previous ones in the way of identifying candidate keywords, pruning the list of candidate keywords with several filtering heuristics and selecting keywords from the list of

chapter

Keyword Extraction from Documents Using a Neural Network Model

Taeho Jo, Malrey Lee, Thomas Gatton

2006 International Conference on Hybrid Information Technology > 2 > 194 - 197

2006 International Conference on Hybrid Information Technology

A document surrogate is usually represented in a list of words. Because not all words in a document reflect its content, it is necessary to select important words from the document that relate to its content. Such important words are called keywords and are selected with a particular equation based on Term Frequency

chapter

M2VSM: Extension of vector space model by introducing Meta keyword

Y. Takama, T. Ishibashi

2008 World Automation Congress > 1 - 6

2008 World Automation Congress

This paper proposes an extended vector space model (VSM), which is called M2VSM (meta keyword-based modified VSM). When conventional VSM is applied to document clustering, it is difficult to adjust the granularity of cluster in terms of topic. In order to solve the problem, M2VSM considers meta keywords such as

chapter

Keyword elicitation for patent retrieval by using bibliographic information

L.H. Wang, Y. R. Li

The 3rd International Conference on Data Mining and Intelligent Information Technology Applications > 163 - 167

2011 3rd International Conference on Data Mining and Intelligent Information Technology Applications (ICMiA)

Keyword selection is one of the most important tasks for patent retrieval. However, few researchers have focused on how to choose keywords appropriately in comparison with to improve retrieval performance via techniques from Bibliometics, such as patent counts, citation and so on. The paper has proposed, thus, a new

chapter

News Keyword Extraction for Topic Tracking

Sungjick Lee, Han-Joon Kim

2008 Fourth International Conference on Networked Computing and Advanced Information Management > 2 > 554 - 559

2008 Fourth International Conference on Networked Computing and Advanced Information Management (NCM)

This paper presents a keyword extraction technique that can be used for tracking topics over time. In our work, keywords are a set of significant words in an article that gives high-level description of its contents to readers. Identifying keywords from a large amount of on-line news data is very useful in that it can

chapter

Constructing an Issue Network from the Perspective of Common R&D Keywords

Namgyu Kim, William Wong Xiu Shun, Jieun Kim, Kee-Young Kwahk, more

2014 IEEE International Congress on Big Data > 772 - 773

2014 IEEE International Congress on Big Data (BigData Congress)

The demand for extracting keywords related to national issues from various sources and using them to retrieve R&D information has increased rapidly recently. In order to satisfy this demand, three methodologies are proposed in this study: a hybrid methodology for extracting and integrating national issue

chapter

An Effective Keywords Extraction Method Based on Deleting Actor Index

Yan Tang, Ruyi Kan

2010 International Conference on Web Information Systems and Mining > 1 > 227 - 231

2010 International Conference on Web Information Systems and Mining (WISM 2010)

Keywords Extraction plays a very important role in the text-mining domain, since the keywords can represent the asserted main point in a document. Based on the term network and deleting actor index, an effective keywords extraction algorithm is proposed to extract high frequent terms as well as important terms with

chapter

TF-IDF method in ranking keywords of Instagram users' image captions

Bernardus Ari Kuncoro, Bambang Heru Iswanto

2015 International Conference on Information Technology Systems and Innovation (ICITSI) > 1 - 5

2015 International Conference on Information Technology Systems and Innovation (ICITSI)

propose Term-Frequency and Inverse Document Frequency (TF-IDF) method to rank keywords of top twenty most followed Instagram users based on image captions of Instagram. The objective of this research is to automatically know the main idea of Instagram users based on 50 recent image captions posted. In our experiments, TF-IDF

chapter

The study on keywords frequency composite function of public opinion toward Macau's gambling industry: Using the Fruit Fly Optimization Algorithm

Shianghau Wu, Yongdong Shi

2013 International Conference on Engineering, Management Science and Innovation (ICEMSI) > 1 - 3

2013 International Conference on Engineering, Management Science and Innovation (ICEMSI)

This study at first used the text mining method to analyze the keywords of the Chinese news reports related to Macan's gambling industry from June to September 2012. The study got 19 major keywords at the first step. In order to comprehend the influence of each keyword in each document, the study applied the Fruit Fly

chapter

Combining Statistical Machine Learning Models to Extract Keywords from Chinese Documents

Chengzhi Zhang

Lecture Notes in Computer Science > Advanced Data Mining and Applications > Short Papers > 745-754

Keywords are subset of words or phrases from a document that can describe the meaning of the document. Many text mining applications can take advantage from it. Unfortunately, a large portion of documents still do not have keywords assigned. On the other hand, manual assignment of high quality keywords is time

chapter

Mapping the Intellectual Structure of Enterprise S&T Text: A Case Study

Hongjiang Yue, Ningsheng Hu

2009 Second Pacific-Asia Conference on Web Mining and Web-based Application > 237 - 241

2009 Second Pacific-Asia Conference on Web Mining and Web-based Application (WMWA)

literature infrastructure was obtained using bibliometrics and literature of the co-keyword network was visualized. It show how co-word analysis techniques can be used to study R&D in enterprises. The results of the study can help support strategic decision-making on the direction of S&T programs in enterprises.

chapter

Document classification efficiency of phrase-based techniques

N. Kapalavayi, S.N.J. Murthy, Gongzhu Hu

2009 IEEE/ACS International Conference on Computer Systems and Applications > 174 - 178

2009 7th IEEE/ACS International Conference on Computer Systems and Applications (AICCSA-2009)

Due to the exponential growth of available text documents in digital form, it is of great importance to develop techniques for automatic document classification based on the textual contents. Earlier document classification techniques have used keyword-based features and related statistics to achieve good results when

chapter

Identifying Documentation of Delirium in Clinical Notes through Topic Modeling

Yijun Shao, Charlene Weir, Qing Zeng-Treitler, Nicolette Estrada

2015 International Conference on Healthcare Informatics > 335 - 340

2015 International Conference on Healthcare Informatics (ICHI)

Pittsburgh dataset. We experimented with 3 different topic modeling methods including LDA and 2 ICD-based methods and a keyword search method for the identification of delirium related documents and sentences in clinical notes. As expected, the keyword search method is highly specific but insufficiently sensitive when searching

chapter

Extraction and Annotation of Personal Cliques from Social Networks

Maike Erdmann, Tomoya Takeyoshi, Gen Hattori, Chihiro Ono

2012 IEEE/IPSJ 12th International Symposium on Applications and the Internet > 172 - 177

2012 IEEE/IPSJ 12th International Symposium on Applications and the Internet (SAINT)

this problem by automatically dividing the social network of a Twitter user into personal cliques, and annotating each clique with keywords to identify the common ground of a clique. Our proposed clique annotation method extracts keywords from the tweet history of the clique members and individually weights the extracted

chapter

Richness evaluation of blogs on its topics using a generative model and probabilistic analysis

Jinhee Park, Jaedong Lee, Hye-Wuk Jung, Jee-Hyong Lee

The 6th International Conference on Soft Computing and Intelligent Systems, and The 13th International Symposium on Advanced Intelligence Systems > 381 - 385

2012 Joint 6th Intl. Conference on Soft Computing and Intelligent Systems (SCIS) and 13th Intl. Symposium on Advanced Intelligent Systems (ISIS)

Nowadays, blogs are one of important web services to publish and share various information. Accordingly, evaluation of various keywords in blogs is one of the important research topics for effective and efficient classification and retrieval of blogs in the blogosphere. In this paper, we propose a method to identify

chapter

Semantic graph based approach for text mining

Chandra Shekhar Yadav, Aditi Sharan, Manju Lata Joshi

2014 International Conference on Issues and Challenges in Intelligent Computing Techniques (ICICT) > 596 - 601

2014 International Conference on Issues and Challenges in Intelligent Computing Techniques (ICICT)

document. We think that our graph captures many properties of the text documents and can be used for different application in the field of text mining and NLP, such as keyword extraction and to know the nature of the document. Our approach to construct a semantic graph is independent of any language. We performed an

chapter

Reviewer Profiling Using Sparse Matrix Regression

E E Papalexakis, N D Sidiropoulos, M N Garofalakis

2010 IEEE International Conference on Data Mining Workshops > 1214 - 1219

2010 10th IEEE International Conference on Data Mining Workshops (ICDMW 2010)

-specific keywords. An automated profiling algorithm is proposed for this purpose, which starts from generic/noisy reviewer profiles extracted using Google Scholar and derives custom conference-centric reviewer and paper profiles. Each reviewer is expert on few sub-topics, whereas the pool of reviewers and the conference may

INFONA - science communication portal

Search results

Discovering “interesting” keyword patterns in Hadith chapter documents

Latent semantic analysis and keyword extraction for phishing classification

A probabilistic relational model for keyword extraction

Totally automated keyword extraction

Keyword Extraction from Documents Using a Neural Network Model

M2VSM: Extension of vector space model by introducing Meta keyword

Keyword elicitation for patent retrieval by using bibliographic information

News Keyword Extraction for Topic Tracking

Constructing an Issue Network from the Perspective of Common R&D Keywords

An Effective Keywords Extraction Method Based on Deleting Actor Index

TF-IDF method in ranking keywords of Instagram users' image captions

The study on keywords frequency composite function of public opinion toward Macau's gambling industry: Using the Fruit Fly Optimization Algorithm

Combining Statistical Machine Learning Models to Extract Keywords from Chinese Documents

Mapping the Intellectual Structure of Enterprise S&T Text: A Case Study

Document classification efficiency of phrase-based techniques

Identifying Documentation of Delirium in Clinical Notes through Topic Modeling

Extraction and Annotation of Personal Cliques from Social Networks

Richness evaluation of blogs on its topics using a generative model and probabilistic analysis

Semantic graph based approach for text mining

Reviewer Profiling Using Sparse Matrix Regression

Filter options

Publication date

Content availability

Keywords

Data set

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Data set

Reporting an error / abuse

Sending the report failed

Accessibility options