Text analysis of a web page is more difficult than the analysis of the text of a normal document due to the presence of additional information such as HTML structure, styling codes, irrelevant text, and hyperlinks. In this paper, we propose an unsupervised method to extract keywords from a web page. The
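A minimal sketch of such a pipeline, assuming the page arrives as raw HTML: drop script/style blocks, strip the remaining tags, and rank the surviving terms by frequency. The stopword list and the frequency scoring below are illustrative stand-ins, not the paper's actual method.

```python
# Hedged sketch: unsupervised keyword extraction from raw HTML.
# Tag stripping + term frequency only; not the paper's algorithm.
import re
from collections import Counter

STOPWORDS = {"the", "and", "for", "that", "with", "from", "this"}

def extract_keywords(html: str, top_k: int = 10) -> list[str]:
    # Remove script/style blocks, then strip all remaining tags.
    text = re.sub(r"(?s)<(script|style).*?</\1>", " ", html)
    text = re.sub(r"<[^>]+>", " ", text)
    tokens = [t.lower() for t in re.findall(r"[A-Za-z]{3,}", text)]
    counts = Counter(t for t in tokens if t not in STOPWORDS)
    return [word for word, _ in counts.most_common(top_k)]

page = ("<html><body><h1>Keyword extraction</h1>"
        "<p>Extracting keywords from web pages.</p></body></html>")
print(extract_keywords(page))
```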
The Internet is becoming an increasingly important platform for everyday life and work. It is expected that keyword extraction can help people quickly find hot spots on the web, since the keywords in a document provide important information about its content. In this paper, we propose to use text clustering
Keywords are indexed automatically for large-scale categorization corpora. Indexed keywords appearing in more than 20 documents are selected as seed words, thus overcoming the subjectivity of selecting seed words for clustering; at the same time, clustering is limited to the corpora of particular categories, and keywords indexed feature
We consider topic detection without any prior knowledge of category structure or possible categories. Keywords are extracted and clustered based on different similarity measures using the induced k-bisecting clustering algorithm. Evaluation on Wikipedia articles shows that clusters of keywords correlate strongly with
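The snippet does not spell out the induced k-bisecting algorithm or its similarity measures; as a rough stand-in, a plain bisecting k-means over keyword vectors illustrates the repeated two-way splitting it builds on.

```python
# Hedged sketch: generic bisecting k-means (not the paper's induced variant).
# X holds one vector per keyword; the largest cluster is repeatedly split in two.
import numpy as np
from sklearn.cluster import KMeans

def bisecting_kmeans(X: np.ndarray, k: int, seed: int = 0) -> list[np.ndarray]:
    clusters = [np.arange(X.shape[0])]                 # one cluster of all rows
    while len(clusters) < k:
        largest = max(range(len(clusters)), key=lambda i: clusters[i].size)
        idx = clusters.pop(largest)
        half = KMeans(n_clusters=2, n_init=10, random_state=seed).fit_predict(X[idx])
        clusters += [idx[half == 0], idx[half == 1]]
    return clusters                                    # list of row-index arrays
```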
This paper proposes a novel method to generate labels for grouping and organizing the search results returned by auxiliary search engines. It applies statistical techniques to measure the co-occurrence of keywords, forming a label matrix from them, and then agglomerates them into higher-level
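A sketch of the co-occurrence step under simple assumptions: count how often keyword pairs appear in the same result snippet, then agglomerate keywords whose co-occurrence is high. The paper's label-selection heuristics are not reproduced.

```python
# Hedged sketch: keyword co-occurrence matrix + agglomerative grouping.
import numpy as np
from sklearn.cluster import AgglomerativeClustering

def cooccurrence(snippets: list[set[str]], vocab: list[str]) -> np.ndarray:
    pos = {w: i for i, w in enumerate(vocab)}
    M = np.zeros((len(vocab), len(vocab)))
    for words in snippets:
        present = [pos[w] for w in words if w in pos]
        for i in present:
            for j in present:
                if i != j:
                    M[i, j] += 1                # pair seen in the same snippet
    return M

vocab = ["cluster", "keyword", "search", "label"]
snippets = [{"cluster", "keyword"}, {"search", "label"},
            {"cluster", "keyword", "label"}]
M = cooccurrence(snippets, vocab)
dist = M.max() - M                              # frequent co-occurrence = small distance
np.fill_diagonal(dist, 0.0)
groups = AgglomerativeClustering(n_clusters=2, metric="precomputed",
                                 linkage="average").fit_predict(dist)
print(dict(zip(vocab, groups)))
```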
addition, we use a keyword extraction method based on the maximum entropy model to discard useless information. The experimental results show that the keyword extraction algorithm achieves 70% precision, and that the conditional-probability-based algorithm is more precise than the token-based algorithm. HIMA
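Maximum entropy classification coincides with logistic regression, so the filtering step can be sketched as follows; the candidate features (frequency, first-occurrence position, term length) and the toy training data are assumptions, not the paper's setup.

```python
# Hedged sketch: a maximum-entropy (logistic regression) keyword filter.
# Features per candidate term: [frequency, relative first position, length].
import numpy as np
from sklearn.linear_model import LogisticRegression

X_train = np.array([[12, 0.05, 9],   # frequent, appears early, long -> keyword
                    [ 1, 0.90, 3],   # rare, appears late, short    -> noise
                    [ 8, 0.10, 7],
                    [ 2, 0.80, 4]])
y_train = np.array([1, 0, 1, 0])     # 1 = keyword, 0 = useless

maxent = LogisticRegression().fit(X_train, y_train)
print(maxent.predict_proba([[10, 0.08, 8]])[0, 1])   # keyword probability
```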
Since keyword-based search engines usually return a large number of results, among which there are many unrelated documents and many documents with the same content, automatic clustering technology is used to classify the retrieval results. Because there is a large volume of Web retrieval results, the clustering process usually
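A sketch under simple assumptions: exact duplicates are hashed away first, then the remaining snippets are clustered with TF-IDF and k-means. A real system would also need near-duplicate detection.

```python
# Hedged sketch: dedupe retrieval results, then cluster them.
from hashlib import sha1
from sklearn.cluster import KMeans
from sklearn.feature_extraction.text import TfidfVectorizer

results = ["web page clustering survey", "web page clustering survey",
           "keyword extraction from tweets", "detecting phishing web sites",
           "phishing site detection with classifiers"]
unique = list({sha1(r.encode()).hexdigest(): r for r in results}.values())

X = TfidfVectorizer().fit_transform(unique)
labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X)
for label, doc in sorted(zip(labels, unique)):
    print(label, doc)
```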
This paper proposes a system for finding a user's interests on the Internet. It is based on the user's browsing behavior and the contents of the visited pages. The system has two features. One is that it builds the user's browsing interests implicitly, as multiple keyword vectors, one per interest. The other is that it can
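A minimal sketch of "one keyword vector per interest", assuming the visited pages are available as plain text: cluster the pages, then take each cluster centroid's top TF-IDF terms as one interest vector. The implicit, behavior-driven parts of the system are not reproduced.

```python
# Hedged sketch: derive one keyword vector per browsing interest.
import numpy as np
from sklearn.cluster import KMeans
from sklearn.feature_extraction.text import TfidfVectorizer

pages = ["python pandas dataframe tutorial", "numpy array broadcasting guide",
         "marathon training plan for beginners", "trail running shoe review"]
vec = TfidfVectorizer()
X = vec.fit_transform(pages)
km = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)

terms = np.array(vec.get_feature_names_out())
for c, centroid in enumerate(km.cluster_centers_):
    top = terms[np.argsort(centroid)[::-1][:3]]       # strongest terms = interest
    print(f"interest {c}:", list(top))
```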
Web 2.0 tools and environments have made tagging, the act of assigning keywords to on-line objects, a popular way to annotate shared resources. The success of now-prominent tagging systems makes tagging "the natural way for people to classify objects as well as an attractive way to discover new material". One of the
users to sift through and find relevant information. The information retrieval techniques commonly used are based on keywords. These techniques use keyword lists to describe the content of information, but one problem with such lists is that they say nothing about the semantic relationships between keywords, nor do they
quality of text-mined data, while efficacy relied on the context of the chosen techniques. Although developments in automated keyword extraction methods have made differences in the quality of data selection, the efficacy of Natural Language Processing (NLP) methods using verified keywords remains a challenge. In this
event can be effortlessly found using keyword matching, but there are numerous tweets that are likely to contain semantically identical information. Moreover, many systems exist for summarizing tweets related to a particular event, but they have numerous limitations and are unable to provide accurate
keyword specified by the investigator or suggested by the system. Experiments were conducted on a dummy crime dataset to test the accuracy and scalability of the proposed system. The experimental results showed that subject suggestion improved accuracy and thus sped up the process of searching for evidence.
title, keyword, and link-text information to represent the website. Heterogeneous classifiers are then built based on these different features. We propose a principled ensemble classification algorithm to combine the predictions of the different phishing detection classifiers. A hierarchical clustering technique has been
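The snippet does not give the combination rule, so the sketch below substitutes a simple weighted average of the per-classifier phishing probabilities; the classifier names and weights are illustrative assumptions, not the paper's principled algorithm.

```python
# Hedged stand-in for ensemble combination: weighted average of the
# phishing probabilities produced by heterogeneous classifiers.
def ensemble_score(probs: dict[str, float], weights: dict[str, float]) -> float:
    total = sum(weights.values())
    return sum(weights[name] * p for name, p in probs.items()) / total

probs = {"title_clf": 0.9, "keyword_clf": 0.7, "link_text_clf": 0.4}
weights = {"title_clf": 0.5, "keyword_clf": 0.3, "link_text_clf": 0.2}
print(ensemble_score(probs, weights))   # 0.74 -> above 0.5, flag as phishing
```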
This paper presents a summary of the experience obtained with a modified clustering algorithm based on Projective Adaptive Resonance Theory. The algorithm was proposed by the authors and was tested for text processing. Its possible usage is exemplified by text document clustering and the generation of keyword
integrating both low-level visual features and high-level textual keywords. Unfortunately, manual image annotation is a tedious process and may not be feasible for large image databases. To overcome this limitation, several approaches that can annotate images in a semi-supervised or unsupervised way have emerged. In this paper
In this paper, we examine the significance of expanding the user query with two techniques, namely Efficient Clustering-By-Direction and Theme Clustering. These two techniques produce clusters of the keywords extracted from the set of documents retrieved for the user query. The former clustering is based on
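Neither Efficient Clustering-By-Direction nor Theme Clustering is detailed in this snippet; the sketch below uses plain k-means as a stand-in to show the overall expansion flow: cluster the retrieved documents' terms, pick the cluster nearest the query, and append its top keywords.

```python
# Hedged sketch: query expansion from keyword clusters (k-means stand-in).
import numpy as np
from sklearn.cluster import KMeans
from sklearn.feature_extraction.text import TfidfVectorizer

retrieved = ["jaguar speed in the wild", "jaguar habitat and prey",
             "jaguar car engine specs", "jaguar dealership prices"]
query = "jaguar habitat"

vec = TfidfVectorizer(stop_words="english")
X = vec.fit_transform(retrieved)
labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X)

# Pick the cluster whose centroid is most similar to the query.
q = vec.transform([query]).toarray().ravel()
centroids = [np.asarray(X[labels == k].mean(axis=0)).ravel() for k in (0, 1)]
best = int(np.argmax([cen @ q for cen in centroids]))

terms = np.array(vec.get_feature_names_out())
expansion = terms[np.argsort(centroids[best])[::-1][:3]]
print(query, "->", query, *expansion)
```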
as the services management. Existing methods for Web services clustering mostly focus on directly utilizing key features from WSDL documents, e.g., input/output parameters and keywords from the description text. The probabilistic topic model Latent Dirichlet Allocation (LDA) has also been adopted, which extracts latent topic features
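A sketch of the LDA step, assuming the WSDL description text has already been extracted: derive document-topic proportions and cluster the services in topic space. The service descriptions below are illustrative.

```python
# Hedged sketch: latent topic features for service descriptions, then clustering.
from sklearn.cluster import KMeans
from sklearn.decomposition import LatentDirichletAllocation
from sklearn.feature_extraction.text import CountVectorizer

descriptions = ["get weather forecast by city", "current temperature and wind speed",
                "convert currency exchange rate", "get euro to dollar rate"]
counts = CountVectorizer().fit_transform(descriptions)
topics = LatentDirichletAllocation(n_components=2, random_state=0).fit_transform(counts)
labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(topics)
print(labels)   # services grouped by latent topic rather than raw keywords
```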
done on a set of data chosen to form the basis, as is done with keywords. If the base data is chosen arbitrarily, the method is automatic, whereas if some 'knowledge' or 'background' informs the choice, it is adaptive. Statistical features of the images are extracted from the pixel map of the image. The extracted features are
fuzzy Euclidean distance clustering algorithm after applying the MeSH ontology to medical theses data, for better categorization of the keywords within the data.
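A compact fuzzy c-means with Euclidean distance, assuming the MeSH-mapped keywords have already been embedded as numeric vectors; the ontology-mapping step itself is omitted.

```python
# Hedged sketch: fuzzy c-means with Euclidean distance (numpy only).
import numpy as np

def fuzzy_c_means(X, c=2, m=2.0, iters=100, seed=0):
    rng = np.random.default_rng(seed)
    U = rng.dirichlet(np.ones(c), size=len(X))          # soft memberships (n x c)
    for _ in range(iters):
        W = U ** m
        centers = (W.T @ X) / W.sum(axis=0)[:, None]    # membership-weighted means
        d = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2) + 1e-9
        U = 1.0 / d ** (2 / (m - 1))                    # inverse-distance update
        U /= U.sum(axis=1, keepdims=True)
    return centers, U

X = np.array([[0.0, 0.1], [0.1, 0.0], [5.0, 5.1], [5.1, 4.9]])
centers, U = fuzzy_c_means(X)
print(U.round(2))   # each keyword vector's degree of membership per cluster
```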