Search results

Items from 1 to 13 out of 13 results

chapter

Supporting Web Search with Near Keywords

Hanxiong Chen, K. Yamamoto, K. Furuse, N. Ohbo

Fourth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2007) > 2 > 411 - 415

2007 International Conference on Fuzzy Systems and Knowledge Discovery

database, cannot be applied to Web search. We propose a new method to support Web query refinement. Our method is based on local analysis, which clusters the search results. Unlike other clustering-based approaches, we take into consideration the distance between keywords, and guarantee no information loss. A Web search

chapter

Fuzzy named entity-based document clustering

T.H. Cao, H.T. Do, D.T. Hong, T.T. Quan

2008 IEEE International Conference on Fuzzy Systems (IEEE World Congress on Computational Intelligence) > 2028 - 2034

2008 IEEE 16th International Conference on Fuzzy Systems (FUZZ-IEEE)

Traditional keyword-based document clustering techniques have limitations due to simple treatment of words and hard separation of clusters. In this paper, we introduce named entities as objectives into fuzzy document clustering, which are the key elements defining document semantics and in many cases are of user

chapter

Weighted Feature Subset Non-negative Matrix Factorization and Its Applications to Document Understanding

Dingding Wang, Tao Li, Chris Ding

2010 IEEE International Conference on Data Mining > 541 - 550

2010 10th IEEE International Conference on Data Mining (ICDM 2010)

Keyword (Feature) selection enhances and improves many Information Retrieval (IR) tasks such as document categorization, automatic topic discovery, etc. The problem of keyword selection is usually solved using supervised algorithms. In this paper, we propose an unsupervised approach that combines keyword selection and

chapter

Document space dimension reduction by Latent Semantic Analysis and Hebbian neural network

I. Mokris, L. Skovajsova

2008 6th International Symposium on Intelligent Systems and Informatics > 1 - 4

2008 6th International Symposium on Intelligent Systems and Informatics (SISY 2008)

This paper presents the comparison of the text document space dimension reduction and the text document clustering and also the keyword space dimension reduction and keyword clustering by the latent semantic analysis and by the Hebbian neural network with Oja learning rule. Results of this neural network are compared

chapter

A k-Nearest-Neighbour Method for Classifying Web Search Results with Data in Folksonomies

Ching-man Au Yeung, N. Gibbins, N. Shadbolt

2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology > 1 > 70 - 76

2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology

Traditional Web search engines mostly adopt a keyword-based approach. When the keyword submitted by the user is ambiguous, the search result usually consists of documents related to various meanings of the keyword, while the user is probably interested in only one of them. In this paper we attempt to provide a solution to

chapter

An Efficient Algorithm for Clustering Search Engine Results

Hui Zhang, Bin Pang, Ke Xie, Hui Wu

2006 International Conference on Computational Intelligence and Security > 2 > 1429 - 1434

2006 International Conference on Computational Intelligence and Security

With the increasing number of Web documents on the Internet, the most popular keyword-matching-based search engines, such as Google, often return a long list of search results ranked by their relevance and importance to the query. Clustering the search engine results can help users find the results in several

chapter

Clustering Web Retrieval Results Accompanied by Removing Duplicate Documents

Xinye Li, Qinhai Yang, LinNa Zeng

2010 International Conference on Web Information Systems and Mining > 1 > 259 - 261

2010 International Conference on Web Information Systems and Mining (WISM 2010)

Since keyword-based search engines usually return a large number of results, among which there are many unrelated documents and many documents with the same content, automatic clustering technology is used to classify the retrieval results. When there is a large number of Web retrieval results, the clustering process usually

chapter

Multi-Document summarization based on improved features and clustering

Ying Xiong, Hongyan Liu, Lei Li

Proceedings of the 6th International Conference on Natural Language Processing and Knowledge Engineering (NLPKE-2010) > 1 - 5

2010 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE 2010)

two ways for exactly extracting keywords. Experimental results demonstrate that our improved method performs better than the traditional one.

chapter

The PARIS Algorithm for Determining Latent Topics

M Aharon, I Cohen, A Itskovitch, I Marhaim, more

2010 IEEE International Conference on Data Mining Workshops > 1092 - 1099

2010 10th IEEE International Conference on Data Mining Workshops (ICDMW 2010)

We introduce a new method for discovering latent topics in sets of objects, such as documents. Our method, which we call PARIS (for Principal Atoms Recognition In Sets), aims to detect principal sets of elements, representing latent topics in the data, that tend to appear frequently together. These latent topics, which we refer to as 'atoms', are used as the basis for clustering, classification, collaborative...

chapter

Web search result refinement by document clustering

Ming Hei Tsui, B. Lim, Daming Shi

2007 IEEE International Conference on Systems, Man and Cybernetics > 3081 - 3086

IEEE International Conference on Systems, Man and Cybernetics, 2007

A simple search keyword usually returns millions of search results. The result count may appear impressive, but at the same time it confuses users, who usually will not wish to browse through millions of entries. This paper proposes a query refinement method by iterative clustering of information from the Web page

article

A Fuzzy Ontological Knowledge Document Clustering Methodology

A. Trappey, C.V. Trappey, Fu-Chiang Hsu, D.W. Hsiao

IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics) > 2009 > 39 > 3 > 806 - 814

This correspondence presents a novel hierarchical clustering approach for knowledge document self-organization, particularly for patent analysis. Current keyword-based methodologies for document content management tend to be inconsistent and ineffective when partial meanings of the technical content are used for

chapter

Organizing Hidden-Web Databases by Clustering Visible Web Documents

L. Barbosa, J. Freire, A. Silva

2007 IEEE 23rd International Conference on Data Engineering > 326 - 335

2007 IEEE 23rd International Conference on Data Engineering

metadata, our approach is able to handle a wide range of forms, including content-rich forms that contain multiple attributes, as well as simple keyword-based search interfaces. An experimental evaluation over real Web data shows that our strategy generates high-quality clusters - measured both in terms of entropy and F

chapter

Self-organising map for document categorization using latent semantic analysis

B. Mahalakshmi, K. Duraiswamy

2010 International Conference on Innovative Computing Technologies (ICICT) > 1 - 6

2010 International Conference on Innovative Computing Technologies (ICICT)

With the increasing amount of unstructured content available electronically on the web, content categorization becomes very important for efficient information retrieval. The basic approaches for information retrieval in text documents are searching using keywords, categorization of the documents and filtering out the

Filter options

Keywords:
PATTERN CLUSTERING
DOCUMENT HANDLING

Publication type

book (12)
article (1)

Keywords

CLUSTERING ALGORITHMS (5)
INFORMATION RETRIEVAL (5)
INTERNET (5)
SEARCH ENGINES (4)
DOCUMENT CLUSTERING (3)
MATRIX DECOMPOSITION (3)
WEB SEARCH (3)
ARTIFICIAL NEURAL NETWORKS (2)
CLUSTERING METHODS (2)
DATABASE MANAGEMENT SYSTEMS (2)
FEATURE SELECTION (2)
FUZZY SET THEORY (2)
ITERATIVE METHODS (2)
KEYWORD EXTRACTION (2)
LATENT SEMANTIC ANALYSIS (2)
OPTIMIZATION (2)
QUERY REFINEMENT (2)
SEMANTICS (2)
2D GRID (1)
ALGORITHM DESIGN AND ANALYSIS (1)
ARCHITECTURE (1)
ATOMIC MEASUREMENTS (1)
AUTOMATIC CLUSTERING (1)
AUTOMATIC CONTROL (1)
AUTOMATIC DISCOVERY (1)
BRIDGES (1)
CHEMICAL ANALYSIS (1)
CLASSIFICATION (1)
CLASSIFICATION ALGORITHMS (1)
CLASSIFY DOCUMENT (1)
CLASSIFYING WEB SEARCH ENGINE (1)
CLUSTER (1)
CLUSTERING (1)
CLUSTERING METHOD (1)
CLUSTERING PROCESS (1)
COLLABORATIVE TAGGING (1)
COLLABORATIVE TAGGING SYSTEM (1)
COMPUTER SCIENCE (1)
CONFERENCES (1)
CONTENT MANAGEMENT (1)
CONTEXT (1)
COST FUNCTION (1)
DATA CLUSTERING (1)
DATA COLLECTION (1)
DATA MINING (1)
DATA MODELS (1)
DATA STRUCTURES (1)
DATA VISUALIZATION (1)
DICTIONARIES (1)
DOCUMENT CATEGORIZATION (1)
DOCUMENT DATABASE (1)
DOCUMENT SEMANTICS (1)
DUPLICATE DOCUMENTS (1)
ELECTRON TUBES (1)
ENTITY-BASED DOCUMENT CLUSTERING (1)
ENTROPY (1)
FEATURE SELECTION METHOD (1)
FINGERPRINT RECOGNITION (1)
FOLKSONOMY (1)
FREQUENCY MEASUREMENT (1)
FUZZY DOCUMENT CLUSTERING (1)
FUZZY INFERENCE CONTROL (1)
FUZZY INFORMATION VARIATION (1)
FUZZY LOGIC (1)
FUZZY ONTOLOGICAL KNOWLEDGE DOCUMENT CLUSTERING METHODOLOGY (1)
FUZZY SYSTEMS (1)
GOOGLE (1)
GROUPWARE (1)
HEBBIAN LEARNING (1)
HEBBIAN NEURAL NETWORK (1)
HEURISTIC ALGORITHMS (1)
HIDDEN MARKOV MODELS (1)
HIDDEN-WEB DATABASE (1)
HIERARCHICAL CLUSTERING (1)
HIGH DIMENSIONAL DOCUMENT VECTOR (1)
HYPERLINKED OBJECTS (1)
INDEXING (1)
INTELLECTUAL PROPERTY (1)
ITERATIVE CLUSTERING (1)
ITERATIVE PROCEDURE (1)
K-MEANS (1)
K-NEAREST-NEIGHBOUR METHOD (1)
KEY-FEATURE CLUSTERING (1)
KEYWORD AMBIGUITY (1)
KEYWORD BASED SEARCH ENGINE (1)
KEYWORD CLUSTERING (1)
KEYWORD MATCHING (1)
KEYWORD SELECTION (1)
KEYWORD SPACE DIMENSION REDUCTION (1)
KEYWORD-BASED DOCUMENT CLUSTERING TECHNIQUE (1)
KEYWORD-BASED VECTOR SPACE MODEL (1)
KNN (1)
LARGE SCALE INTEGRATION (1)
LATENT CONCEPTS (1)
LATENT SEMANTIC ANALYSIS (LSA) (1)
LATENT TOPICS DETERMINATION (1)
LSI (1)
MACHINE LEARNING (1)