Search results

Items from 1 to 7 out of 7 results

chapter

Most Clusters Can Be Retrieved with Short Disjunctive Queries

Vinay Deolalikar

2013 IEEE 13th International Conference on Data Mining > 1019 - 1024

2013 IEEE International Conference on Data Mining (ICDM)

Simple keyword based searches are ubiquitous in today's internet age. It is hard to imagine an information system today that does not permit a simple keyword based search. This method of information retrieval has the obvious benefits of being highly interpretable, and having wide usage. However, a general perception

chapter

Weighted Feature Subset Non-negative Matrix Factorization and Its Applications to Document Understanding

Dingding Wang, Tao Li, Chris Ding

2010 IEEE International Conference on Data Mining > 541 - 550

2010 10th IEEE International Conference on Data Mining (ICDM 2010)

Keyword (Feature) selection enhances and improves many Information Retrieval (IR) tasks such as document categorization, automatic topic discovery, etc. The problem of keyword selection is usually solved using supervised algorithms. In this paper, we propose an unsupervised approach that combines keyword selection and

chapter

An Unsupervised Approach to Cluster Web Search Results Based on Word Sense Communities

Jiyang Chen, O.R. Zaiane, R. Goebel

2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology > 1 > 725 - 729

2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology

sense discovery problem. Given a query and a list of result pages, our unsupervised method detects word sense communities in the extracted keyword network. The documents are assigned to several refined word sense communities to form clusters. We use the modularity score of the discovered keyword community structure to

chapter

Document Clustering Method Based on Visual Features

Yucong Liu, Bofeng Zhang, Kun Xing, Bo Zhou

2011 International Conference on Internet of Things and 4th International Conference on Cyber, Physical and Social Computing > 458 - 462

2011 IEEE Int'l Conference on Internet of Things (iThings) & 4th IEEE Int'l Conference on Cyber, Physical and Social Computing (CPSCom)

appearance characteristics, so called visual features. This paper proposes a method to cluster the scientific documents based on visual features, so called VF-Clustering algorithm. Five kinds of visual features of documents are de-fined, including body, abstract, subtitle, keyword and title. The thought of crossover and

chapter

Text Categorization with Considering Temporal Patterns of Term Usages

H Abe, S Tsumoto

2010 IEEE International Conference on Data Mining Workshops > 800 - 807

2010 10th IEEE International Conference on Data Mining Workshops (ICDMW 2010)

In document categorization method by using similarity measures based on word vectors, it is important to determine key words to characterize each document. However, conventional methods select the key words based on their frequency or/and particular importance index such as tf-idf. In this paper, we propose a method to characterize each document by using temporal clusters of technical term usages...

chapter

Textual Document Clustering Using Topic Models

Xiaoping Sun

2014 10th International Conference on Semantics, Knowledge and Grids > 1 - 4

2014 Tenth International Conference on Semantics, Knowledge and Grids (SKG)

Document clustering is to group documents according to a certain semantic features defined on the document set for measuring the similarities between two documents. The keyword models such as the TFIDF model of document have been widely used as features for document clustering. But it lacks of semantic structure

chapter

KeyGraph and WordNet hypernyms for topic detection

Kasun Perera, Damitha Karunarathne

2015 12th International Joint Conference on Computer Science and Software Engineering (JCSSE) > 303 - 308

2015 12th International Joint Conference on Computer Science and Software Engineering (JCSSE)

from these data collections. KeyGraph is a word co-occurrence based algorithm for topic modeling. We provide an extension for KeyGraph algorithm by incorporating WordNet hypernyms for Keywords in the data collection. Our results show that incorporating hypernyms for KeyGraph algorithm would result improved topic and

Filter options

Keywords:
CLUSTERING ALGORITHMS
DOCUMENT CLUSTERING

Publication date

Set your own date range

Keywords

PATTERN CLUSTERING (3)
ALGORITHM DESIGN AND ANALYSIS (2)
FEATURE EXTRACTION (2)
SEMANTICS (2)
STANDARDS (2)
ANIMALS (1)
ARTIFICIAL NEURAL NETWORKS (1)
BIBLIOGRAPHICAL DOCUMENT (1)
CLUSTERING METHODS (1)
COMMUNITIES (1)
COMMUNITY MINING (1)
COMPUTATIONAL MODELING (1)
COMPUTER SCIENCE (1)
DATA CLUSTERING (1)
DATA MINING (1)
DATA VISUALIZATION (1)
DISJUNCTIVE QUERIES (1)
DOCUMENT CATEGORIZATION METHOD (1)
DOCUMENT HANDLING (1)
FEATURE SELECTION (1)
FREQUENCY CONVERSION (1)
FREQUENT ITEMSETS (1)
GENETIC ALGORITHM (1)
GENETIC ALGORITHMS (1)
GOLD (1)
HEURISTIC ALGORITHMS (1)
HIDDEN MARKOV MODELS (1)
INDEXES (1)
INFORMATION EXTRACTION (1)
INFORMATION RETRIEVAL (1)
INTERNET (1)
ITEMSETS (1)
JAVA (1)
K-MEANS (1)
KEYGRAPH (1)
KEYWORD DETERMINATION (1)
KEYWORD NETWORK (1)
KEYWORD SEARCH (1)
KEYWORD SELECTION (1)
LARGE SCALE INTEGRATION (1)
LATENT DIRICHLET ALLOCATION (LDA) (1)
MATRIX DECOMPOSITION (1)
MEASUREMENT (1)
MODULARITY SCORE (1)
NATURAL LANGUAGES (1)
NON-NEGATIVE MATRIX FACTORIZATION (1)
NONNEGATIVE MATRIX FACTORIZATION (1)
ONTOLOGIES (1)
OPTIMIZATION (1)
ORGANIZATIONS (1)
PAGE CLUSTERING (1)
PATTERN CLASSIFICATION (1)
PROBABILISTIC LOGIC (1)
PROBABILISTIC TOPIC MODEL (1)
QUERY PROCESSING (1)
QUERY SENSE IDENTIFICATION (1)
RELEVANCE FEEDBACK (1)
RELEVANT DOCUMENTS (1)
RETRIEVAL (1)
SEARCH ENGINES (1)
SECURITY (1)
SENSITIVITY ANALYSIS (1)
SIMILARITY MEASURE (1)
SOFTWARE (1)
TECHNICAL TERM USAGE (1)
TEMPORAL CLUSTER (1)
TEMPORAL PATTERN (1)
TEMPORAL PATTERNS (1)
TERM USAGE INDEX (1)
TEXT ANALYSIS (1)
TEXT CATEGORIZATION (1)
TEXT MINING (1)
TOPIC MODEL (1)
UNSUPERVISED FEATURE SELECTION (1)
UNSUPERVISED LEARNING (1)
UNSUPERVISED WEB SEARCH RESULT CLUSTERING (1)
USER NAVIGATION (1)
VECTORS (1)
VISUAL FEATURES (1)
VISUALIZATION (1)
VOCABULARY (1)
WEB PAGES (1)
WEB SEARCH RESULT ORGANIZATION (1)
WEIGHTED FEATURE SUBSET NON-NEGATIVE MATRIX FACTORIZATION (1)
WEIGHTED FEATURE SUBSET SELECTION (1)
WORD SENSE COMMUNITY (1)
WORD SENSE DISCOVERY PROBLEM (1)
WORD VECTOR (1)
WORDNET (1)
more

INFONA - science communication portal

Search results

Most Clusters Can Be Retrieved with Short Disjunctive Queries

Weighted Feature Subset Non-negative Matrix Factorization and Its Applications to Document Understanding

An Unsupervised Approach to Cluster Web Search Results Based on Word Sense Communities

Document Clustering Method Based on Visual Features

Text Categorization with Considering Temporal Patterns of Term Usages

Textual Document Clustering Using Topic Models

KeyGraph and WordNet hypernyms for topic detection

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options