Search results

Items from 1 to 5 out of 5 results

chapter

Chinese Keyword Spotting Using Knowledge-Based Clustering

Yong Xia, Kuanquan Wang, Mingwei Li

2011 International Conference on Document Analysis and Recognition > 789 - 793

2011 International Conference on Document Analysis and Recognition (ICDAR)

images automatically. Cluster IDs are adopted to index the characters. A Dream of Red Mansions, a famous classical Chinese literature work including near one million characters, is used to evaluate the performance of Chinese keyword spotting. Experimental results confirm the effectiveness of knowledge-based clustering and

chapter

Efficient and effective ranking in Top-k exploration for keyword search on RDF

Roberto De Virgilio

2011 IEEE International Conference on Information Reuse & Integration > 66 - 70

2011 IEEE International Conference on Information Reuse & Integration (IRI)

Ranking solutions is an important issue in Information Retrieval because it greatly influences the quality of results. In this context, keyword based search approaches use to consider solutions sorting as least step of the overall process. Ranking and building solutions are completely separate steps running

chapter

Top-k computations in MapReduce: A case study on recommendations

Vasilis Efthymiou, Kostas Stefanidis, Eirini Ntoutsi

2015 IEEE International Conference on Big Data (Big Data) > 2820 - 2822

2015 IEEE International Conference on Big Data (Big Data)

single machine. Our motivating application is recommenders, which typically deal with big numbers of users and items, but other applications might benefit as well, like keyword search. In this paper, we propose a parallel top-k MapReduce algorithm that, unlike existing MapReduce solutions, manages to handle cases in which

chapter

Automatic textual aggregation approach of scientific articles in OLAP context

Bouakkaz Mustapha, Loudcher Sabine, Ouinten Youcef

2014 10th International Conference on Innovations in Information Technology (IIT) > 30 - 35

2014 10th International Conference on Innovations in Information Technology (INNOVATIONS)

aggregation function for textual data. Our approach is based on the affinity between keywords and uses the search of cycles in a graph to find the aggregated keywords. We also present performances and a comparison with three other methods. The experimental study shows good results for our approach.

article

An Entropy Weighting k-Means Algorithm for Subspace Clustering of High-Dimensional Sparse Data

Liping Jing, M.K. Ng, J.Z. Huang

IEEE Transactions on Knowledge and Data Engineering > 2007 > 19 > 8 > 1026 - 1041

of terms or keywords. The keywords for one cluster may not occur in the documents of other clusters. This is a data sparsity problem faced in clustering high-dimensional data. In the new algorithm, we extend the k-means clustering process to calculate a weight for each dimension in each cluster and use the weight values

Filter options

Keywords:
CLUSTERING ALGORITHMS
COMPLEXITY THEORY

Publication date

Set your own date range

Publication type

book (4)
article (1)

Keywords

AGGREGATION FUNCTION (1)
BENCHMARK TESTING (1)
BIG DATA (1)
BUILDINGS (1)
CLUSTERING METHODS (1)
CONTENT-BASED CHINESE KEYWORD SPOTTING (1)
CONTEXT (1)
DATA HANDLING (1)
DATA MODELS (1)
DATA SPARSITY PROBLEM (1)
DATABASES (1)
DISPERSION (1)
DOCUMENT IMAGE SYNTHESIS (1)
ENTROPY (1)
ENTROPY WEIGHTING K-MEANS ALGORITHM (1)
FEATURE EXTRACTION (1)
GRAPH (1)
HIGH-DIMENSIONAL DATA. (1)
HIGH-DIMENSIONAL OBJECT CLUSTERING (1)
HIGH-DIMENSIONAL SPARSE DATA (1)
IMAGE RETRIEVAL (1)
INDEXING (1)
KEYWORD SEARCH (1)
KNOWLEDGE BASED SYSTEMS (1)
KNOWLEDGE-BASED CLUSTERING (1)
K{\HBOX{-}}{\RM{MEANS}} CLUSTERING (1)
MEASUREMENT (1)
MINIMIZATION (1)
OBJECT RECOGNITION (1)
OLAP (1)
OPTICAL CHARACTER RECOGNITION SOFTWARE (1)
PATTERN CLUSTERING (1)
PRAGMATICS (1)
RADIATION DETECTORS (1)
RECOMMENDER SYSTEMS (1)
RESOURCE DESCRIPTION FRAMEWORK (1)
RUNTIME (1)
SEMANTICS (1)
SERIAL BATCH CLUSTERING (1)
SORTING (1)
SUBSPACE CLUSTERING (1)
TEXT CLUSTERING (1)
TEXTUAL DATA (1)
VARIABLE WEIGHTING (1)
more

INFONA - science communication portal

Search results

Chinese Keyword Spotting Using Knowledge-Based Clustering

Efficient and effective ranking in Top-k exploration for keyword search on RDF

Top-k computations in MapReduce: A case study on recommendations

Automatic textual aggregation approach of scientific articles in OLAP context

An Entropy Weighting k-Means Algorithm for Subspace Clustering of High-Dimensional Sparse Data

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options