Advanced search

Advanced search in people

From:

To:

Items from 1 to 6 out of 6 results

chapter

A Hybrid Method for XML Clustering

Yong Piao, Chen Liu, Xiu-kun Wang

2010 3rd International Symposium on Parallel Architectures, Algorithms and Programming > 286 - 290

Third International Symposium on Parallel Architectures, Algorithms and Programming (PAAP 2010)

An effective XML cluster method called neighbor center clustering algorithm (NCC) is presented in this paper, whose similarity is obtained through both structural and content information contained in XML files. Structural similarity is measured by the idea of Longest Common Subsequence, while content similarity is achieved using TF-IDF principles. It reduces computation complexity by avoiding direct...

chapter

An efficient framework on large-scale video genre classification

Ning Zhang, Ling Guan

2010 IEEE International Workshop on Multimedia Signal Processing > 481 - 486

2010 IEEE 12th International Workshop on Multimedia Signal Processing (MMSP)

Efficient data mining and indexing is important for multimedia analysis and retrieval. In the field of large-scale video analysis, effective genre categorization plays an important role and serves one of the fundamental steps for data mining. Existing works utilize domain-knowledge dependent feature extraction, which is limited from genre diversification as well as data volume scalability. In this...

chapter

Dynamic K-Nearest-Neighbor with Distance and attribute weighted for classification

Jia Wu, Zhihua Cai, Zhechao Gao

2010 International Conference on Electronics and Information Engineering > 1 > V1-356 - V1-360

2010 International Conference on Electronics and Information Engineering (ICEIE 2010)

K-Nearest-Neighbor (KNN) as an important classification method based on closest training examples has been widely used in data mining due to its simplicity, effectiveness, and robustness. However, the class probability estimation, the neighborhood size and the type of distance function confronting KNN may affect its classification accuracy. Many researchers have been focused on improving the accuracy...

chapter

A k-Means-Based Projected Clustering Algorithm

Yufen Sun, Gang Liu, Kun Xu

2010 Third International Joint Conference on Computational Science and Optimization > 1 > 466 - 470

Third International Joint Conference on Computational Sciences and Optimization (CSO 2010)

In high dimensional data space, clusters are likely to exist in different subspaces. K-means is a classic clustering algorithm, but it cannot be used to find subspace clusters. In this paper, an algorithm called GKM is designed to generalize k-means algorithm for high dimensional data. In the objective function of GKM, we associate a weight vector with each cluster to indicate which dimensions are...

chapter

A Discretization Algorithm of Continuous Attributes Based on Supervised Clustering

Haiyang Hua, Huaici Zhao

2009 Chinese Conference on Pattern Recognition > 1 - 5

2009 Chinese Conference on Pattern Recognition. (CCPR 2009) and the First CJK Joint Workshop on Pattern Recognition (CJKPR)

Many machine learning algorithms can be applied only to data described by categorical attributes. So discretization of continuous attributes is one of the important steps in preprocessing of extracting knowledge. Traditional discretization algorithms based on clustering need a pre-determined clustering number k, also typically are applied in an unsupervised learning framework. This paper describes...

chapter

Name Disambiguation Using Semantic Association Clustering

Hai Jin, Li Huang, Pingpeng Yuan

2009 IEEE International Conference on e-Business Engineering > 42 - 48

2009 IEEE International Conference on e-Business Engineering. ICEBE 2009

Due to homonyms, abbreviations, etc., name ambiguity is widely available in Web and e-document. For example, when integrating heterogeneous literature databases, because there are different name specifications, different authors may be thought of as the same author, and vice versa. Therefore, name ambiguity makes data robust even dirty and lowers the precision of information retrieval. In this paper,...

Filter options

Keywords:
ACCURACY
PATTERN CLUSTERING
DATA MINING
MATHEMATICAL MODEL

Publication date

Set your own date range

Content availability

Available (5)
None (1)

Keywords

CLUSTERING ALGORITHMS (4)
ALGORITHM DESIGN AND ANALYSIS (2)
CLASSIFICATION ALGORITHMS (2)
EQUATIONS (2)
FEATURE EXTRACTION (2)
HEURISTIC ALGORITHMS (2)
ATTRIBUTE WEIGHTED (1)
ATTRIBUTE WEIGHTED CLASSIFICATION (1)
BOTTOM-UP TWO-LAYER K-MEANS CLUSTERING (1)
BOW MODEL (1)
CATEGORICAL ATTRIBUTES (1)
CATEGORY THEORY (1)
CITESSEER (1)
CLASS DISTRIBUTION (1)
CLASSIFICATION ACCURACY (1)
CLOSEST TRAINING (1)
CLUSTING (1)
COMPLEXITY THEORY (1)
COMPUTATION COMPLEXITY (1)
COMPUTATIONAL COMPLEXITY (1)
COMPUTATIONAL MODELING (1)
CONTENT INFORMATION (1)
CONTINUOUS ATTRIBUTES (1)
DATA DIVERSITY (1)
DATA VOLUME SCALABILITY (1)
DBLP (1)
DISCRETIZATION ALGORITHM (1)
DISCRETIZATION SCHEMA (1)
DISTANCE WEIGHTED (1)
DISTANCE WEIGHTED CLASSIFICATION (1)
DOCUMENT HANDLING (1)
DOMAIN-KNOWLEDGE INDEPENDENT DESCRIPTORS (1)
DYNAMIC (1)
E-DOCUMENT (1)
EXTENSIBLE MARKUP LANGUAGE (1)
F-MEASURE VALUE (1)
GENERALIZE K-MEANS ALGORITHM (1)
GENRE DIVERSIFICATION (1)
GPU HARDWARE (1)
GRAPHICS PROCESSING UNIT (1)
HETEROGENEOUS LITERATURE DATABASE (1)
HIGH DIMENSIONS (1)
HISTOGRAM-BASED DISTRIBUTION (1)
HISTOGRAMS (1)
IMAGE CLASSIFICATION (1)
INDEXING (1)
INFORMATION RETRIEVAL (1)
INNOVATIVE CODEBOOK GENERATION (1)
K-MEANS (1)
K-MEANS CLUSTERING (1)
K-NEAREST NEIGHBOR CLASSIFIER (1)
K-NEAREST NEIGHBOR METHOD (1)
K-NEAREST-NEIGHBOR (1)
KNOWLEDGE ACQUISITION (1)
KNOWLEDGE EXTRACTION (1)
LARGE-SCALE VIDEO GENRE CLASSIFICATION (1)
LEARNING (ARTIFICIAL INTELLIGENCE) (1)
LIBRA (1)
LONGEST COMMON SUBSEQUENCE (1)
MACHINE LEARNING ALGORITHMS (1)
MERGING (1)
MIXED DYNAMIC METHOD (1)
MODIFIED LATENT DIRICHLET ALLOCATION BASED DISTRIBUTION (1)
MULTIMEDIA ANALYSIS (1)
MULTIMEDIA RETRIEVAL (1)
NAME DISAMBIGUATION (1)
NAME SPECIFICATION (1)
NEAREST NEIGHBOR SEARCHES (1)
NEIGHBOR CENTER CLUSTER (1)
NEIGHBOR CENTER CLUSTERING (1)
NEIGHBORHOOD SIZE (1)
OBJECTIVE FUNCTION (1)
PARTITIONING ALGORITHMS (1)
PATTERN CLASSIFICATION (1)
PERIODIC STRUCTURES (1)
PROBABILITY DENSITY FUNCTION (1)
PROBABILITY ESTIMATION (1)
PROJECTED CLUSTERING (1)
PROJECTED CLUSTERING ALGORITHM (1)
SAND (1)
SCALE INVARIANT FEATURE TRANSFORM LOCAL DESCRIPTOR (1)
SEMANTIC ASSOCIATION (1)
SEMANTIC ASSOCIATION CLUSTERING (1)
SEMANTIC ASSOCIATION-BASED NAME DISAMBIGUATION METHOD (1)
STATISTICAL ANALYSIS (1)
STRUCTURAL INFORMATION (1)
STRUCTURAL SIMILARITY (1)
SUPERVISED CLUSTERING (1)
SUPERVISED X-MEANS (1)
SX-MEANS (1)
TEXT ANALYSIS (1)
TEXT MINING (1)
TF-IDF PRINCIPLE (1)
TRAINING (1)
TRAINING DATA (1)
UNSUPERVISED LEARNING (1)
more

INFONA - science communication portal

Advanced search

Advanced search in people

A Hybrid Method for XML Clustering

An efficient framework on large-scale video genre classification

Dynamic K-Nearest-Neighbor with Distance and attribute weighted for classification

A k-Means-Based Projected Clustering Algorithm

A Discretization Algorithm of Continuous Attributes Based on Supervised Clustering

Name Disambiguation Using Semantic Association Clustering

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options