Search results

chapter

A three-part input-output clustering-based approach to fuzzy system identification

Shin-Jye Lee, Xiao-Jun Zeng

2010 10th International Conference on Intelligent Systems Design and Applications > 55 - 60

10th International Conference on Intelligent Systems Design and Applications (ISDA 2010)

This article presents a clustering-based approach to fuzzy system identification. In order to construct an effective initial fuzzy model, this article tries to present a modular method to identify fuzzy systems based on a hybrid clustering-based technique. Moreover, the determination of the proper number of clusters and the appropriate location of clusters are one of primary considerations on constructing...

chapter

AK-Modes: A weighted clustering algorithm for finding similar case subsets

Lianhang Ma, Yefang Chen, Hao Huang

2010 IEEE International Conference on Intelligent Systems and Knowledge Engineering > 218 - 223

2010 IEEE International Conference on Intelligent Systems and Knowledge Engineering (ISKE 2010)

Finding similar crime case subsets is an important task for intelligence analysts in crime investigation. It can not only provide multiple clues to solve crimes but also improve efficiency to catch the criminals. However, the conventional approach by querying specific attributes in relational databases has two defects: first, it is relatively of poor efficiency when a lot of incidents have to be handled;...

chapter

A clustering-based approach on sentiment analysis

Gang Li, Fei Liu

2010 IEEE International Conference on Intelligent Systems and Knowledge Engineering > 331 - 337

2010 IEEE International Conference on Intelligent Systems and Knowledge Engineering (ISKE 2010)

This paper introduces the clustering-based sentiment analysis approach which is a new approach to sentiment analysis. By applying a TF-IDF weighting method, voting mechanism and importing term scores, an acceptable and stable clustering result can be obtained. It has competitive advantages over the two existing kinds of approaches: symbolic techniques and supervised learning methods. It is a well...

chapter

Experiences with discriminating TCP loss using K-Means clustering

M Sooriyabandara, P Kulkarni, Lu Li, T Lewis, more

2010 International Conference on Information and Communication Technology Convergence (ICTC) > 352 - 357

2010 International Conference on Information and Communication Technology Convergence (ICTC)

Protocols such as TCP depend on loss detection and recovery algorithms to provide a reliable data delivery service. TCP detects loss events using either retransmission timeout or receipt of duplicate acknowledgements. Since, TCP does not have any explicit knowledge about the cause of packet loss, it always treats it as a congestion indication and then adjusts sending rate conservatively to maintain...

chapter

Combining clustering coefficient-based active learning and semi-supervised learning on networked data

Xiaoqi He, Yangguang Liu, Bin Xu, Xiaogang Jin

2010 IEEE International Conference on Intelligent Systems and Knowledge Engineering > 305 - 309

2010 IEEE International Conference on Intelligent Systems and Knowledge Engineering (ISKE 2010)

Active learning and semi-supervised learning are both important techniques to improve the learned model using unlabeled data, when labeled data is difficult to obtain, and unlabeled data is available in large quantity and easy to collect. Combining active learning with a semi-supervised learning algorithm that uses Gaussian field and harmonic functions was suggested recently. This work showed that...

chapter

New Cluster Detection Based on Multi-Representation Index Tree Text Clustering

Hui Song, Lifeng Wang, Baiyan Li, Xiaoqiang Liu

2010 2nd International Workshop on Database Technology and Applications > 1 - 4

2010 2nd International Workshop on Database Technology and Applications (DBTA 2010)

Traditional Clustering is a powerful technique for revealing the "hot" topics among documents. However, it's hard to discover the new type events coming out gradually. In this paper, we propose a novel model for detecting new clusters from time-streaming documents. It consists of three parts: the cluster definition based on Multi-Representation Index Tree (MI-Tree), the new cluster detecting...

chapter

Improving GMM-based spectral conversion with optimal conversion function selection

Hsin-Te Hwang, Wen-Liang Wu, Sin-Horng Chen

2010 7th International Symposium on Chinese Spoken Language Processing > 392 - 396

7th International Symposium on Chinese Spoken Language Processing (ISCSLP 2010)

We address the problem in the conventional Gaussian mixture model (GMM)-based spectral conversion from the viewpoint of optimal conversion function selection. The proposed method is motivated by that if the optimal conversion function based on minimum mel-cepstral distortion (MMCD) criterion can be selected during the conversion stage, the conversion performance in terms of mel-cepstral distortion...

chapter

Active Learning for Co-Clustering Based Collaborative Filtering

Quang Thang Le, Minh Phuong Tu

2010 IEEE RIVF International Conference on Computing&Communication Technologies, Research, Innovation, and Vision for the Future (RIVF) > 1 - 4

2010 IEEE RIVF International Conference on Computing & Communication Technologies, Research, Innovation, and Vision for the Future (RIVF)

Collaborative filtering, a technique for making predictions about user preferences by exploiting behavior patterns of groups of users, has become a main prediction technique in recommender systems. One crucial problem for collaborative filtering algorithms is how best to know about the preferences of a new user, who has rated none or few examples. Active learning provides effective strategies to select...

chapter

Support Vector Machine ensembles using features distribution among subsets for enhancing microarray data classification

E Ahmed, N El-Gayar, I A El-Azab

2010 10th International Conference on Intelligent Systems Design and Applications > 1242 - 1246

10th International Conference on Intelligent Systems Design and Applications (ISDA 2010)

Support Vector Machines (SVMs) ensembles have been widely used to improve classification accuracy in complicated pattern recognition tasks. In this work we propose to apply an ensemble of SVMs coupled with feature-subset selection methods to aleviate the curse of dimensionality associated with expression-based classification of DNA microarray data. We compare the single SVM classifier to SVM ensembles...

chapter

A Personalized Recommendation Model Based on Social Tags

Xiufeng Xia, Shu Zhang, Xiaoming Li

2010 2nd International Workshop on Database Technology and Applications > 1 - 5

2010 2nd International Workshop on Database Technology and Applications (DBTA 2010)

In traditional e-commerce websites, social tags are used in product classification only, and not applied in the domain of personalized recommendation technology. In this paper, we propose a personalized recommendation model based on social tags. We build a user interest model for products by reflecting user interest and product features directly through social tags, and optimize the interest model...

chapter

An Improved Initialization Method for Clustering High-Dimensional Data

Yanping Zhang, Qingshan Jiang

2010 2nd International Workshop on Database Technology and Applications > 1 - 4

2010 2nd International Workshop on Database Technology and Applications (DBTA 2010)

Searching initial centers in high dimensional space is an interesting and important problem which is relevant for the wide various types of K-Means algorithm. However, this is a very difficult problem, due to the"curse of dimensionality"and the inherently sparse data.Algorithm IMSND is one of the latest initialization methods that are based on the idea of sharing neighborhood density. Concerning...

chapter

The Hybrid of Genetic Algorithms and K-Prototypes Clustering Approach for Classification

Chaochang Chiu, Huaichun Chi, Rueijiau Sung, Ju-Yun Yuang

2010 International Conference on Technologies and Applications of Artificial Intelligence > 327 - 330

2010 International Conference on Technologies and Applications of Artificial Intelligence (TAAI 2010)

This study proposes a novel classification technique of GA/k-prototypes in combination with a genetic algorithm to take the advantage of k-prototypes clustering mechanism for supporting the classification purpose. A genetic algorithm is used to adjust the weight applied to input attributes in order to enable a majority of the data records in each cluster to be with the same outcome class. We conduct...

chapter

An objective method to find better RBF networks in classification

Hyontai Sug

5th International Conference on Computer Sciences and Convergence Information Technology > 373 - 376

2010 5th International Conference on Computer Sciences and Convergence Information Technology (ICCIT 2010)

RBF networks are good at prediction tasks of data mining, and k-means clustering algorithm is one of the mostly used clustering algorithms for basis functions of RBF networks. K-means clustering algorithm needs the number of clusters for initialization, and depending on the number of clusters, the accuracy of RBF networks change. But we cannot resort to increasing the number of clusters in the RBF...

chapter

Text document clustering based on frequent concepts

R Baghel, R Dhir

2010 First International Conference On Parallel, Distributed and Grid Computing (PDGC 2010) > 366 - 371

2010 1st International Conference on Parallel, Distributed and Grid Computing (PDGC 2010)

This paper presents a novel technique of document clustering based on frequent concepts. The proposed FCDC (Frequent Concepts based Document Clustering), a clustering algorithm works with frequent concepts rather than frequent itemsets used in traditional text mining techniques. Many well known clustering algorithms deal with documents as bag of words while they ignore the important relationship between...

chapter

A Comparative Study on the Use of Correlation Coefficients for Redundant Feature Elimination

P A Jaskowiak, R J G B Campello, Thiago F Covões, E R Hruschka

2010 Eleventh Brazilian Symposium on Neural Networks > 13 - 18

2010 Eleventh Brazilian Symposium on Neural Networks (SBRN 2010)

Simplified Silhouette Filter (SSF) is a recently introduced feature selection method that automatically estimates the number of features to be selected. To do so, a sampling strategy is combined with a clustering algorithm that seeks clusters of correlated (potentially redundant) features. It is well known that the choice of a similarity measure may have great impact in clustering results. As a consequence,...

chapter

Algorithm of the Text Copy Detection Based on Topic Bag

Wang Sen, Wang Yu

2010 International Conference on Web Information Systems and Mining > 1 > 285 - 288

2010 International Conference on Web Information Systems and Mining (WISM 2010)

In order to resolve the current problem about seriously academic plagiarism in the web environment, this article proposes an algorithm of the text copy detection on the topic bag and the algorithm uses the idea of semantic clustering and multi-instance learning. Firstly, a paper is divided into three layers construction tree: a leaf node denotes a sentence; a branch node represents a topic bag, and...

chapter

An Improved Consensus Clustering for Nonnegative Matrix Factorization in Molecular Cancer Class Discovery

Weixiang Liu, Kehong Yuan, Tianfu Wang, Siping Chen

2010 Chinese Conference on Pattern Recognition (CCPR) > 1 - 4

2010 Chinese Conference on Pattern Recognition (CCPR 2010)

Recently nonnegative matrix factorization (NMF) has been proven powerful for nonnegative data analysis, especially in analyzing gene expression data. We propose an modified consensus clustering mechanism with soft sample assignment to improve the clustering accuracy. The idea is to use normalized inner product or cosine similarity matrix for the connectivity matrix of the consensus clustering. The...

chapter

Approach to Construct Cluster in Unstructured P2P Networks Based on Small-World Theory

Zhen Zhang

2010 Third International Symposium on Information Processing > 117 - 120

2010 Third International Symposium on Information Processing (ISIP 2010)

Node clustering has wide-ranging applications in decentralized P2P networks such as P2P file sharing systems, mobile ad-hoc networks, P2P sensor networks, and so forth. This paper proposes an approach to construct clusters in unstructured P2P networks based on small-world theory. In contrast to centralized graph clustering algorithms, our scheme is completely decentralized and it only uses the knowledge...

chapter

Learning human actions with an adaptive codebook

Yu Kong, Xiaoqin Zhang, Weiming Hu, Yunde Jia

2010 16th International Conference on Virtual Systems and Multimedia > 13 - 20

2010 16th International Conference on Virtual Systems and Multimedia (VSMM 2010)

Learning a compact and yet discriminative codebook for classifying human actions is a challenging problem. One difficulty lies in that the learning procedure is split into two independent phases (dimension reduction and clustering) and thus results in the loss of discriminative information which clustering requires. Besides, traditional used principal component analysis is not optimized for class...

chapter

An Application of Grey Relational Analysis in Clusterng

Der-Bang Wu, Hsiu-Lan Ma, J Wey Chen, Shun-Jyh Wu

2010 International Conference on Artificial Intelligence and Computational Intelligence > 2 > 557 - 561

2010 International Conference on Artificial Intelligence and Computational Intelligence (AICI 2010)

The grey relational analysis is widely used in many fields, such as education, decision-making in economics, marketing research, medicine, computer science, system modeling, social science, chemistry, management, etc. In this paper, the algorithms between grey relational analysis and fuzzy c-mean are compared. Finally, one real data set was applied to prove that the performance of the Grey Relational...

INFONA - science communication portal

Search results

A three-part input-output clustering-based approach to fuzzy system identification

AK-Modes: A weighted clustering algorithm for finding similar case subsets

A clustering-based approach on sentiment analysis

Experiences with discriminating TCP loss using K-Means clustering

Combining clustering coefficient-based active learning and semi-supervised learning on networked data

New Cluster Detection Based on Multi-Representation Index Tree Text Clustering

Improving GMM-based spectral conversion with optimal conversion function selection

Active Learning for Co-Clustering Based Collaborative Filtering

Support Vector Machine ensembles using features distribution among subsets for enhancing microarray data classification

A Personalized Recommendation Model Based on Social Tags

An Improved Initialization Method for Clustering High-Dimensional Data

The Hybrid of Genetic Algorithms and K-Prototypes Clustering Approach for Classification

An objective method to find better RBF networks in classification

Text document clustering based on frequent concepts

A Comparative Study on the Use of Correlation Coefficients for Redundant Feature Elimination

Algorithm of the Text Copy Detection Based on Topic Bag

An Improved Consensus Clustering for Nonnegative Matrix Factorization in Molecular Cancer Class Discovery

Approach to Construct Cluster in Unstructured P2P Networks Based on Small-World Theory

Learning human actions with an adaptive codebook

An Application of Grey Relational Analysis in Clusterng

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options