chapter

Analysis of Deputy and Party Similarities through Hierarchical Clustering

G. Lefait, T. Kechadi

2010 Fourth International Conference on Digital Society > 264 - 268

2010 Fourth International Conference on the Digital Society (ICDS 2010)

Speeches delivered in the French Parliament by deputies and government members are analysed, similarities between individuals are induced by the word corpus used, and finally deputies are grouped through hierarchical clustering. Similarity measures between political individuals are compared on a classification task: assigning a party to each actor. Finally, this analysis leads to a new organisation...

chapter

Network Traffic Classification Using Semi-Supervised Approach

Amita Shrivastav, Aruna Tiwari

2010 Second International Conference on Machine Learning and Computing > 345 - 349

2nd International Conference on Machine Learning and Computing (ICMLC 2010)

A semi-supervised approach for classification of network flows is analyzed and implemented. This traffic classification methodology uses only flow statistics to classify traffic. Specifically, the semi-supervised method allows classifiers to be designed from training data consisting of only a few labeled and many unlabeled flows. The approach consists of two steps, clustering and classification...

article

Automatic Face Annotation in Personal Photo Collections Using Context-Based Unsupervised Clustering and Face Information Fusion

Jae Young Choi, Wesley De Neve, Y M Ro, K N Plataniotis

IEEE Transactions on Circuits and Systems for Video Technology > 2010 > 20 > 10 > 1292 - 1309

In this paper, a novel face annotation framework is proposed that systematically leverages context information such as situation awareness information with current face recognition (FR) solutions. In particular, unsupervised situation and subject clustering techniques have been developed that are aided by context information. Situation clustering groups together photos that are similar in terms of...

chapter

Authorship attribution of web forum posts

S R Pillay, T Solorio

2010 eCrime Researchers Summit > 1 - 7

2010 eCrime Researchers Summit (eCrime 2010)

Extracting useful information from user-generated text on the web is an important ongoing research area in natural language processing, machine learning, and data mining. Online tools like emails, news groups, blogs, and web forums provide an effective communication platform for millions of users around the globe and also provide an added advantage of anonymity. Millions of people post information on different...

chapter

A Novel Approach for High Dimensional Data Clustering

A. Alijamaat, M. Khalilian, N. Mustapha

2010 Third International Conference on Knowledge Discovery and Data Mining > 264 - 267

2010 3rd International Conference on Knowledge Discovery and Data Mining (WKDD 2010)

Clustering is considered the most important unsupervised learning problem. It aims to find some structure in a collection of unlabeled data. Dealing with a large quantity of data items can be problematic because of time complexity. On the other hand, high-dimensional data, e.g. time series data, is a challenging arena in data clustering. Novel algorithms are needed to be robust, scalable, efficient...

chapter

Cat Swarm Optimization for Clustering

B. Santosa, M.K. Ningrum

2009 International Conference of Soft Computing and Pattern Recognition > 54 - 59

2009 International Conference of Soft Computing and Pattern Recognition

Cat swarm optimization (CSO) is one of the new heuristic optimization algorithms based on swarm intelligence. Previous research shows that this algorithm has better performance compared to other heuristic optimization algorithms, namely particle swarm optimization (PSO) and weighted-PSO, in the cases of function minimization. In this research a new CSO algorithm for the clustering problem is proposed...

chapter

Clustering microarray time-series data using expectation maximization and multiple profile alignment

N. Subhani, L. Rueda, A. Ngom, C.J. Burden

2009 IEEE International Conference on Bioinformatics and Biomedicine Workshop > 2 - 7

2009 IEEE International Conference on Bioinformatics and Biomedicine Workshop, BIBMW

A common problem in biology is to partition a set of experimental data into clusters in such a way that the data points within the same cluster are highly similar while data points in different clusters are very different. In this direction, clustering microarray time-series data via pairwise alignment of piece-wise linear profiles has been recently introduced. We propose an EM clustering approach...

chapter

Multiagent Approach for Identifying Cancer Biomarkers

A. Qabaja, M. Alshalalfa, R. Alhajj, J. Rokne

2009 IEEE International Conference on Bioinformatics and Biomedicine > 228 - 233

2009 IEEE International Conference on Bioinformatics and Biomedicine. BIBM 2009

This paper addresses an important and vital problem within the general area of disease recognition, namely identifying disease biomarker genes. Given the complexity of this domain, the basic idea tackled in this paper is employing multiple agents to handle this problem. Though the developed methodology is general enough to be applied to any other domain, we concentrate on identifying cancer biomarkers...

chapter

A cooperative feature gene extraction algorithm that combines classification and clustering

Chi Kin Chow, Hailong Zhu, J. Lacy, M.W. Lingen, et al.

2009 IEEE International Conference on Bioinformatics and Biomedicine Workshop > 197 - 202

2009 IEEE International Conference on Bioinformatics and Biomedicine Workshop, BIBMW

In feature gene selection, the filtering model concerns classification accuracy while ignoring the gene redundancy problem. On the other hand, gene clustering finds correlated genes without considering their predictive abilities. It is valuable to enhance their performance with the help of each other. We report a new feature gene extraction algorithm, namely double-thresholding extraction of feature gene (DEFG),...

chapter

A novel random fuzziness clustering with entropy criterion

Nianyun Shi, Liang Yan, Jiuyun Xu, Youxiang Duan

2009 IEEE International Conference on Intelligent Computing and Intelligent Systems > 3 > 279 - 283

2009 IEEE International Conference on Intelligent Computing and Intelligent Systems (ICIS 2009)

As a newly proposed clustering algorithm based on a random fuzziness model, RFKM has improved performance compared with other fuzzy clustering algorithms. However, the low mobility of accuracy will lead to a locally optimal solution. To solve this problem, we present an Entropy-based RFKM (ERFKM) algorithm. Meanwhile, in order to better facilitate the optimal operation of the ERFKM, this paper applies entropy...

chapter

A Sensitivity Clustering Method for Memetic Training of Radial Basis Function Neural Networks

F. Fernandez-Navarro, P.A. Gutierrez, C. Hervas-Martinez

2009 Ninth International Conference on Intelligent Systems Design and Applications > 187 - 192

2009 Ninth International Conference on Intelligent Systems Design and Applications (ISDA 2009)

In this paper, we propose a memetic algorithm (MA) for classifier optimization based on a clustering method that applies the k-means algorithm over a specific derived space. In this space, each classifier or individual is represented by the set of the accuracies of the classifier for each class of the problem. The proposed sensitivity clustering is able to obtain groups of individuals that perform...

chapter

A three-step clustering algorithm over an evolving data stream

Liu Li-xiong, Kang Jing, Guo Yun-fei, Huang Hai

2009 IEEE International Conference on Intelligent Computing and Intelligent Systems > 1 > 160 - 164

2009 IEEE International Conference on Intelligent Computing and Intelligent Systems (ICIS 2009)

Distinguishing potential new cluster data from outliers is a main problem in mining new patterns from evolving data streams. Meanwhile, all the clustering algorithms inherited from the CluStream framework use distribution-based learning realized via a sliding window, so this problem becomes more obvious. This paper proposes a three-step clustering algorithm, rDenStream, based on DenStream, which...

chapter

An Approach of Finding Localized Preferences Based-On Clustering for Collaborative Filtering

Zhang Liang, Xiao Bo, Guo Jun

2009 International Conference on Web Information Systems and Mining > 19 - 22

2009 International Conference on Web Information Systems and Mining (WISM 2009)

Collaborative filtering has been very successful in both research and applications. Current clustering-based collaborative filtering computes over the whole set of items during clustering or nearest-neighbor selection, because researchers have believed that if users have similar preferences on some items, they will have similar preferences on other items. But we think that users have...

chapter

A Methodology for Clustering XML Documents Based on Labeled Tree

Lei Liu, Yongqing Zheng, Baoshi Ding, Haiyan Liu

2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery > 1 > 397 - 401

2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2009)

The number of XML documents is increasing rapidly. In order to analyze the information represented in XML documents efficiently, research on XML document clustering is actively in progress. The key issue is how to devise the similarity measure between XML documents to be used for clustering. Since XML documents have hierarchical structure, it is not appropriate to cluster them by using a general...

chapter

Towards automatic identification of network applications

Yu Wang, Shun-Zheng Yu

2009 ISECS International Colloquium on Computing, Communication, Control, and Management > 2 > 222 - 225

2009 ISECS International Colloquium on Computing, Communication, Control, and Management (CCCM)

Traditional application identification based on port numbers has become increasingly inaccurate. A more accurate alternative is to inspect the application payloads of traffic flows. The main drawback of such a method is that target applications must be manually analyzed beforehand. Another alternative is to exploit the distinctive statistical properties of traffic flows and apply machine learning techniques...

chapter

A new framework for an adaptive classifier model

Iltae Lee, K. Kianmehr, N. Koochakzadeh, R. Alhajj, et al.

2009 IEEE International Conference on Information Reuse & Integration > 138 - 144

2009 IEEE International Conference on Information Reuse & Integration (IRI 2009)

In this paper, a new framework to build an adaptive classifier is introduced. At first, a clustering algorithm, density-based spatial clustering of applications with noise (DBSCAN), is applied to a set of sample data to form an initial set of clusters. The clusters are represented as classes. Using a support vector machine (SVM), a classifier model is generated. In real world application, data comes in...

chapter

Character Recognition under Severe Perspective Distortion

Peng Zhou, Linlin Li, Chew Lim Tan

2009 10th International Conference on Document Analysis and Recognition > 676 - 680

2009 10th International Conference on Document Analysis and Recognition (ICDAR)

Perspective deformation is one of the main issues that needs to be addressed in real-scene character recognition. An effective recognition approach, which is able to handle severe perspective deformation, is to employ cross ratio spectrum and dynamic time warping techniques. However, this solution suffers from a time complexity of O(n^4). In this paper, a clustering based indexing method is proposed to...

chapter

Rules Extraction from ANN Based on Clustering

Jie Ma, Dongwei Guo, Miao Liu, Yu Ma, et al.

2009 International Conference on Computational Intelligence and Natural Computing > 2 > 19 - 21

2009 International Conference on Computational Intelligence and Natural Computing (CINC)

We propose a novel algorithm based on clustering to extract rules from artificial neural networks. After the networks have been trained and pruned successfully, inner-rules are generated by discrete activation values of hidden units. Then, weights between input and hidden units are clustered to decrease the complexity of rules extraction. In the clustering phase, the clustered number of weights can be adjusted...

chapter

Learning Scaling Coefficient in Possibilistic Latent Variable Algorithm from Complex Diagnosis Data

Zong-Xian Yin

2009 Ninth IEEE International Conference on Bioinformatics and BioEngineering > 341 - 343

2009 Ninth IEEE International Conference on Bioinformatics and BioEngineering (BIBE)

The Possibilistic Latent Variable (PLV) clustering algorithm is a powerful tool for the analysis of complex datasets due to its robustness toward data distributions of different types and its ability to accurately identify the inherent clusters within the data. The scaling coefficient in the PLV algorithm plays a key role in reducing the effects of noise, thereby improving the precision of the clustering...

chapter

Clustering for music search results

Yi-Hsuan Yang, Yu-Ching Lin, H. Chen

2009 IEEE International Conference on Multimedia and Expo > 874 - 877

2009 IEEE International Conference on Multimedia and Expo (ICME)

Clustering for better representation of the diversity of text or image search results has been studied extensively. In this paper, we extend this methodology to the novel domain of music search. We conduct empirical evaluation of different clustering algorithms, audio feature representations, and the incorporation of lyrics for music clustering. Our evaluation shows the fusion of audio and text features...

INFONA - science communication portal
