Search results

chapter

A novel peak alignment method for LC-MS data analysis using cluster-based techniques

Yu-Cheng Liu, Lien-Chin Chen, Hui-Yin Chang, Hsin-Yi Wu, more

2010 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW) > 525 - 530

2010 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW 2010)

Recently, liquid chromatography coupled to mass spectrometry (LC-MS) has become a standard technique for identifying differential abundance of peaks as biomarkers. Two major problems in the preprocessing of LC-MS data analysis are how to adjust and align multiple LC-MS datasets efficiently and correctly. Hence, an effective algorithm is needed to adjust the variation in retention time and align protein...

chapter

Performance improvement in automatic gender identification using hierarchical clustering

M A Keyvanrad, M M Homayounpour

2010 5th International Symposium on Telecommunications > 900 - 903

2010 5th International Symposium on Telecommunications (IST)

In this paper a hierarchical structure is proposed for automatic gender identification (AGI). Two clustering techniques are used in this structure. The first is divisive clustering, which divides the speakers of each gender into several classes of speakers. The second is agglomerative clustering, which creates a hierarchical structure. Feature reduction is done by SOAP feature...

chapter

An efficient K-Means clustering algorithm for reducing time complexity using uniform distribution data points

D Napoleon, P G Lakshmi

Trendz in Information Sciences & Computing (TISC 2010) > 42 - 45

2nd International Conference on Trendz in Information Sciences & Computing (TISC 2010)

Data mining has been defined as "The nontrivial extraction of implicit, previously unknown, and potentially useful information from data". Clustering is the automated search for groups of related observations in a data set. The K-Means method is one of the most commonly used clustering techniques for a variety of applications. This paper proposes a method for making the K-Means algorithm...

chapter

An improved clustering technique based on statistical model preprocessing for gene expression dataset

N Tajunisha, V Saravanan

Trendz in Information Sciences & Computing (TISC 2010) > 46 - 49

2nd International Conference on Trendz in Information Sciences & Computing (TISC 2010)

Data mining has become an important topic in effective analysis of gene expression data due to its wide application in the biomedical industry. Within a gene expression matrix there are usually several particular macroscopic phenotypes of samples. Selection of genes most relevant and informative for certain phenotypes is an important aspect in gene expression analysis. Currently most of the research...

chapter

A Hybrid Method for XML Clustering

Yong Piao, Chen Liu, Xiu-kun Wang

2010 3rd International Symposium on Parallel Architectures, Algorithms and Programming > 286 - 290

Third International Symposium on Parallel Architectures, Algorithms and Programming (PAAP 2010)

An effective XML cluster method called neighbor center clustering algorithm (NCC) is presented in this paper, whose similarity is obtained through both structural and content information contained in XML files. Structural similarity is measured by the idea of Longest Common Subsequence, while content similarity is achieved using TF-IDF principles. It reduces computation complexity by avoiding direct...

chapter

Gene clustering by structural prior based local factor analysis model under Bayesian Ying-Yang harmony learning

Lei Shi, Shikui Tu, Lei Xu

2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 696 - 699

2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM 2010)

We propose a clustering algorithm based on a structural-prior-based Local Factor Analysis (spLFA) model under Bayesian Ying-Yang harmony learning, which automatically determines the hidden dimensionalities during parameter learning, reduces the number of free parameters by projecting the mean vectors onto a low-dimensional manifold, and imposes sparseness through a Normal-Jeffreys prior. Experiments...

chapter

Improved varied density based spatial clustering algorithm with noise

S Vijayalakshmi, M Punithavalli

2010 IEEE International Conference on Computational Intelligence and Computing Research > 1 - 4

2010 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC 2010)

VDBSCAN is a well-known density-based clustering algorithm. Handling highly dense data points is a challenging task in clustering. The VDBSCAN algorithm handles data points of widely varied density well and also overcomes the problem of noise and outliers. However, this algorithm depends on the input parameters Eps and MinPts. The careful selection of these input parameters plays an important role in proper...

chapter

A self-growing Bayesian network classifier for online learning of human motion patterns

Zhuo Chen, N H C Yung

2010 International Conference of Soft Computing and Pattern Recognition > 182 - 187

2010 International Conference of Soft Computing and Pattern Recognition (SoCPaR 2010)

This paper proposes a new self-growing Bayesian network classifier for online learning of human motion patterns (HMPs) in dynamically changing environments. The proposed classifier is designed to represent HMP classes based on a set of historical trajectories labeled by unsupervised clustering. It then assigns HMP class labels to current trajectories. Parameters of the proposed classifier are recalculated...

chapter

An Improved Data Clustering Algorithm for Mining Web Documents

O H Odukoya, G A Aderounmu, E R Adagunodo

2010 International Conference on Computational Intelligence and Software Engineering > 1 - 8

2010 International Conference on Computational Intelligence and Software Engineering (CiSE 2010)

This paper formulates, simulates and assesses an improved data clustering algorithm for mining web documents with a view to preserving their conceptual similarities and eliminating the problem of speed while increasing accuracy. The improved data clustering algorithm was formulated using the concept of the K-means algorithm. Real and artificial datasets were used to test the proposed and existing algorithm...

chapter

Experimental Research on Impacts of Dimensionality on Clustering Algorithms

Hai-Dong Meng, Jin-Hui Ma, Guan-Dong Xu

2010 International Conference on Computational Intelligence and Software Engineering > 1 - 4

2010 International Conference on Computational Intelligence and Software Engineering (CiSE 2010)

Experiments are carried out on datasets with different dimensions selected from the UCI datasets by using two classical clustering algorithms. The results of the experiments indicate that when the dimensionality of the real dataset is less than or equal to 30, the distance-based clustering algorithms are effective. For high-dimensional datasets (dimensionality greater than 30), the clustering algorithms...

chapter

Clustering-based methodology with minimal user supervision for displaying cell-phenotype signatures in image-based screening

William-Chandra Tjhi, Lee Kee Khoon, T Hung, Ong Yew Soon, more

2010 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW) > 252 - 257

2010 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW 2010)

Most quantitative cell image-based screening analyses are dependent on thorough user supervision based on assay-specific knowledge. To minimize human bias in analysis, we introduce an automated methodology of displaying screen phenotypes using clustering that provides intuitive visuals to guide user supervision when required. Our premise is to automatically present to users an overview of screen phenotype-contents...

chapter

Hard and soft updating centroids for clustering Y-short tandem repeats (Y-STR) data

A Seman, Z A Bakar, N Daud

2010 IEEE Conference on Open Systems (ICOS 2010) > 6 - 11

2010 IEEE Conference on Open Systems (ICOS 2010)

This paper compares hard and soft updating centroids for clustering Y-STR data. The hard centroids are represented by the New Fuzzy k-Modes clustering algorithm, whereas the soft centroids are represented by the k-Population algorithm. These two algorithms are tested on two datasets, Y-STR haplogroups and Y-STR Surnames. The results show that the soft centroid performance is better than the hard centroid...

chapter

Supervised gene clustering for extraction of discriminative features from microarray data

C Das, P Maji, S Chattopadhyay

2010 Annual IEEE India Conference (INDICON) > 1 - 4

2010 Annual IEEE India Conference (INDICON 2010)

Among the large number of genes presented in microarray data, only a small fraction of them are effective for performing a certain diagnostic test. However, it is very difficult to identify these genes for disease diagnosis. In this regard, a new supervised gene clustering algorithm is proposed to cluster genes from microarray data. The proposed method directly incorporates the information of response...

chapter

Missing Features Restoration Using Clustering Methods

H T Rassem, P N Girija

2010 Sixth International Conference on Signal-Image Technology and Internet Based Systems > 123 - 126

Sixth International Conference on Signal-Image Technology & Internet-Based Systems (SITIS 2010)

The performance of an Automatic Speech Recognition (ASR) system degrades greatly when speech is corrupted by noise. In the spectrogram representation of a speech signal, deleting the low-SNR elements leaves an incomplete spectrogram. In this case, one direction is for the speech recognizer to modify the spectrogram to restore the missing elements. In another direction speech recognizer...

chapter

Semi-supervised k-means clustering for outlier detection in mammogram classification

K Thangavel, A K Mohideen

Trendz in Information Sciences & Computing (TISC 2010) > 68 - 72

2nd International Conference on Trendz in Information Sciences & Computing (TISC 2010)

Detecting outliers and relevant features is the most important step before classification. In this paper, a novel semi-supervised k-means clustering is proposed for outlier detection in mammogram classification. Initially the shape features are extracted from the digital mammograms, and k-means clustering is applied to cluster the features; the number of clusters is equal to the number of...

chapter

Constrained Nonnegative Tensor Factorization for Clustering

Wei Peng

2010 Ninth International Conference on Machine Learning and Applications > 954 - 957

2010 Ninth International Conference on Machine Learning and Applications (ICMLA 2010)

Constrained clustering through matrix factorization has been shown to largely improve clustering accuracy by incorporating prior knowledge into the factorization process. Although this approach has been well studied, none of the existing methods deals with constrained multi-way data factorization. Multi-way data, or tensors, are encoded as high-order data structures. They can be seen as a generalization of matrices. One typical...

chapter

Multi-view Clustering of Visual Words Using Canonical Correlation Analysis for Human Action Recognition

B Saghafi, D Rajan

2010 Ninth International Conference on Machine Learning and Applications > 661 - 666

2010 Ninth International Conference on Machine Learning and Applications (ICMLA 2010)

In this paper we propose a novel approach for introducing semantic relations into the bag-of-words framework for recognizing human actions. We represent visual words in two different views: the original features and the document co-occurrence representation. The latter view conveys semantic relations but is large, sparse and noisy. We use canonical correlation analysis between the two views to find...

chapter

On Dynamic Selection of the Most Informative Samples in Classification Problems

Edwin Lughofer

2010 Ninth International Conference on Machine Learning and Applications > 573 - 579

2010 Ninth International Conference on Machine Learning and Applications (ICMLA 2010)

In this paper, we propose a dynamic technique for selecting the most informative samples in classification problems, which comes in two stages: the first stage conducts sample selection in batch off-line mode based on unsupervised criteria extracted from cluster partitions, and the second stage proposes an active learning scheme during on-line adaptation of classifiers in non-stationary environments. This...

chapter

MCW: A new weighting method for linear combination of regressors in WSNs

Hadi Shakibian, Nasrollah Moghadam Charkari

2010 5th International Symposium on Telecommunications > 231 - 236

2010 5th International Symposium on Telecommunications (IST)

The ultimate goal in a multiple classifier system (MCS) is to obtain a global and more accurate model through the combination of several base learners. Among the popular combining rules, averaging has been emphasized as a well qualified option. The averaging rule can be applied with equal (simple averaging) or non-equal (weighted averaging) weights vector for the linear combination. When the formed...

chapter

Enhancing GSOM text clustering with Latent Semantic Analysis

S Matharage, D Alahakoon

2010 Fifth International Conference on Information and Automation for Sustainability > 441 - 446

2010 5th International Conference on Information and Automation for Sustainability (ICIAfS)

Growing Self Organizing Map (GSOM) has proven benefits in text clustering. Latent Semantic Analysis (LSA) also has been used in text clustering to capture the latent concepts from text. This paper presents a novel combination of GSOM and LSA to improve text clustering results compared to using GSOM on its own. LSA is an inherently global algorithm that looks at trends and patterns globally and GSOM...

INFONA - science communication portal
