Biclustering Gene Expression Data Using MSR Difference Threshold

S. Das; S.M. Idicula

doi:10.1109/INDCON.2009.5409395

Source

2009 Annual IEEE India Conference > 1 - 4

Abstract

Biclustering is simultaneous clustering of both rows and columns of a data matrix. A measure called mean squared residue (MSR) is used to simultaneously evaluate the coherence of rows and columns within a submatrix. In this paper a novel algorithm is developed for biclustering gene expression data using the newly introduced concept of MSR difference threshold. In the first step high quality bicluster seeds are generated using K-means clustering algorithm. Then more genes and conditions (node) are added to the bicluster. Before adding a node the MSR X of the bicluster is calculated. After adding the node again the MSR Y is calculated. The added node is deleted if Y minus X is greater than MSR difference threshold or if Y is greater than MSR threshold which depends on the dataset. The MSR difference threshold is different for gene list and condition list and it depends on the dataset also. Proper values should be identified through experimentation in order to obtain biclusters of high quality. The results obtained on bench mark dataset clearly indicate that this algorithm is better than many of the existing biclustering algorithms.

Identifiers

book ISBN :	978-1-4244-4858-6
book e-ISBN :	978-1-4244-4859-3
DOI	10.1109/INDCON.2009.5409395

Keywords

pattern clustering biology computing data mining genetics biclustering gene expression data mean squared residue difference threshold K-means clustering algorithm Clustering algorithms Gene expression Computational modeling Evolutionary computation Bioinformatics Coherence

Additional information

Data set: ieee

Publisher

IEEE

INFONA - science communication portal

Biclustering Gene Expression Data Using MSR Difference Threshold

Source

Abstract

Identifiers

Authors

Das, S.

Idicula, S.M.

Keywords

Additional information

Publisher


Assign to other user
	×
Wrong email address

INFONA - science communication portal

Biclustering Gene Expression Data Using MSR Difference Threshold $("#expandableTitles").expandable();

Source

Abstract

Identifiers

Authors

User assignment

Assignment remove confirmation

You're going to remove this assignment. Are you sure?

Das, S.

Idicula, S.M.

Keywords

Additional information

Publisher

Share

Export to bibliography

Reporting an error / abuse

Sending the report failed

Accessibility options

Biclustering Gene Expression Data Using MSR Difference Threshold