Nonlinear Dimensionality Reduction by Isomap and MLEdim as Applied to Amino-Acid Distribution in Yeast ORFs

A. Bartkowiak

doi:10.1109/CISIM.2008.44

Source

2008 7th Computer Information Systems and Industrial Management Applications > 183 - 188

Abstract

We consider the multivariate distribution of amino-acids coding for proteins in Open Reading Frames (ORFs). An appropriate statistical model of this distribution might throw some light on the interdependency of the 20 amino-acids and contribute to the problem of verification of known ORFs (At the date 3. April 2008 only 71.02\% of known ORFs were verified). From a graphical analysis od the data we deduce that the data cloud mightbe modelled by a curvilinear manifold of smaller dimension embedded in a larger, 20-dimensional space. To check that assumption we have applied to the recorded data (containing frequency of appearing 20 amino-acids in ORFs found in the 7th yeast chromosome) two nonlinear methods referred to as the Isomap (Tennenbaum et al., 2000 ) and MLEdim (Levina and Bickel, 2005). These two methods, based on complete different principles, gave similar results: the true 'intrinsic' dimension of the investigated data appears several dimensions smaller as originally supposed.

Identifiers

book ISBN :	978-0-7695-3184-7
DOI	10.1109/CISIM.2008.44

Keywords

statistical distributions biology computing data reduction genetics proteins genetic code nonlinear dimensionality reduction protein amino-acid multivariate distribution yeast open reading frame statistical model graphical analysis Isomap MLEdim Distance measurement Reactive power Distributed databases Standardization Manifolds Biological cells MLEdim estimator intrinsic dimension reduction of dimensionality Open Reading Frames (ORFs) in yeast

Additional information

Data set: ieee

Publisher

IEEE

INFONA - science communication portal

Nonlinear Dimensionality Reduction by Isomap and MLEdim as Applied to Amino-Acid Distribution in Yeast ORFs

Source

Abstract

Identifiers

Authors

Bartkowiak, A.

Keywords

Additional information

Publisher


Assign to other user
	×
Wrong email address

INFONA - science communication portal

Nonlinear Dimensionality Reduction by Isomap and MLEdim as Applied to Amino-Acid Distribution in Yeast ORFs $("#expandableTitles").expandable();

Source

Abstract

Identifiers

Authors

User assignment

Assignment remove confirmation

You're going to remove this assignment. Are you sure?

Bartkowiak, A.

Keywords

Additional information

Publisher

Share

Export to bibliography

Reporting an error / abuse

Sending the report failed

Accessibility options

Nonlinear Dimensionality Reduction by Isomap and MLEdim as Applied to Amino-Acid Distribution in Yeast ORFs