Search results

article

A Comparative Study of Predicting DBH and Stem Volume of Individual Trees in a Temperate Forest Using Airborne Waveform LiDAR

Jianwei Wu, Wei Yao, Sungho Choi, Taejin Park, more

IEEE Geoscience and Remote Sensing Letters > 2015 > 12 > 11 > 2267 - 2271

Using airborne full-waveform LiDAR metrics derived by 3-D tree segmentation, this study estimated single tree's diameter at breast height (DBH) and stem volume (STV). Four regression models were used, including multilinear regression and three up-to-date regression models (i.e., least square boosting trees regression, random forest, and

$\varepsilon$

-support vector regression) from the machine learning...

chapter

A learning-based approach for Romanian syllabification and stress assignment

Diana Balc, Anamaria Beleiu, Rodica Potolea, Camelia Lemnaru

2015 IEEE International Conference on Intelligent Computer Communication and Processing (ICCP) > 37 - 42

2015 IEEE International Conference on Intelligent Computer Communication and Processing (ICCP)

This paper tackles the Romanian syllabification and stress assignment problems, and proposes an efficient machine learning based solution. We show that by designing the appropriate feature sets for each specific problem, learning algorithms achieve satisfactory accuracy rates for both problems (∼92% for syllabification, ∼85% for stress assignment), even for relatively small training set sizes. We...

chapter

An effective hybridized classifier for breast cancer diagnosis

Dishant Mittal, Dev Gaurav, Sanjiban Sekhar Roy

2015 IEEE International Conference on Advanced Intelligent Mechatronics (AIM) > 1026 - 1031

2015 IEEE International Conference on Advanced Intelligent Mechatronics (AIM)

After lung cancer, breast cancer is known to be the greatest cause for death among females [20]. The improving effectiveness of machine learning approaches is being given a lot of importance by medical practitioners for breast cancer diagnosis. The paper proposes an effective hybridized classifier for breast cancer diagnosis. The classifier is made by combining an unsupervised artificial neural network...

chapter

Network traffic classification — A comparative study of two common decision tree methods: C4.5 and Random forest

Alhamza Munther, Alabass Alalousi, Shahrul Nizam, Rozmie R. Othman, more

2014 2nd International Conference on Electronic Design (ICED) > 210 - 214

2014 2nd International Conference on Electronic Design (ICED)

Network traffic classification gains continuous interesting while many applications emerge on the different kinds of networks with obfuscation techniques. Decision tree is a supervised machine learning method used widely to identify and classify network traffic. In this paper, we introduce a comparative study focusing on two common decision tree methods namely: C4.5 and Random forest. The study offers...

chapter

Online Classification for Time-Domain Astronomy

Kitty K. Lo, Tara Murphy, Umaa Rebbapragada, Kiri Wagstaff

2013 IEEE 13th International Conference on Data Mining Workshops > 24 - 31

2013 IEEE 13th International Conference on Data Mining Workshops (ICDMW)

The advent of synoptic sky surveys has spurred the development of techniques for real-time classification of astronomical sources in order to ensure timely follow-up with appropriate instruments. Previous work has focused on algorithm selection or improved light curve representations, and naively convert light curves into structured feature sets without regard for the time span or phase of the light...

chapter

Human Activity Recognition for Physical Rehabilitation

Daniel Leightley, John Darby, Baihua Li, Jamie S. McPhee, more

2013 IEEE International Conference on Systems, Man, and Cybernetics > 261 - 266

2013 IEEE International Conference on Systems, Man and Cybernetics (SMC 2013)

The recognition of human activity is a challenging topic for machine learning. We present an analysis of Support Vector Machines (SVM) and Random Forests (RF) in their ability to accurately classify Kinect kinematic activities. Twenty participants were captured using the Microsoft Kinect performing ten physical rehabilitation activities. We extracted the kinematic location, velocity and energy of...

chapter

Network traffic clustering using Random Forest proximities

Yu Wang, Yang Xiang, Jun Zhang

2013 IEEE International Conference on Communications (ICC) > 2058 - 2062

ICC 2013 - 2013 IEEE International Conference on Communications

The recent years have seen extensive work on statistics-based network traffic classification using machine learning (ML) techniques. In the particular scenario of learning from unlabeled traffic data, some classic unsupervised clustering algorithms (e.g. K-Means and EM) have been applied but the reported results are unsatisfactory in terms of low accuracy. This paper presents a novel approach for...

chapter

An empirical study to address the problem of Unbalanced Data Sets in sentiment classification

Asmaa Mountassir, Houda Benbrahim, Ilham Berrada

2012 IEEE International Conference on Systems, Man, and Cybernetics (SMC) > 3298 - 3303

2012 IEEE International Conference on Systems, Man and Cybernetics - SMC

With the emergence of Web 2.0, Sentiment Analysis is receiving more and more attention. Several interesting works were performed to address different issues in Sentiment Analysis. Nevertheless, the problem of Unbalanced Data Sets was not enough tackled within this research area. This paper presents the study we have carried out to address the problem of unbalanced data sets in supervised sentiment...

chapter

Dependance of critical dimension on learning machines and ranking methods

Divya Suryakumar, Andrew H. Sung, Qingzhong Liu

2012 IEEE 13th International Conference on Information Reuse & Integration (IRI) > 738 - 739

2012 IEEE 13th International Conference on Information Reuse & Integration (IRI)

Feature reduction is a major problem in data mining. Though traditional methods such as feature ranking and subset selection have been widely used, there has been little consideration given to assuring satisfactory performance of a learning machine in relation to the minimum of features required or the “critical dimension”. This critical dimension is unique to a specific dataset, learning machine,...

chapter

Variable interaction measures with random forest classifiers

Cassidy Kelly, Kazunori Okada

2012 9th IEEE International Symposium on Biomedical Imaging (ISBI) > 154 - 157

2012 IEEE 9th International Symposium on Biomedical Imaging (ISBI 2012)

Novel variable interaction measures with random forest classifiers are proposed. The proposed methods efficiently measure the change in classification performance due to non-linear interactions between variables by exploiting random permutation of out-of-bag samples in random forests. They can be readily extended to measure n-subset interactions in multi-class bagging ensembles with any base supervised...

chapter

CudaRF: A CUDA-based implementation of Random Forests

Hakan Grahn, Niklas Lavesson, Mikael Hellborg Lapajne, Daniel Slat

2011 9th IEEE/ACS International Conference on Computer Systems and Applications (AICCSA) > 95 - 101

2011 9th IEEE/ACS International Conference on Computer Systems and Applications (AICCSA)

Machine learning algorithms are frequently applied in data mining applications. Many of the tasks in this domain concern high-dimensional data. Consequently, these tasks are often complex and computationally expensive. This paper presents a GPU-based parallel implementation of the Random Forests algorithm. In contrast to previous work, the proposed algorithm is based on the compute unified device...

chapter

Classifying Connectivity Graphs Using Graph and Vertex Attributes

Jonas Richiardi, Sophie Achard, Edward Bullmore, Dimitri Van De Ville

2011 International Workshop on Pattern Recognition in NeuroImaging > 45 - 48

2011 International Workshop on Pattern Recognition in Neuroimaging (PRNI)

Qualitative and quantitative description of functional connectivity graphs using graph attributes is of great interest to neuroscience, and has led to remarkable insights in the field. However, the statistical techniques used have generally been limited to whole-group, post-hoc studies. In this paper, we propose instead a novel approach to perform predictive inference on single subjects. It is based...

chapter

Boosted Spectral Embedding (BoSE): Applications to content-based image retrieval of histopathology

A Sridhar, S Doyle, A Madabhushi

2011 IEEE International Symposium on Biomedical Imaging: From Nano to Macro > 1897 - 1900

2011 8th IEEE International Symposium on Biomedical Imaging: From Nano to Macro (ISBI 2011)

In machine learning, non-linear dimensionality reduction (NLDR) is commonly used to embed high-dimensional data into a low-dimensional space while preserving local object adjacencies. However, the majority of NLDR methods define object adjacencies using distance metrics that do not account for the quality of the features in the high-dimensional space. In this paper we present Boosted Spectral Embedding...

chapter

Using per-Source measurements to improve performance of Internet traffic classification

S Bregni, D Lucerna, C Rottondi, G Verticale

2010 IEEE Latin-American Conference on Communications > 1 - 5

2010 IEEE Latin-American Conference on Communications (LATINCOM)

Obfuscated and encrypted protocols hinder traffic classification by classical techniques such as port analysis or deep packet inspection. Therefore, there is growing interest for classification algorithms based on statistical analysis of the length of the first packets of flows. Most classifiers proposed in literature are based on machine learning techniques and consider each flow independently of...

chapter

Automated analysis of Human Protein Atlas immunofluorescence images

J.Y. Newberg, Jieyue Li, A. Rao, F. Ponten, more

2009 IEEE International Symposium on Biomedical Imaging: From Nano to Macro > 1023 - 1026

2009 IEEE International Symposium on Biomedical Imaging: From Nano to Macro (ISBI)

The Human Protein Atlas is a rich source of location proteomics data. In this work, we present an automated approach for processing and classifying major subcellular patterns in the Atlas images. We demonstrate that two different classification frameworks (support vector machine and random forest) are effective at determining subcellular locations; we can analyze over 3500 Atlas images with a high...

chapter

Application of Random Forest in Predicting Fault-Prone Classes

A. Kaur, R. Malhotra

2008 International Conference on Advanced Computer Theory and Engineering > 37 - 43

2008 International Conference on Advanced Computer Theory and Engineering (ICACTE)

There are available metrics for predicting fault prone classes, which may help software organizations for planning and performing testing activities. This may be possible due to proper allocation of resources on fault prone parts of the design and code of the software. Hence, importance and usefulness of such metrics is understandable, but empirical validation of these metrics is always a great challenge...

chapter

Semi-supervised method for gene expression data classification with Gaussian fields and harmonic functions

Yun-Chao Gong, Chuan-Liang Chen

2008 19th International Conference on Pattern Recognition > 1 - 4

ICPR 2008 19th International Conference on Pattern Recognition

In real world applications, there are great many of DNA expressed microarray data, many supervised classification algorithms such as decision tree, KNN and SVM in the machine learning field have been introduced for microarray data classification. However, in real worlds, the labeled examples, especially gene expression data examples are often very difficult and expensive to obtain. The traditional...

INFONA - science communication portal

Search results

A Comparative Study of Predicting DBH and Stem Volume of Individual Trees in a Temperate Forest Using Airborne Waveform LiDAR

A learning-based approach for Romanian syllabification and stress assignment

An effective hybridized classifier for breast cancer diagnosis

Network traffic classification — A comparative study of two common decision tree methods: C4.5 and Random forest

Online Classification for Time-Domain Astronomy

Human Activity Recognition for Physical Rehabilitation

Network traffic clustering using Random Forest proximities

An empirical study to address the problem of Unbalanced Data Sets in sentiment classification

Dependance of critical dimension on learning machines and ranking methods

Variable interaction measures with random forest classifiers

CudaRF: A CUDA-based implementation of Random Forests

Classifying Connectivity Graphs Using Graph and Vertex Attributes

Boosted Spectral Embedding (BoSE): Applications to content-based image retrieval of histopathology

Using per-Source measurements to improve performance of Internet traffic classification

Automated analysis of Human Protein Atlas immunofluorescence images

Application of Random Forest in Predicting Fault-Prone Classes

Semi-supervised method for gene expression data classification with Gaussian fields and harmonic functions

Filter options

Publication date

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options